RedPajama Develops Open-Source and Cutting-Edge LLMs with LLaMA

Date:

This article discusses RedPajama, a collaborative project between Together, Ontocord.ai, ETH DS3Lab, Stanford CRFM, Hazy Research, and MILA Québec AI Institute to create leading, open-source large language models. This project was begun with a 1.2 trillion token dataset that follows the LLaMA recipe. This data enables any organization to pre-train models that can be permissively licensed.

It is the same procedure the creators of the LLaMA model went through, however they did not release their dataset. The RedPajama team followed the same recipe and recreated a dataset from scratch to provide organizations full access to open-source language models. Vipul Ved Prakash, founder and CEO of Together and previously co-founder of Cloudmark and Topsy, emphasized the importance of providing open-source models that are commercially viable.

The open source AI debate has recently stirred up conversations about competition among corporations and the ethical concerns. Companies such as OpenAI insist that the level of access needs to be governed for organizations to maintain their lead. On the other hand, Databricks, released Dolly 2.0, which is the first open, instruction-following LLM for commercial use.

The RedPajama project attempts to address both of these perspectives as the models are open-source yet commercially viable. It is hoped that the data and script availability could lead to a broader level of research to improve AI models and applications. In addition, the models are trained on openly available data and should not reproduce the training data.

Lastly, the team working on this project, namely Chris Re – co-founder of Together, Stanford associate professor, and co-founder of SambaNova, Snorkel.ai and Factory – and Vipul Ved Prakash are pushing for open access to large language models and software systems. This gives organizations, both big and small, equitable access to the same tools, which otherwise may be out of reach.

See also  Developers Willing to Use AI Tools Despite Trust Issues, Says Stack Overflow Survey

Frequently Asked Questions (FAQs) Related to the Above News

Please note that the FAQs provided on this page are based on the news article published. While we strive to provide accurate and up-to-date information, it is always recommended to consult relevant authorities or professionals before making any decisions or taking action based on the FAQs or the news article.

Share post:

Subscribe

Popular

More like this
Related

Obama’s Techno-Optimism Shifts as Democrats Navigate Changing Tech Landscape

Explore the evolution of tech policy from Obama's optimism to Harris's vision at the Democratic National Convention. What's next for Democrats in tech?

Tech Evolution: From Obama’s Optimism to Harris’s Vision

Explore the evolution of tech policy from Obama's optimism to Harris's vision at the Democratic National Convention. What's next for Democrats in tech?

Tonix Pharmaceuticals TNXP Shares Fall 14.61% After Q2 Earnings Report

Tonix Pharmaceuticals TNXP shares decline 14.61% post-Q2 earnings report. Evaluate investment strategy based on company updates and market dynamics.

The Future of Good Jobs: Why College Degrees are Essential through 2031

Discover the future of good jobs through 2031 and why college degrees are essential. Learn more about job projections and AI's influence.