RedPajama replicates LLaMA dataset to build open source, state-of-the-art LLMs
![](https://venturebeat.com/wp-content/uploads/2023/04/RedPajama-landscape.png?fit=3640%2C2048&strip=all)
RedPajama, which creates fully open-source large language models, has released a 1.2 trillion token dataset following the LLaMA recipe.
![](https://i.ytimg.com/vi/bh1ADwKkYdk/maxresdefault.jpg)
RedPajama: New Open-Source LLM Reproducing LLaMA Training Dataset of over 1.2 trillion tokens
![](https://i.ytimg.com/vi/gT9Zyxq16V8/maxresdefault.jpg)
LLLMs: Local Large Language Models
![](https://s10251.pcdn.co/wp-content/uploads/2023/04/2023-Alan-D-Thompson-LLM-Emerging-Rev-0.png)
Inside language models (from GPT to Olympus) - Dr Alan D. Thompson - Life Architect
![](https://klu.ai/_next/image?url=%2F_next%2Fstatic%2Fmedia%2Fmeta-llama-2-model-card.0568a39c.jpg&w=3840&q=100)
Best Open Source LLMs of 2024 - Klu
![](https://miro.medium.com/v2/resize:fit:1400/1*bSvJwvgboRfPxehPB3Hicw.png)
Open-Source LLM Explained: A Beginner's Journey Through Large Language Models, by ByFintech @ AI4Finance Foundation
![](https://ar5iv.labs.arxiv.org/html/2311.17035/assets/x1.png)
[2311.17035] Scalable Extraction of Training Data from (Production) Language Models
![](https://www.playstationlifestyle.net/wp-content/uploads/sites/9/2023/11/Purearts-Promo_Eredin_04.jpg?w=1024)
🎮 Replica News
![](https://miro.medium.com/v2/resize:fit:1400/0*DWyi_UKzt1klFtMV.png)
Beyond LLaMA: The Power of Open LLMs, by Cameron R. Wolfe, Ph.D.
![](https://substackcdn.com/image/fetch/w_1200,h_600,c_fill,f_jpg,q_auto:good,fl_progressive:steep,g_auto/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25e8acf8-0545-4d7e-a296-d612b2fb9270_6804x4846.png)
The Latest Open Source LLMs and Datasets