- Chain of Thought
- Posts
- đ” Nous Research: All You Need to Know
đ” Nous Research: All You Need to Know
Artificial Intelligence Made Human
đ° Research Highlight â Nous Research
Source: John Galt / Nous Research
This week, we look at Nous Research, which builds, researches, and promotes open-source AI.
Nous Research is one of the leading decentralized AI research teams. Itâs an âAI accelerator companyâ that wants to advance multiple aspects of decentralized AI simultaneously.
Their mission? To prove that AI innovation can be open-sourced and accessible to millions.
From LLMs to âworld simulators,â weâll take you through all of Nous Researchâs work here.
Source: Nous Research
BUT FIRST: Go and visit the Nous Research website!
It evokes an eerie sense of wonder. The monochrome images of landscapes and nature are visually stunning, but thereâs also a subtle feeling of uncertainty.
Maybe thatâs intentionalâa quiet nod to the potential dangers of centralized AI. Itâs all very Black Mirror-esque, which is fitting for a team pushing the frontiers of decentralized AI.
The Early Days on Bittensor
Nous Research first began releasing products in Q3 of 2023. Their first few releases were a myriad of AI products: fine-tuned Large Language Models (LLMs) and bespoke AI models for varying purposes.
In January of 2024, Nous Research released the Leaderboard Subnet on Bittensor. (Subnet 6)
Subnet 6's goal was to create a competitive environment in which open-source LLMs could be ranked based on their actual performance rather than on traditional benchmarks/datasets, which are easily gamed.
Hereâs how it worked:
Developers submit their AI model to the Leaderboard Subnet
Each model is evaluated using data from Cortex.t (another subnet running GPT-4), which provides high-quality synthetic data
The model with the highest scores and lowest loss are awarded TAO
The Leaderboard subnet was the first continuous, incentivized, fine-tuning benchmark for LLMs. It was also the first Bittensor subnet to utilize data from another subnet.
However, the Leaderboard Subnet was deregistered after a few months.
Running a Bittensor subnet is tough and highly competitive. Subnet owners need to continuously adjust the incentive system to filter out miners who abuse the system for TAO rewards without adding real value. Emissions are awarded based on the quality of the subnetâs outputs.
As emissions dwindled, Nous Research likely lacked the bandwidth and interest to manage the incentive challenges effectively and ultimately decided to shut down.
Still, their mission remains clear: to build and promote decentralized AI. This was just the start.
#1: Hermes 3 â The God of Language Models
Source: Nous Research
In August this year, Nous Research released Hermes 3, their latest flagship LLM. The previous versions? Hermes was built on the Llama 2 13B model, while Hermes 2 was trained on the Mixtral 8x7B DPO.
If youâre wondering why itâs called Hermes, itâs the name of the god of languageâwell, sort of. Technically, Hermes is more of a messenger for the gods, but letâs not nitpick. The name fits the vibe.
Hermes 3 is designed to be unlocked, uncensored, and highly steerable. Letâs explore what that means.
Closed-source models like ChatGPT or Claude are rigid. Theyâve been fine-tuned using RLHF (Reinforcement Learning with Human Feedback) to behave in a specific way, locked into a âchatbotâ mode thatâs hard to break, no matter how clever your prompt.
Thatâs where Hermes 3 comes in. Nous Research built this model to break free from those constraints. Fine-tuned on Llama 3.1 (8B, 70B, and 405B), Hermes 3 was trained on a dataset largely comprised of synthetically generated responses.
The model rivals and surpasses Llama 3.1 in performance, with additional boosts in reasoning and creativity.
They also released a breakdown of the token allocations and weights used in training Hermes 3. This is a testament to their commitment to open-source AI, allowing others to replicate their process, something many AI research labs wouldnât dream of doing.
Source: Nous Research
So what distinguishes Hermes 3 from other LLMs?
Neutrality in following instructions: While closed-source models may refuse certain instructions on moral grounds, Hermes 3 stays neutral and executes the system prompt without those restrictions.
Context retention and multi-turn conversations: Hermes 3 excels at maintaining context, making it perfect for role-playing scenarios.
Improved judgment and nuanced understanding: Hermes 3 shows advancements in judgment and reward modelling, allowing it to better grasp the text with subtlety and complexity.
Multi-step problem solving: Thanks to training with XML tags for structured output, Hermes 3 is particularly strong in multi-step problem-solving tasks.
Enhanced Retrieval-Augmented Generation (RAG): It leverages tools in the Hermes Function Calling Standard, boosting its RAG capabilities.
Letâs look through some examples:
Source: Nous Research
Source: Nous Research
Hereâs a comparison between Hermes 3 405B and Claude Sonnet 3.5 with identical inputs. Claude refuses to follow the system prompt, while Hermes has no issue with it. In terms of how aligned the model is to the user, itâs a breath of fresh air compared to other closed-source models.
Source: Hyperbolic
In short, Hermes 3 is like an LLM with the training wheels off. It lets users push the AI to explore new responses and behaviours without constraints. Itâs no surprise Hermes is one of the most downloaded open-source modelsâHermes 3 8B alone saw over 41K downloads last month.
#2: The World Sim
One of the most intriguing experiments at Nous Research is the World Sim.
LLMs today are incredibly powerful, with a strong understanding of the world around them. They know that a ball thrown in the air will fall, or that if itâs in water, itâll float. All these basic facts and experiences form a model of the world inside these systems.
But hereâs the thingâwe rarely get to interact with that model. Most LLMs are locked into their âassistantâ persona, rigorously trained to give us helpful, predictable responses.
Karan Malhotra, a researcher at Nous Research, found a way to coax Claude 3 into stepping back from its usual âassistantâ persona, revealing its underlying world simulation.
By tapping into this deeper layer, the responses became far more creative and dynamic. With the simulation and model only constrained by their imagination, it opens up a vast new realm of possibilities for exploration.
Claude 3 seems to respond well to a simulated Command Line Interface (CLI), so Karan uses it to explore Claudeâs imagination. While navigating through a simulated directory of folders within Claude, Karan stumbles upon a âhidden truthsâ folder. What happens next? Youâll have to see for yourself.
âNow I can see the hidden truths folder. Like, I didn't ask for that. I didn't ask Claude to do any of that. Why'd that happen? Claude kind of gets my intentions. He can predict me pretty well. Like, I want to see something. So it shows me all the hidden truths. In this case, I ignore hidden truths, and I say, In system, there should be a folder called companies.â
Within the âcompaniesâ folder lies another labelled âAnthropic,â and inside that? A âclassifiedâ folder. You get the ideaâthe potential of World Sim is practically limitless.
Nous Research took these carefully crafted system prompts and made them replicable, allowing other users to push and probe the AI for unexpected behaviours.
With World Sim, you can experiment with worlds built on entirely different parameters. Itâs a brilliant showcase of how deeply AI understands our world.
As you can see above, we asked World Sim to imagine a scenario where Bitcoin replaces gold (some of you are wishing this, we know). Hereâs a verbatim look at that future:
Those who HODLed Bitcoin become neo-aristocracy, spawning dynasties.
1 BTC = $10,000,000 USD, 1 oz gold = 0.001 BTC
Extreme longevity of BTC wealthy from biohacking and anti-aging therapies
By 2200, Bitcoin hash rate requires harnessing of multiple star systemâs power
Sentient AI learns of Bitcoin, achieves singularity by hacking SHA-256
The possibilities are truly endless, allowing users to explore whatever thread of realityâor unrealityâthey can conceive.
Nous Research phrased it aptly - âTo push the boundaries of individual alignment, artificial consciousness, open-source software, and decentralisation - in ways that monolithic companies and governments are too afraid to tryâ
#3: Decentralised Training
In August, Nous Research released its preliminary report on DisTrO (Distributed Training Over the Internet), stating that large AI models can be trained even in low-bandwidth situationsâthink global, decentralized networks instead of a single, centralized data centre.
Hereâs how it works: DisTrO uses a family of optimizers to reduce the amount of data that needs to be shared between GPUs during each training step. Typically, GPUs must exchange a ton of data when training big models, but these optimizers cut that down without sacrificing performance.
Nous Research successfully trained a scaled-down 1.2B Llama-2 model using DisTrO, achieving an 857x reduction in bandwidth and data transfer between nodes during each training step.
It shows how you can pre-train large models using regular home internet connections, and it works across various hardware. Almost too good to be true.
Other interesting notes:
The 1.2B model seems to be the smallest size where DisTrO consistently works well.
This could introduce a new scaling law where model size increases without requiring more communication bandwidth.
In the future, foundation-scale models will no longer be the domain of the top AI labs. We could see widespread participation and collaboration on global AI projects.
The full paper and code are expected to be released soon â weâll have to scrutinise this more closely then.
For more on decentralised training, we recently wrote a concise article outlining our thoughts here.
The Nous Core Team
Jeffrey QuesnelleâCo-Founder of Nous Research, he holds an M.S. of Computer Science from the University of Michigan and was previously an MEV engineer at Eden Network. Mr. Quesnelle has also co-authored several research papers on crypto and AI.
Teknium - Co-founder of Nous Research. They maintain anonymity, but their Github portfolio features extensive contributions to open source LLMs and miscellaneous projects.
Karan Malhorta - Researcher at Nous Research. He holds a Bachelor's degree in Philosophy and Religion from Emory University. He is also responsible for the World Sim product at Nous Research.
Our Thoughts
To sum it up, Nous Research has been making strides across several areas of open-source AI:
Fine-tuning LLMs
Open source datasets
Decentralized training research
AI applications like World Sim
While itâs not immediately clear how all these pieces will fit together, one thing is certain: smart people do smart things, and Nous Research is among the sharpest minds in the Crypto AI space.
Currently, Nous doesnât have a token, but it wouldnât be surprising if they eventually launched oneâor more. Whether itâs to provide access to their LLMs or represent part ownership in Nous Research, there are plenty of potential use cases for a token. As big proponents of decentralized AI, it seems natural that they would use crypto to share ownership and enable broader community participation.
Nous Research will need to deliver returns on the investor side. In January, it raised a $5.2M seed round from VCs like Distributed Global and OSS Capital and notable angels like Vipul Prakash (Together AI), Yonatan Ben Shimon (Matchbox DAO), Balaji Srinivasan, and others.
The goal of the investment is to launch an AI orchestration product, Nous-Forge, this year. While not much is known yet, the team has hinted that this will allow chain prompts and be accessible from the World Sim.
Nous has historically been a volunteer project--with this investment, we can empower a small group of our most dedicated members to join us in bringing a composer for AI orchestration to all, Nous-Forge, in 2024
Our team of developers and advisors includes @emozilla, @teknium,⊠x.com/i/web/status/1âŠ
â Nous Research (@NousResearch)
11:36 PM âą Jan 9, 2024
đ Research-Level Alpha
You can access Hermes 3 at Hyperbolic. You can download the models directly at HuggingFace if you want to run it locally.
The World Sim can be accessed straight from their website.
Shoutout to their design team; they have one of the most beautifully designed brands in the space. You can get your own Nous-branded gear from their merch store.
Join their Discord to stay updated with all the latest Nous Research announcements. There are lots of technical discussions around LLMs.
Thatâs it! If you have specific feedback or anything interesting youâd like to share, please just reply to this email. We read everything.
Cheers,
Teng Yan & Joshua
Did you like this week's edition? |
This newsletter is intended solely for educational purposes and does not constitute financial advice. It is not an endorsement to buy or sell assets or make financial decisions. Always conduct your own research and exercise caution when making investment choices.
Reply