YaLM

YaLM 100B is a GPT-like neural network for generating and processing text. Developers and researchers worldwide can use it freely. The model was trained for 65 days on a cluster of 800 A100 graphics cards, using 1.7 TB of online texts, books, and many other sources in both English and Russian. It gives researchers and developers an enterprise-scale tool for tackling complex natural language processing problems.
Training details and best practices for acceleration and stabilization are described in articles on Medium (English) and Habr (Russian). The model is published under the Apache 2.0 license, which permits both research and commercial use.
