Compare Models
-
Microsoft, NVIDIA
MT-NLG
OTHERMT-NLG (Megatron-Turing Natural Language Generation) uses the architecture of the transformer-based Megatron to generate coherent and contextually relevant text for a range of tasks, including completion prediction, reading comprehension, commonsense reasoning, natural language inferences, and word sense disambiguation. MT-NLG is the successor to Microsoft Turing NLG 17B and NVIDIA Megatron-LM 8.3B. The MT-NLG model is three times larger than GPT-3 (530B vs 175B). Following the original Megatron work, NVIDIA and Microsoft trained the model on over 4,000 GPUs. NVIDIA has announced an Early Access program for its managed API service to the MT-NLG model for organizations and researchers. -
Google
PaLM 2 chat-bison-001
$0.0021535PaLM 2 has just launched (May 2023) and is Google’s next-generation Large Language Model, built on Google’s Pathways AI architecture. PaLM 2 was trained on a massive dataset of text and code, and it can handle many different tasks and learn new ones quickly. It is seen as a direct competitor to OpenAI’s GPT-4 model. It excels at advanced reasoning tasks, including code and math, classification and question answering, translation and multilingual proficiency (100 languages), and natural language generation better than our previous state-of-the-art LLMs, including its predecessor PaLM.PaLM 2 is the underlying model driving the PaLM API that can be accessed through Google’s Generative AI Studio. PaLM 2 has four submodels with different sizes. Bison is the best value in terms of capability and chat-bison-001 has been fine-tuned for multi-turn conversation use cases. If you want to see PaLM 2 capabilities, the simplest way to use it is through Google Bard (PaLM 2 is the technology that powers Google Bard).Watch Paige Bailey introducing PaLM 2: view here
-
ChatGLM
PaLM 2 text-bison-001
$0.004PaLM 2 has just launched (May 2023) and is Google’s next-generation Large Language Model, built on Google’s Pathways AI architecture. PaLM 2 was trained on a massive dataset of text and code, and it can handle many different tasks and learn new ones quickly. It is seen as a direct competitor to OpenAI’s GPT-4 model. It excels at advanced reasoning tasks, including code and math, classification, question answering, translation and multilingual proficiency (100 languages), and natural language generation better than our previous state-of-the-art LLMs, including its predecessor PaLM.PaLM 2 is the underlying model driving the PaLM API that can be accessed through Google’s Generative AI Studio. PaLM 2 has four submodels with different sizes. Bison is the best value in terms of capability and cost, and text-bison-001 can be fine-tuned to follow natural language instructions and is suitable for various language tasks such as classification, sentiment analysis, entity extraction, extractive question answering, summarization, re-writing text in a different style, and concept ideation.If you want to see PaLM 2 capabilities, the simplest way to use it is through Google Bard (PaLM 2 is the technology that powers Google Bard).
Watch Paige Bailey introducing PaLM 2: view here
-
Google
PaLM 2 textembedding-gecko-001
$0.0004PaLM 2 has just launched (May 2023) and is Google’s next-generation Large Language Model, built on Google’s Pathways AI architecture. PaLM 2 was trained on a massive dataset of text and code, and it can handle many different tasks and learn new ones quickly. It is seen as a direct competitor to OpenAI’s GPT-4 model. It excels at advanced reasoning tasks, including code and math, classification and question answering, translation and multilingual proficiency (100 languages), and natural language generation better than our previous state-of-the-art LLMs, including its predecessor PaLM.PaLM 2 is the underlying model driving the PaLM API that can be accessed through Google’s Generative AI Studio. PaLM 2 has four submodels with different sizes: Unicorn (the largest), Bison, Otter, and Gecko (the smallest) and the different sizes of the submodels allow PaLM 2 to be more efficient and to perform different tasks. Gecko is the smallest and cheapest model for simple tasks and textembedding-gecko-001 returns model embeddings for text inputs.If you want to see PaLM 2 capabilities, the simplest way to use it is through Google Bard (PaLM 2 is the technology that powers Google Bard).Watch Paige Bailey introducing PaLM 2: view here
-
OpenAI
text-davinci-003
$0.02Text-davinci-003 is recognized as GPT 3.5 and is a variant of the GPT-3 model. While both Davinci and text-davinci-003 are powerful models, they differ in a few key ways. Text-davinci-003 is a newer and more capable model explicitly designed for instruction-following tasks. Text-davinci-003 was trained on a more recent dataset containing data up to June 2021. It can do any language task with better quality, longer output, and consistent instruction-following than the Curie, Babbage, or Ada models. Text-davinci-003 supports a longer context window (max prompt plus completion length) than Davinci.For those requesting the OpenAI’s API, GPT-3.5-turbo may be a better choice for tasks that require high accuracy in math or zero-shot classification and sentiment analysis than text-davinci-003. To note, GPT-3.5-turbo performs at a similar capability to text-davinci-003 but at 10 percent the price per token. OpenAI recommends GPT-3.5-turbo for most use cases. -
OpenAI
text-embedding-ada-002
$0.0001An embedding API model, such as Ada, is a powerful tool that converts words into numerical representations, enabling computers to understand and process natural language more effectively. This process is crucial for developing machine learning algorithms and artificial intelligence systems that can interact with humans, analyze text, or make predictions based on text. OpenAI’s text embeddings is built for advanced search, clustering, topic modeling, and classification functionality.Access is available through a request to OpenAI’s API. -
TruthGPT
TruthGPT
OtherTruthGPT is a large language model (LLM), and according to Elon Musk, TruthGPT will be a “maximum truth-seeking” AI. In terms of how it works, it filters through thousands of datasets and draws educated conclusions to provide answers that are as unbiased as possible. TruthGPT is powered by $TRUTH, a tradable cryptocurrency on the Binance Smart Chain. $TRUTH holders will soon access additional benefits when using TruthGPT AI. When we learn more, we will update this section. -
Microsoft
VALL-E
OTHERVALL-E is a LLM for text to speech synthesis (TTS) developed by Microsoft (technically it is a neural codec language model). Its creators state that VALL-E could be used for high-quality text-to-speech applications, speech editing where a recording of a person could be edited and changed from a text transcript (making them say something they originally didn’t), and audio content creation when combined with other generative AI models. Studies indicate that VALL-E notably surpasses the leading zero-shot TTS system regarding speech authenticity and resemblance to the speaker. Furthermore, it has been observed that VALL-E is capable of retaining the emotional expression and ambient acoustics of the speaker within the synthesized output. Unfortunately, VALL-E is not available for any form of public consumption at this time. At the time of writing, VALL-E is a research project, and there is no customer onboarding queue or waitlist (but you can apply to be part of the first testers group). -
OpenAI
Whisper
0.006Whisper is an automatic speech recognition (ASR) system capable of transcribing in multiple languages as well as translating them into English. With Whisper, you can easily transcribe speech into text, allowing you to capture conversations and meetings for future reference. And if you need to communicate with someone who speaks a different language, Whisper can help with that too — it can translate many different languages into English, making it easier than ever to bridge the gap and ensure that everyone is on the same page.
Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. The speech to text API has two endpoints (transcriptions and translations) and file uploads are currently limited to 25 MB, and the following input file types are supported: mp3, mp4, mpeg, mpga, m4a, wav, and webm.