Mistral Falls
Microsoft announces a partnership with French AI startup Mistral
![](https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/0e0158b6-b9cc-47dd-91d1-e485f8582d9b/Screenshot_2024-01-22_at_2.13.39_PM.png?t=1705950837)
Microsoft announces a partnership (read: licensing Mistral's latest models + taking a small equity stake in the firm) with Mistral. Mistral is a French AI startup founded by former Meta and Google DeepMind researchers.
We're announcing a multi-year partnership with @MistralAI, as we build on our commitment to offer customers the best choice of open and foundation models on Azure.
Satya Nadella (@satyanadella), 5:18 PM • Feb 26, 2024
Mistral originally focused on releasing smaller open-source models and has scaled them up over the last six months; its latest model is now within striking distance of GPT-4.
![](https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,format=auto,onerror=redirect,quality=80/uploads/asset/file/c78df6f1-f561-4c9f-8cd2-2348e3edca5f/Screenshot_2024-02-26_at_11.55.07_AM.png?t=1708977319)
The Mistral-Large model:
- is natively fluent in English, French, Spanish, German and Italian
- has a 32k-token context window
- has precise instruction following (I don't know how this claim is measured; taken literally it would amount to full AGI, which it obviously isn't, but they chose the language)
- supports function calling and JSON output formatting

They're not open-sourcing it for now; it's available via API or can be leased in its own environment from Microsoft.
Interestingly, Mistral seems to have created multilingual versions of popular benchmarks, which could be a significant contribution if they decide to open-source them.
Mistral published these multilingual benchmarks. As far as I know there are no published translated versions of these anywhere.
Did they just use machine translation?
Does a translated version of HellaSwag even make sense? Please publish them or they are meaningless!
xlr8harder (@xlr8harder), 7:29 PM • Feb 26, 2024
The model is priced at par with GPT-4.
mistral-large costs %20 less than gpt4-turbo, but mistrals tokenizer results in %20 more tokens on average. You just use a worse model for the same $$$
Xeophon (@TheXeophon), 5:53 PM • Feb 26, 2024
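A quick back-of-the-envelope check of that claim, taking the two 20% figures from the tweet at face value. They are the tweet's numbers, not verified pricing, and the function and parameter names below are made up for illustration:

```python
def effective_cost_ratio(price_discount: float, token_inflation: float) -> float:
    """Relative cost of processing the same text versus the baseline model.

    price_discount: fractional per-token discount (0.20 means 20% cheaper per token).
    token_inflation: fractional increase in token count for the same text
                     (0.20 means the tokenizer emits 20% more tokens).
    Both inputs are assumptions taken from the tweet, not measured values.
    """
    return (1.0 - price_discount) * (1.0 + token_inflation)


# 20% cheaper per token, but 20% more tokens for the same text.
ratio = effective_cost_ratio(price_discount=0.20, token_inflation=0.20)
print(f"Relative cost for the same text: {ratio:.2f}x")
```

On those numbers the same text costs about 0.96x as much, so the per-token discount is almost entirely eaten by the extra tokens. That is roughly par, which is the tweet's point.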
Still haven't beaten OpenAI, but we're probably only a few months away at this point.
If mistral's new large model couldn't surpass gpt-4, what hope does anyone else have? OpenAI lead is > 1 year
anton (@abacaj), 5:30 PM • Feb 26, 2024
Even in AI, the money train seems to be tightening.
Not gonna lie; I am sad about Mistral not open-sourcing any of their models 😢
I thought they were on the OSS team.
Bindu Reddy (@bindureddy), 7:13 PM • Feb 26, 2024
As some had expected it to:
Told you so!*
(*that the only biz model for even the best model startups is being acquired by or otherwise entering into an encumbered relationship w one of the big US infra providers. Meaning there's no chance rn for meaningful EU (or, other) competition w US AI giants.)
Meredith Whittaker (@mer__edith), 6:57 PM • Feb 26, 2024
This also hedges MSFT's position in OpenAI against regulatory scrutiny:
Microsoft strikes a deal with Mistral. Here is what you need to know:
* $MSFT will help bring Mistral models to market. $MSFT gets a minority stake in Mistral
* Push gives $MSFT flexibility moving forward if OpenAI comes under regulatory scrutiny
Ralph Brooks (@ralphbrooks), 2:49 PM • Feb 26, 2024
Satya playing another masterstroke… you gotta think: Where is Andy Jassy at Amazon in all this?