Clicky chatsimple

Codestral, Mistral’s First Generative AI Model For Code

Category :

AI

Posted On :

Share This :

Codestral, the first generative AI model for coding, was published by Mistral, the French AI firm valued at $6 billion and supported by Microsoft.

Like other code-generating technologies, Codestral is intended to facilitate code authoring and developer interaction. According to a blog post by Mistral, it was trained on more than 80 programming languages, such as Python, Java, C++, and JavaScript. In addition to writing tests, finishing coding tasks, and “filling in” missing code, Codestral can also respond to English inquiries regarding a codebase.

Although Mistral calls the model “open,” that is debatable. The startup’s license forbids using Codestral or any of its outputs for any kind of business endeavor. Even the “development” exception is subject to restrictions, as the license expressly forbids “any internal usage by employees in the context of the company’s business activities.”

It’s possible that Codestral was taught in part on copyrighted material. Although Mistral didn’t say so in the blog post, it wouldn’t be shocking as there has been proof that copyrighted data was included in the startup’s prior training datasets.

Besides, Codestral might not be worth the hassle. Considering that the model has 22 billion parameters, it needs a powerful PC to run. (In essence, parameters define an AI model’s proficiency with a task, such as producing and evaluating text.) Furthermore, even while it outperforms the competition by certain benchmarks—which are, as we all know, unreliable—it’s hardly a landslide.

Although Codestral offers very slight performance gains and is unfeasible for the majority of developers, it will undoubtedly intensify the discussion around the viability of using code-generating models as programming assistants.

Generative AI techniques are undoubtedly being adopted by developers for some coding tasks. In a June 2023 Stack Overflow survey, 44% of developers stated they currently use AI technologies in their development process, and 26% said they want to do so in the near future. But these are clearly flawed tools.

GitClear’s examination of over 150 million lines of code contributed to project repos over the course of several years revealed that generative AI development tools are causing an increase in the amount of incorrect code that is pushed into codebases. Security experts have already cautioned that these kinds of tools have the potential to magnify already-existing vulnerabilities and security issues in software projects; a Purdue University study found that more than half of the responses provided by OpenAI’s ChatGPT to queries about programming are incorrect.

That won’t deter businesses like Mistral and others from trying to make money off of their ideas and acquire market share. Mistral announced a hosted version of Codestral this morning on both its premium API and conversational AI platform, Le Chat. According to Mistral, efforts have also been made to integrate Codestral into LlamaIndex, LangChain, Continue.dev, and Tabnine, among other app frameworks and development environments.