Google's Gemma 3 is multimodal, comes in four sizes and can now handle more information and instructions thanks to a larger context window.
Detailing the sensor suite used to capture the L2D data, Hugging Face said that each of the 60 Kia Niro EV models was equipped with six RGB cameras to capture the vehicle's surroundings in 360p, ...
Google has released four new open-source AI models under the Gemma 3 series, which are tailor-made for deploying on mobile ...
Gemma 3 models are capable of processing text and visual inputs but can only generate text outputs. The models are available ...
Hugging Face has teamed up with startup Yaak to expand the former's LeRobot platform with training data for self-driving ...
First of all, the MassSpecGym dataset is available as a Hugging Face dataset and can be downloaded within the code into a pandas DataFrame as follows. Third, MassSpecGym provides a MassSpecDataModule, ...
IBM has recently released the Granite 3.2 series of open-source AI models, enhancing inference capabilities and introducing ...
The new small language model can help developers build multimodal AI applications for lightweight computing devices, ...
Microsoft's new Phi-4 AI models deliver breakthrough performance in a compact size, processing text, images and speech simultaneously while requiring less computing power than competitors.
Microsoft has launched Phi-4-multimodal and Phi-4-mini, the latest additions to its Phi family of small language models (SLMs ...
The second new model that Microsoft released today, Phi-4-multimodal, is an upgraded version of Phi-4-mini with 5.6 billion ...