Google's Gemma 3 is multimodal, comes in four sizes and can now handle more information and instructions thanks to a larger context window.
Detailing the sensor suite used to capture the L2D data, Hugging Face said that each of the 60 Kia Niro EV models were equipped with six RGB cameras to capture the vehicle's surrounding in 360p, ...
Google has released four new open-source AI models under the Gemma 3 series, which are tailor-made for deploying on mobile ...
Gemma 3 models are capable of processing text and visual inputs but can only generate text outputs. The models are available ...
Hugging Face has teamed up with startup Yaak to expand the former's LeRobot platform with training data for self-driving ...
Today at Embedded World, MediaTek introduced the high-performance Genio 720 and Genio 520 edge-AI IoT platforms. These new additions to the Genio series support the latest generative AI models, human ...
AI needs to question its training data and take counterintuitive approaches, the top scientist at Hugging Face wrote on X.
IBM has recently released the Granite 3.2 series of open-source AI models, enhancing inference capabilities and introducing ...
The new small language model can help developers build multimodal AI applications for lightweight computing devices, ...
Microsoft is expanding its Phi line of open-source language models with two new algorithms optimized for multimodal ...
Microsoft's new Phi-4 AI models deliver breakthrough performance in a compact size, processing text, images and speech simultaneously while requiring less computing power than competitors.