Hugging Face recently launched HuggingSnap, an iOS application that runs SmolVLM2, a small but performant multimodal language model that accepts video, images, and text as inputs, and generates text in response. It can be used for:

  • vision understanding tasks, such as answering questions about, identifying, or describing objects within an image or video (see the sketch after this list);
  • generating text grounded in visual information, such as writing a story based on the contents of one or more images;
  • text-only tasks, as one would with a standard language model.
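
HuggingSnap wraps the model in a native iOS app, but the underlying checkpoint can also be exercised directly with the Hugging Face transformers library. The sketch below shows one way to ask SmolVLM2 a question about a single image; the model ID, image path, and generation settings are illustrative assumptions rather than details taken from HuggingSnap itself.

```python
# Illustrative sketch: asking SmolVLM2 a question about one image.
# The model ID and message format follow Hugging Face's published
# SmolVLM2 checkpoints; they are assumptions, not HuggingSnap internals.
import torch
from transformers import AutoModelForImageTextToText, AutoProcessor

model_id = "HuggingFaceTB/SmolVLM2-2.2B-Instruct"  # assumed checkpoint name
processor = AutoProcessor.from_pretrained(model_id)
model = AutoModelForImageTextToText.from_pretrained(
    model_id, torch_dtype=torch.bfloat16
)

# A multimodal chat turn: one image plus a text question.
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "path": "photo.jpg"},  # placeholder: point at a real image
            {"type": "text", "text": "What objects are visible in this picture?"},
        ],
    }
]

# Tokenize the conversation and preprocess the image in one step,
# then match the model's device and floating-point precision.
inputs = processor.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device, dtype=torch.bfloat16)

generated = model.generate(**inputs, max_new_tokens=128)
print(processor.batch_decode(generated, skip_special_tokens=True)[0])
```

The same chat-message structure can, in principle, carry several images or a video entry instead of a single image, which is how the video-understanding tasks listed above are expressed.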

Apps that leverage multimodal generative AI for various vision and text-based tasks are hardly new. However, HuggingSnap's key selling point is that SmolVLM2 runs locally and efficiently: the app does not require an internet connection to work, and all data is processed on the device without sacrificing performance.

HuggingSnap can be downloaded from the App Store or built from its GitHub repository, and requires an iPhone running iOS 18.