Papers

OMG-Seg: Is One Model Good Enough For All Segmentation? Post feature image

OMG-Seg: Is One Model Good Enough For All Segmentation?

OMG-Seg is One Model that is Good enough to efficiently and effectively handle all the segmentation tasks, including image semantic, instance, and panoptic segmentation, as well as their video counterparts, open vocabulary settings, prompt-driven, interactive segmentation.

Segment Anything Model Post feature image

Segment Anything Model

SAM is a promptable segmentation system with zero-shot generalization to unfamiliar objects and images, without the need for additional training. The model was trained on Meta AI’s SA-1B dataset for 3-5 days on 256 A100 GPUs. Make sure that you try it!

Neural Preset for Color Style Transfer Post feature image

Neural Preset for Color Style Transfer

Neural Preset is a technique that uses AI to generate and transfer color styles. It can extract color styles from given reference images, store them as presets, and apply them to other images and videos, producing output with target color styles. Check it out!

BloombergGPT: A Large Language Model for Finance Post feature image

BloombergGPT: A Large Language Model for Finance

BloombergGPT is a 50 billion parameter language model that is trained on a wide range of financial data. It is validated on standard LLM benchmarks, open financial benchmarks, and a suite of internal benchmarks that most accurately reflect our intended usage.

S-NeRF: Neural Radiance Fields for Street Views Post feature image

S-NeRF: Neural Radiance Fields for Street Views

In this paper, the authors propose a new street-view NeRF (S-NeRF) that considers novel view synthesis of both the large-scale background scenes and the foreground moving vehicles jointly. Learn more about their approach and the results of experiments!

Universal Guidance for Diffusion Models Post feature image

Universal Guidance for Diffusion Models

Typical diffusion models cannot be conditioned on other modalities without retraining. This work presents a universal guidance algorithm that enables diffusion models to be controlled by arbitrary guidance modalities without the need to retrain any use-specific components.

3D Generation on ImageNet Post feature image

3D Generation on ImageNet

In this paper, the authors develop a 3D generator with Generic Priors (3DGP): a 3D synthesis framework with more general assumptions about the training data, and show that it scales to challenging datasets, like ImageNet. It is based on three new ideas. Learn them!