Improving Agent Design, JPEG-LM's Visual Breakthrough, TurboEdit's Real-Time Image Edits, Video Segmentation Advances, LLMs Learning Like Humans, RL Benchmarks
MP3•Episoder hjem
Manage episode 435391262 series 3568650
Innhold levert av PocketPod. Alt podcastinnhold, inkludert episoder, grafikk og podcastbeskrivelser, lastes opp og leveres direkte av PocketPod eller deres podcastplattformpartner. Hvis du tror at noen bruker det opphavsrettsbeskyttede verket ditt uten din tillatelse, kan du følge prosessen skissert her https://no.player.fm/legal.
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models JPEG-LM: LLMs as Image Generators with Canonical Codec Representations Automated Design of Agentic Systems TurboEdit: Instant text-based image editing Surgical SAM 2: Real-time Segment Anything in Surgical Video by Efficient Frame Pruning Fine-tuning Large Language Models with Human-inspired Learning Strategies in Medical Question Answering D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning
…
continue reading
70 episoder