Just links – Telegram

Just links

6.78K subscribers

371 photos

43 videos

10 files

7.93K links

That's just link aggregator of everything I consider interesting, especially DL and topological condensed matter physics. @EvgeniyZh

Download Telegram

About

Blog

Apps

Platform

6.78K subscribers

Forwarded from эйай ньюз

This media is not supported in your browser

VIEW IN TELEGRAM

This media is not supported in your browser

VIEW IN TELEGRAM

This media is not supported in your browser

VIEW IN TELEGRAM

This media is not supported in your browser

VIEW IN TELEGRAM

Lucy 2.0 — риалтайм VFX c заменой персонажа

Decart выкатили Lucy 2.0 — модель, которая в реальном времени заменяет вас на видео с вебки другим персонажем. Всё это в 1080p 30fps и почти без задержки.

По сути, это Runway act 2 или Kling Motion Control, но в риалтайме. Обычно такие видосы генерятся длиной не более 30 секунд, дальше начинается дрейф. Проблему "дрейфа", когда картинка со временем плывёт, попробовали решить дополнительным этапом тюна. Правда, на деле качество самой картинки ещё очень далеко от офлайн-конкурентов, и персонаж всё равно дрейфует, и заметно, как обновляется лицо.

Как выяснилось, модель (пока?) без цензуры, так что определённые виды стримеров напряглись.

Но кроме очевидных развлечений, Lucy 2.0 показывает возможности редактирования видео, например стилизацию всего кадра или замену его отдельных частей, причём на вход можно подать как текстовый промпт, так и картинку. Кроме того, разрабы предлагают использовать её как движок для симуляций и аугментации данных для робототехники. Можно взять одну демонстрацию и прогнать её с тысячами вариаций окружения, освещения и материалов для расширения датасета. Вся эта риалтайм-магия работает на ускорителях AWS Trainium3.

Неужели 26-й год — год риалтайм-видеогенерации?

Демо

@ai_newz

1.96K views16:44

Forwarded from ∅

https://jamanetwork.com/journals/jamanetworkopen/fullarticle/2843883

Emoji Use in the Electronic Health Record

This cross-sectional study investigates rates and characteristics of emoji use in clinical notes within electronic health records.

2.43K views13:15

PaperBanana: Automating Academic Illustration for AI Scientists https://arxiv.org/abs/2601.23265

PaperBanana: Automating Academic Illustration for AI Scientists

Despite rapid advances in autonomous AI scientists powered by language models, generating publication-ready illustrations remains a labor-intensive bottleneck in the research workflow. To lift...

🙈5

2.58K views05:37

Forwarded from Oktai Tatanov

https://matharena.ai/arxivmath

👀2

1.99K views08:27

Spurious Rewards Paradox: Mechanistically Understanding How RLVR Activates Memorization Shortcuts in LLMs
https://arxiv.org/abs/2601.11061
via @buckwheat_thoughts

Spurious Rewards Paradox: Mechanistically Understanding How RLVR...

Reinforcement Learning with Verifiable Rewards (RLVR) is highly effective for enhancing LLM reasoning, yet recent evidence shows models like Qwen 2.5 achieve significant gains even with spurious...

1.95K views12:53

https://www.anthropic.com/engineering/building-c-compiler

@seeallochnaya

Building a C compiler with a team of parallel Claudes

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

👀7😁3😨3🥴1👾1

2.05K views19:17

https://balatrobench.com/

Leaderboard benchmarking LLMs playing Balatro: rounds, tool-call reliability, cost, and speed.

🔥6

2.14K views21:40

Weak Diffusion Priors Can Still Achieve Strong Inverse-Problem Performance https://arxiv.org/abs/2601.22443

Weak Diffusion Priors Can Still Achieve Strong Inverse-Problem Performance

Can a diffusion model trained on bedrooms recover human faces? Diffusion models are widely used as priors for inverse problems, but standard approaches usually assume a high-fidelity model trained...

1.95K views08:50

Are AI Capabilities Increasing Exponentially? A Competing Hypothesis https://arxiv.org/abs/2602.04836

Are AI Capabilities Increasing Exponentially? A Competing Hypothesis

Rapidly increasing AI capabilities have substantial real-world consequences, ranging from AI safety concerns to labor market consequences. The Model Evaluation & Threat Research (METR) report...

1.92K views11:42

BabyVision: Visual Reasoning Beyond Language https://unipat.ai/blog/BabyVision

BabyVision: Visual Reasoning Beyond Language

State-of-the-art MLLMs achieve PhD-level language reasoning but struggle with visual tasks that 3-year-olds solve effortlessly. We introduce BabyVision, a benchmark revealing the infancy of AI vision.

8.54K views14:03

Forwarded from Hacker News

The Waymo World Model: A New Frontier for Autonomous Driving Simulation (🔥 Score: 157+ in 1 hour)

Link: https://readhacker.news/s/6Ma63
Comments: https://readhacker.news/c/6Ma63

The Waymo World Model: A New Frontier For Autonomous Driving Simulation

We are excited to introduce the Waymo World Model, a frontier generative model that sets a new bar for large-scale, hyper-realistic autonomous driving simulation.

2.02K views17:54

Read 77+ Comments

Refer-Agent: A Collaborative Multi-Agent System with Reasoning and Reflection for Referring Video Object Segmentation https://arxiv.org/abs/2602.03595

Refer-Agent: A Collaborative Multi-Agent System with Reasoning and...

Referring Video Object Segmentation (RVOS) aims to segment objects in videos based on textual queries. Current methods mainly rely on large-scale supervised fine-tuning (SFT) of Multi-modal Large...

2.22K views08:35

Learning to Repair Lean Proofs from Compiler Feedback https://arxiv.org/abs/2602.02990

Learning to Repair Lean Proofs from Compiler Feedback

As neural theorem provers become increasingly agentic, the ability to interpret and act on compiler feedback is critical. However, existing Lean datasets consist almost exclusively of correct...

❤1

2.45K views08:48

First Proof https://arxiv.org/abs/2602.05192

To assess the ability of current AI systems to correctly answer research-level mathematics questions, we share a set of ten math questions which have arisen naturally in the research process of...

🤔3

2.38K views14:24

Universal Topological Gates from Braiding and Fusing Anyons on Quantum Hardware https://arxiv.org/abs/2601.20956

Universal Gates from Braiding and Fusing Anyons on Quantum Hardware

Topological quantum computation encodes quantum information in the internal fusion space of non-Abelian anyonic quasiparticles, whose braiding implements logical gates. This goes beyond Abelian...

2.14K views16:08

Expanding the Capabilities of Reinforcement Learning via Text Feedback https://arxiv.org/abs/2602.02482

Expanding the Capabilities of Reinforcement Learning via Text Feedback

The success of RL for LLM post-training stems from an unreasonably uninformative source: a single bit of information per rollout as binary reward or preference label. At the other extreme,...

2.33K views16:09

https://fixupx.com/i/status/2021239388173213737

🧵 Thread • FixupX

Charlie (Zixi) Chen (@charllechen)

Why is nanochat's optimal tokens per param 8, much smaller than 20 from Chinchilla? We had similar findings in our NeurIPS work https://arxiv.org/abs/2512.05620. We hypothesize two key factors: (1) improved optimization and (2) higher-quality data. 1/n🧵
…

🔥2

11K views22:16

Goal-Guided Efficient Exploration via Large Language Model in Reinforcement Learning https://arxiv.org/abs/2509.22008

Goal-Guided Efficient Exploration via Large Language Model in...

Real-world decision-making tasks typically occur in complex and open environments, posing significant challenges to reinforcement learning (RL) agents' exploration efficiency and long-horizon...

2.18K views23:42

Self-dual Higgs transitions: Toric code and beyond https://arxiv.org/abs/2601.20945

Self-dual Higgs transitions: Toric code and beyond

The toric code, when deformed in a way that preserves the self-duality $\mathbb{Z}_2$ symmetry exchanging the electric and magnetic excitations, admits a transition to a topologically trivial...

🤯2👾1

2.06K views13:13

We hid backdoors in binaries — Opus 4.6 found 49% of them https://quesma.com/blog/introducing-binaryaudit/

We hid backdoors in ~40MB binaries and asked AI + Ghidra to find them - Quesma Blog

BinaryAudit benchmarks AI agents using Ghidra to find backdoors in compiled binaries of real open-source servers, proxies, and network infrastructure.

👍6

3.24K viewsedited 15:03