artificial intelligence

Home Posts Tagged "artificial intelligence"

DeepSeek-VL2: Advancing Vision-Language Models with Mixture-of-Experts

6 Feb 2025 · 4 min read

DeepSeek-VL2: Advancing Vision-Language Models with Mixture-of-Experts

Discover DeepSeek-VL2, a state-of-the-art vision-language model leveraging Mixture-of-Experts (MoE) architecture. Explore its innovations in dynamic tiling, Multi-head Latent Attention (MLA), data construction, training methodology, and benchmark evaluations.

Read more

Kimi K1.5: Scaling Reinforcement Learning for State-of-the-Art LLMs

27 Jan 2025 · 6 min read

Kimi K1.5: Scaling Reinforcement Learning for State-of-the-Art LLMs

Explore the innovative methodologies and groundbreaking advancements of Kimi K1.5, the latest multimodal LLM scaling reinforcement learning to new heights. Learn about long-context scaling, multimodal training, and state-of-the-art performance benchmarks.

Read more