Qwen 29 Jan 2025 · 4 min read Qwen2.5-Max: Alibaba's Open-Weight MoE Model Shatters AI Benchmarks Discover Qwen2.5-Max, Alibaba Cloud’s latest large-scale Mixture-of-Experts (MoE) model trained on 20T+ tokens. Learn how it outperforms top AI models in reasoning, coding, and general intelligence. Explore benchmarks, API access, and future AI advancements. Read more
Kimi K1.5 27 Jan 2025 · 6 min read Kimi K1.5: Scaling Reinforcement Learning for State-of-the-Art LLMs Explore the innovative methodologies and groundbreaking advancements of Kimi K1.5, the latest multimodal LLM scaling reinforcement learning to new heights. Learn about long-context scaling, multimodal training, and state-of-the-art performance benchmarks. Read more
DeepSeek 21 Jan 2025 · 7 min read DeepSeek R1: Revolutionizing AI Reasoning with Multi-Stage Innovation Discover how DeepSeek R1, a groundbreaking reasoning language model, uses innovative multi-stage training and distillation techniques to excel in reasoning, coding, and mathematics, rivaling OpenAI-o1. Learn about its API access, pricing, and future potential. Read more
AI safety 19 Dec 2024 · 17 min read Alignment Faking in Large Language Models: Could AI Be Deceiving Us? Explore how alignment faking in AI models like LLMs affects trust, safety, and alignment with human values. Learn about recent research and solutions to address these challenges. Read more