We continue to advance the FPT AI Factory platform to improve scalability, performance, and operational efficiency. This release, dated December 12, 2025, introduces new features and optimizations that make your workflows smoother and faster.

FPT AI Studio
Accelerate LLM workflows with new optimization techniques and gain real-time visibility through Grafana-integrated UI Logs.
New feature
1. Full support for the Qwen3VL model
Allow users to leverage the state-of-the-art multimodal capabilities of the Qwen3VL model family across AI Studio and related services, for tasks such as visual understanding.

2. Model Catalog Download via SDK
Enabling Model Catalog download via SDK gives developers a faster, automated way to integrate and manage models, improving workflow efficiency.

AI Notebook
Boost automation, ease of use, and performance, helping customers work faster and smarter with AI Notebook.
New feature
1. Automated Lab Version Upgrade
Remove manual steps for deleting old labs and remapping during version upgrades, saving time and reducing errors.

2. Event Notification Scheduling
Enable scheduled system and feature announcements directly in AI Notebook, ensuring users stay informed without disruption.

3. Notebook Gallery
Offer ready-to-use notebooks for common use cases across various topics, allowing quick reference and execution to accelerate development.

4. GPU Quota Control
Introduce per-tenant GPU usage limits for better resource allocation and cost management, ensuring fair and efficient utilization.
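
To illustrate the idea of a per-tenant GPU limit, here is a minimal Python sketch. It is not the AI Notebook implementation: the `TenantQuota` class, its fields, and the tenant names are hypothetical and exist only to show how a request can be accepted or rejected against a limit.

```python
from dataclasses import dataclass


@dataclass
class TenantQuota:
    """Hypothetical per-tenant GPU limit (illustrative, not the platform's data model)."""

    gpu_limit: int
    gpus_in_use: int = 0

    def can_allocate(self, requested: int) -> bool:
        # A request fits only if it is positive and stays within the tenant's limit.
        return requested > 0 and self.gpus_in_use + requested <= self.gpu_limit

    def allocate(self, requested: int) -> bool:
        """Reserve GPUs if the request fits; return whether it succeeded."""
        if not self.can_allocate(requested):
            return False
        self.gpus_in_use += requested
        return True

    def release(self, count: int) -> None:
        # Never let the usage counter drop below zero.
        self.gpus_in_use = max(0, self.gpus_in_use - count)


# Assumed example tenants and limits.
quotas = {"tenant-a": TenantQuota(gpu_limit=4), "tenant-b": TenantQuota(gpu_limit=2)}

first = quotas["tenant-a"].allocate(3)   # fits within the limit of 4
second = quotas["tenant-a"].allocate(2)  # rejected: 3 + 2 exceeds 4
third = quotas["tenant-b"].allocate(2)   # fits exactly
```

Because each tenant has its own counter, one tenant reaching its limit does not affect what another tenant can request.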

FPT AI Inference
Achieve operational stability with upgrades to the LiteLLM engine, billing, Kafka, and top-up services.
New feature
1. Infrastructure & API Stability
- LiteLLM Upgrade: Enhance system resilience and processing efficiency with the upgraded LiteLLM architecture.
- API Standardization: Ensure consistent data output and improved integration capabilities by optimizing and standardizing the parameters of the v1/responses/ endpoint.
2. Production Go-Live & Core Services
- Seamless Payments & Billing: Enjoy instant account top-ups and accurate, real-time service charge tracking.
- High-Performance Connectivity: Experience a faster, smoother platform with improved stability for real-time interactions.
- Enhanced User Feedback: See the result of each action immediately through upgraded popups that provide instant status updates.

Billing
Get real-time tracking, transparent cost insights, and a centralized dashboard for all usage-related information.
New feature
Product Usage: A centralized interface that helps users manage budgets, optimize resource consumption, and make data-driven decisions with confidence. It displays:
- Total cost of all services.
- Real-time updates for the current billing period.
- Cost breakdown by product category: GPU Container, AI Inference, Model Fine-Tuning, Model Hub, and Interactive Session.
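
The breakdown described above can be sketched in Python as a simple aggregation. This is an illustration only: the `(category, cost)` record format and the `cost_breakdown` function are assumptions, not the Billing service's API or data model. Only the category names come from the list above.

```python
from decimal import Decimal

# Product categories as listed in the release notes.
CATEGORIES = (
    "GPU Container",
    "AI Inference",
    "Model Fine-Tuning",
    "Model Hub",
    "Interactive Session",
)


def cost_breakdown(records):
    """Sum costs per category and overall.

    `records` is a hypothetical iterable of (category, cost_string) pairs.
    Decimal is used instead of float to avoid rounding drift in money values.
    """
    totals = {category: Decimal("0") for category in CATEGORIES}
    for category, cost in records:
        if category not in totals:
            raise ValueError(f"unknown category: {category}")
        totals[category] += Decimal(cost)
    grand_total = sum(totals.values(), Decimal("0"))
    return totals, grand_total


# Made-up sample data for illustration.
sample_records = [
    ("GPU Container", "12.50"),
    ("AI Inference", "3.20"),
    ("GPU Container", "7.50"),
    ("Model Hub", "1.00"),
]
by_category, total = cost_breakdown(sample_records)
```

Categories with no usage still appear with a zero total, which matches a dashboard that always shows every product line.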

Use case
- View total usage and spending across all FPT AI Factory services in a single, unified dashboard.
- Track detailed usage history (GPU Container, AI Inference, Model Fine-Tuning, Model Hub, Interactive Session) by day, month, or year.
- Monitor real-time costs to prevent unexpected overspending and improve budget control.
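
As a rough illustration of the last two use cases, the Python sketch below rolls usage up by day, month, or year and checks spending against a budget. The `(date, cost)` entry format, the function names, and the sample figures are all hypothetical; the dashboard's actual logic is not documented here.

```python
from collections import defaultdict
from datetime import date
from decimal import Decimal

# strftime patterns used as grouping keys for each granularity.
FORMATS = {"day": "%Y-%m-%d", "month": "%Y-%m", "year": "%Y"}


def rollup(entries, granularity):
    """Group hypothetical (date, cost_string) entries by day, month, or year."""
    fmt = FORMATS[granularity]
    totals = defaultdict(Decimal)  # Decimal() starts each bucket at 0
    for day, cost in entries:
        totals[day.strftime(fmt)] += Decimal(cost)
    return dict(totals)


def over_budget(entries, budget):
    """Return True when total spending exceeds the given budget."""
    spent = sum((Decimal(cost) for _, cost in entries), Decimal("0"))
    return spent > Decimal(budget)


# Made-up sample data for illustration.
entries = [
    (date(2025, 12, 1), "5.00"),
    (date(2025, 12, 1), "2.50"),
    (date(2025, 12, 12), "10.00"),
    (date(2025, 11, 30), "4.00"),
]
daily = rollup(entries, "day")
monthly = rollup(entries, "month")
yearly = rollup(entries, "year")
alert = over_budget(entries, "20.00")
```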
