Phi-4-multimodal-instruct
Microsoft
Overview
Key Metrics
Max Input: 128K tokens
Max Output: 128K tokens
Input Price: - (per 1M tokens)
Output Price: - (per 1M tokens)
Throughput: - tok/s
Latency: - ms
Model Information
Release Details
Released: 01 Feb 2025
Knowledge Cutoff: Jun 2024
License: MIT
Model Architecture
Parameters: -
Training Data: -
Context Window
Input Context Length: 128,000 tokens
Output Context Length: 128,000 tokens
Key Features
Web Access: No (real-time access to current web information)
Multimodal: Yes (ability to process multiple data types: text, images, etc.)
Reasoning: Unknown (advanced logical and deductive reasoning capabilities)
Fine-Tunable: Unknown (can be customized for specific use cases)
Model Release & Updates
01 Feb 2025: Model Released (model first made available to the public)
Benchmarks & Performance Comparison
GPQA: -
LisanBench: -
AidanBench: -
Aider-Polyglot: -
LiveBench: -
AIME 2024: -
ARC-AGI-1: -
ARC-AGI-2: -
AIME 2025: -
Humanity's Last Exam: -