Google Gemma 3 (google/gemma-3) | Inference | Released Jan 1, 2025 | 125,000 token context window | In $0.15 · Out $0.30 | Tool calling, Attachments, Open weights, Temp controls | Input: text, image | Output: text
Llama 3.1 8B Instruct (meta/llama-3.1-8b-instruct) | Inference | Released Jan 1, 2025 | 16,000 token context window | In $0.025 · Out $0.025 | Tool calling, Open weights, Temp controls | Input: text | Output: text
Llama 3.2 11B Vision Instruct (meta/llama-3.2-11b-vision-instruct) | Inference | Released Jan 1, 2025 | 16,000 token context window | In $0.055 · Out $0.055 | Tool calling, Attachments, Open weights, Temp controls | Input: text, image | Output: text
Llama 3.2 1B Instruct (meta/llama-3.2-1b-instruct) | Inference | Released Jan 1, 2025 | 16,000 token context window | In $0.01 · Out $0.01 | Tool calling, Open weights, Temp controls | Input: text | Output: text
Llama 3.2 3B Instruct (meta/llama-3.2-3b-instruct) | Inference | Released Jan 1, 2025 | 16,000 token context window | In $0.02 · Out $0.02 | Tool calling, Open weights, Temp controls | Input: text | Output: text
Mistral Nemo 12B Instruct (mistral/mistral-nemo-12b-instruct) | Inference | Released Jan 1, 2025 | 16,000 token context window | In $0.038 · Out $0.10 | Tool calling, Open weights, Temp controls | Input: text | Output: text
Osmosis Structure 0.6B (osmosis/osmosis-structure-0.6b) | Inference | Released Jan 1, 2025 | 4,000 token context window | In $0.10 · Out $0.50 | Tool calling, Open weights, Temp controls | Input: text | Output: text
Qwen 2.5 7B Vision Instruct (qwen/qwen-2.5-7b-vision-instruct) | Inference | Released Jan 1, 2025 | 125,000 token context window | In $0.20 · Out $0.20 | Tool calling, Attachments, Open weights, Temp controls | Input: text, image | Output: text
Qwen 3 Embedding 4B (qwen/qwen3-embedding-4b) | Inference | Released Jan 1, 2025 | 32,000 token context window | Subscription plan pricing | Open weights | Input: text | Output: text