AI & Machine Learning
Artificial intelligence and machine learning APIs for text generation, image analysis, and intelligent automation
Considerações Importantes
- ✓ Model quality vs cost trade-offs
- ✓ Latency requirements for real-time use
- ✓ Data privacy and compliance needs
- ✓ API rate limits and scaling
- ✓ Specialized vs general-purpose models
Anthropic Claude API
Access to Claude AI models for complex reasoning, writing, code generation, and analysis. Features Opus, Sonnet, and Haiku model tiers.
Anyscale
Anyscale is the managed platform for Ray, created by Ray's founders, enabling teams to run and scale ML/AI workloads across data processing, training, and inference. It offers cloud-based development environments, fault-tolerant clusters, and cost optimization features.
AssemblyAI
AI speech-to-text API. Universal: $0.15/hr base. Add-ons stack: diarization +$0.02/hr, sentiment +$0.02/hr. $50 free credits.
AWS Rekognition
Amazon computer vision API. ~$0.001/image (first 1M), $0.0004/image at 100M+. 1K images/mo free for 12 months.
Bolt.new
AI-powered full-stack app builder. Describe your app in natural language and get a complete, deployable application with frontend, backend, and database.
Cohere
Enterprise AI platform. Command A: $2.50/$10 per MTok. Embed: $0.12/MTok. Free trial available.
Deepgram
AI speech recognition API. Nova-2: $0.0043/min streaming, $0.0036/min batch. $200 free credits for new accounts.
DeepSeek
Ultra-low-cost LLM API with OpenAI-compatible interface. DeepSeek-V3 models offer state-of-the-art performance at a fraction of competitor pricing. MIT-licensed models available for self-hosting.
Devin
AI software engineer by Cognition. An autonomous AI agent that can plan, code, debug, and deploy complete features with minimal human intervention.
ElevenLabs
ElevenLabs provides AI voice generation and voice agents platform with 5,000+ voices in 70+ languages. Their API offers text-to-speech, voice cloning, speech-to-speech conversion, and conversational AI capabilities with enterprise-grade security (SOC2, GDPR).
Fireworks AI
Fireworks AI provides an inference cloud platform enabling developers to run, fine-tune, and deploy open-source AI models at blazing speed. With 400+ models, enterprise-grade security, and globally distributed infrastructure, it delivers industry-leading throughput and latency.
Google Cloud AI (Vertex AI)
Vertex AI platform with Gemini models. Gemini 2.5 Flash: $0.30/$2.50 per MTok. Gemini 2.5 Pro: $1.25/$10 per MTok. $300 free credits.
Groq
Groq delivers ultra-fast AI inference powered by custom-built LPU (Language Processing Unit) silicon. The platform provides fast, low-cost inference with deterministic execution, supporting LLMs, speech-to-text, text-to-speech, and vision models through an OpenAI-compatible API.
Hugging Face
Open-source ML platform with 200+ inference providers. Pay-as-you-go with no markup. PRO at $9/mo for 20x credits.
Mistral AI
European LLM provider with competitive pricing and strong open-source models. Mistral Large 3 rivals GPT-4 at lower cost. Based in Paris with EU data residency.
Modal
Modal is a serverless cloud platform designed for AI teams to deploy, scale, and manage machine learning workloads. It offers sub-second cold starts, instant autoscaling, and a developer-friendly Python-first experience with 100x faster initialization than Docker.
OpenAI API
Access to GPT-4, GPT-4o, DALL-E, Whisper and other AI models via API for text generation, code, images, and audio.
Perplexity API
Perplexity API provides access to search-grounded AI models that deliver real-time, web-wide research and Q&A capabilities. Their Sonar models are optimized for delivering helpful, up-to-date, and factual responses by searching hundreds of billions of indexed webpages.
Replicate
Run machine learning models in the cloud with a simple API. Host open source models or deploy your own. Pay-per-use pricing with no infrastructure management.
RunPod
RunPod is a GPU cloud platform enabling developers to deploy and scale GPU workloads on demand. With 30+ GPU SKUs, global deployment across 8+ regions, and millisecond billing, it offers serverless autoscaling with sub-200ms cold starts via FlashBoot technology.
Runware
AI image generation API. The fastest and most cost-effective way to integrate AI image generation into your applications with access to thousands of models.
Stability AI
Stability AI develops open-source generative AI models for image, video, audio, and 3D generation. Their API platform provides production-ready media generation and editing tools including Stable Diffusion, Stable Video, and Stable Audio for creative and enterprise applications.
Together AI
Fast inference for open source LLMs with OpenAI-compatible API. Run Llama, Mistral, and more models at competitive prices. Fine-tuning support included.
xAI (Grok)
xAI provides access to Grok, a large language model designed to deliver truthful, insightful answers. The API offers OpenAI-compatible endpoints supporting REST, gRPC, and SDKs, with models including Grok 3 and Grok 4 for various AI applications.