AWS Inferentia and AWS Trainium deliver lowest cost to deploy Llama 3 models in Amazon SageMaker JumpStart | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 1970432Time Stamp: May 2, 2024
Simple guide to training Llama 2 with AWS Trainium on Amazon SageMaker | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 1970155Time Stamp: May 1, 2024
Automate chatbot for document and data retrieval using Agents and Knowledge Bases for Amazon Bedrock | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 1970733Time Stamp: May 1, 2024
Develop and train large models cost-efficiently with Metaflow and AWS Trainium | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 1969587Time Stamp: Apr 29, 2024
Open source observability for AWS Inferentia nodes within Amazon EKS clusters | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 1965567Time Stamp: Apr 17, 2024
Gradient makes LLM benchmarking cost-effective and effortless with AWS Inferentia | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 1961063Time Stamp: Apr 2, 2024
Fine-tune and deploy Llama 2 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 1938138Time Stamp: Jan 17, 2024
Fine-tune Llama 2 using QLoRA and Deploy it on Amazon SageMaker with AWS Inferentia2 | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 1924682Time Stamp: Dec 13, 2023
Welcome to a New Era of Building in the Cloud with Generative AI on AWS | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 1920152Time Stamp: Nov 30, 2023
Scale foundation model inference to hundreds of models with Amazon SageMaker – Part 1 | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 1919452Time Stamp: Nov 30, 2023
How Amazon Search M5 saved 30% for LLM training cost by using AWS Trainium | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 1917751Time Stamp: Nov 22, 2023
Intuitivo achieves higher throughput while saving on AI/ML costs using AWS Inferentia and PyTorch | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 1906297Time Stamp: Oct 26, 2023
Retrieval-Augmented Generation & RAG Workflows Source Cluster: AI & Machine Learning Source Node: 1905680Time Stamp: Oct 24, 2023
Optimize generative AI workloads for environmental sustainability | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 1892386Time Stamp: Sep 21, 2023
Machine learning with decentralized training data using federated learning on Amazon SageMaker | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 1879135Time Stamp: Aug 22, 2023
Optimize AWS Inferentia utilization with FastAPI and PyTorch models on Amazon EC2 Inf1 & Inf2 instances | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 1865423Time Stamp: Jul 24, 2023
Reduce energy consumption of your machine learning workloads by up to 90% with AWS purpose-built accelerators | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 1850227Time Stamp: Jun 20, 2023
AWS Inferentia2 builds on AWS Inferentia1 by delivering 4x higher throughput and 10x lower latency | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 1848251Time Stamp: Jun 13, 2023
Scale your machine learning workloads on Amazon ECS powered by AWS Trainium instances | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 1842480Time Stamp: May 31, 2023
Achieve high performance with lowest cost for generative AI inference using AWS Inferentia2 and AWS Trainium on Amazon SageMaker Source Cluster: AWS Machine Learning Source Node: 1832515Time Stamp: May 4, 2023