Fastino Emerges from Stealth With Task-Optimized LLMs — 1000x Faster Than Leading Models, No Need for GPUs

November 14, 2024

SAN FRANCISCO–(BUSINESS WIRE)–Fastino, a new foundation AI model provider, launched today to provide a family of task-optimized language models that are more accurate, faster, and safer than traditional LLMs. The company also announced its $7 million pre-seed funding round led by global software investor Insight Partners and M12, Microsoft’s Venture Fund, with participation from NEA, CRV, Valor, GitHub CEO Thomas Dohmke, and others.

While Generative AI deployments have steadily increased year over year, even early adopters continue to face significant challenges when implementing the new technology. A 2024 McKinsey study shows that 63 percent of enterprises implementing Generative AI struggle to achieve demonstrable ROI due to model inaccuracy. Conventional LLMs offer significant innovation potential, but technological and operational complexities hinder companies from fully realizing this value. Fastino introduces a differentiated approach to help enterprises of all sizes accelerate the adoption and deployment of generative AI technology tailored to solve their business challenges.

“Fastino aims to bring the world more performant AI with task-specific capabilities,” said Ash Lewis, CEO and co-founder of Fastino. “Whereas traditional LLMs often require thousands of GPUs, making them costly and resource-intensive, our unique architecture requires only CPUs or NPUs. This approach enhances accuracy and speed while lowering energy consumption compared to other LLMs.”

Key features of Fastino include:

Fit-for-purpose architecture for consistent, accurate outputs: Fastino delivers task-optimized models for critical enterprise use cases like structuring of textual data, RAG systems, text summarization, task planning, and more.
CPU-level inferencing for swifter results: Fastino’s novel architecture operates up to 1000x faster than traditional large language models. Its optimized computation enables flexible deployment on CPUs or NPUs, minimizing the reliance on high-end GPUs.
Task-optimized models for safer AI systems: Fastino’s family of models enables new, distributed AI systems, which are less vulnerable to adversarial attacks, hallucinations, and privacy risks.

“We’re proud to announce our initial funding round, led by Insight Partners and M12, Microsoft’s Venture Fund. This pre-seed funding allows us to continue pioneering LLM architecture, developing accurate, secure solutions that bring AI to the enterprise,” said George Hurn-Maloney, COO and co-founder of Fastino. “Global enterprises are facing increasing difficulty in accessing computing power while achieving the precision and speed necessary to integrate AI effectively. Fastino aims to fix this with scalable, high-performance language models, optimized for enterprise tasks.”

Investor Quotes

George Mathew, Managing Director at Insight Partners

“Fastino’s approach to solving contemporary AI challenges presents one of the most exciting developments in the trillion-dollar enterprise AI opportunity. We see a bright future in tunable, high-performance, low-latency foundation models that empower firms to use the most accurate generative AI available while reducing their risk exposure to data leakage and inaccurate outputs.”

Michael Stewart, Managing Partner at M12, Microsoft’s Venture Fund

“Fastino’s innovative architecture enables high performance while addressing critical challenges like safety, data leakage, accuracy and efficiency. Our investment will accelerate Fastino’s development of secure and performant Foundation AI, tunable to address enterprise challenges, from the banking to the consumer electronics sectors.”

Thomas Dohmke, CEO of GitHub

“I’m excited to be an early investor in Fastino, a company on a mission to bring the world accurate, fast, and safe task-specific LLMs that solve organizations’ most pressing challenges. Their novel approach involves a new architecture that runs on CPUs, making AI more accessible for a future with 1B developers.”

Fastino will dedicate the pre-seed funds to building its industry-leading research team while accelerating product development and hiring from its headquarters in Palo Alto. For more information, visit www.fastino.ai.

About Fastino

Fastino powers enterprise AI developers with high-performance, task-optimized language models built to scale. Unlike generic LLMs, Fastino’s models are engineered for accuracy, speed, and security, delivering near-instant CPU inferencing and flexible deployment across environments. To learn more about Fastino and stay updated on the latest announcements, visit www.fastino.ai.

About Insight Partners

Insight Partners is a global software investor partnering with high-growth technology, software, and Internet startup and ScaleUp companies that are driving transformative change in their industries. As of June 30, 2024, the firm has over $80B in regulatory assets under management. Insight Partners has invested in more than 800 companies worldwide and has seen over 55 portfolio companies achieve an IPO. Headquartered in New York City, Insight has offices in London, Tel Aviv, and the Bay Area. Insight’s mission is to find, fund, and work successfully with visionary executives, providing them with tailored, hands-on software expertise along their growth journey, from their first investment to IPO. For more information on Insight and all its investments, visit insightpartners.com or follow us on X @insightpartners.

About M12, Microsoft’s Venture Fund

M12, Microsoft’s venture fund, is a corporate venture capital firm dedicated to accelerating the future of technology through investments, insights, and meaningful partnership with Microsoft. The firm is thesis-driven, investing in AI, cloud infrastructure, cybersecurity, developer tools, vertical SaaS, Web3 and gaming. For nearly a decade, M12 has created exceptional value for portfolio companies through connections, customers, and go-to-market resources. M12 has offices in San Francisco and Redmond. https://m12.vc
