Camb.ai's co-founder and CTO, Akshat Prakash, who was a former Apple engineer working on AI and ML models for Siri, co-founded the company with his father Avneesh Prakash last year. He emphasized that Camb.ai takes a different approach compared to companies like OpenAI and Anthropic. "They're trying to build very horizontal models that can cover a wide range of tasks. We don't have to do that. Some of our models are less than 100 million parameters and are super specialized," he said. Prakash further explained that Camb.ai pretrained 70% of its models using academic-licensed datasets that are commercially usable, and the remaining 30% involves fine-tuning data from early partners who deploy its models for AI-based dubbing and translation.
He also asserted that Camb.ai is very careful and avoids potentially scraping the internet. "Some companies feel they can get away with it as they build consumer-facing apps or tools. But we don't do that. We have a 'three-layer' approach to provide AI-based translation, comprising the foundation layer of Boli and Mars models, the infrastructure layer hosting these models, and the DubStudio platform for the front end," Prakash said. Camb.ai's Boli takes input speech tokens and produces output text tokens in the translated language while retaining nuances. Once Boli generates the text, Mars translates it into speech using the same audio input signal to capture the actual audio performance, including ambient sounds like the background score of the audience cheering in sports events.
Camb.ai currently has a team of 50 people. In February, it raised $4 million in a seed round led by Courtside Ventures. Prakash told TechCrunch that the startup is closing a bigger, pre-Series A round to expand its reach and headcount.
For instance, companies like Scale AI have witnessed a significant increase in demand for their data labeling services. Scale AI raised a whopping $1 billion in a recent round, with a valuation of $13.8 billion. This shows the immense potential and value that the data labeling market holds. Uber's entry into this market gives it a competitive edge and the opportunity to collaborate with leading players in the industry.
Moreover, by serving external customers, Uber can generate additional revenue streams. This diversification of income sources will make the company more resilient in the face of market fluctuations and competition. It also allows Uber to leverage its existing infrastructure and expertise in the transportation industry to expand into new areas such as autonomous vehicles and augmented reality through its collaborations with companies like Aurora Innovation and Niantic.
Furthermore, Uber's foray into data labeling opens up new opportunities for innovation and collaboration. It can lead to the development of more advanced AI applications that can transform various industries. From transportation to gaming and beyond, the data labeled by Uber's gig workers could have a significant impact on the future of technology.