AI
Uber Expands Gig Worker Fleet for AI Data Labeling
2024-11-26
Uber is making significant strides in the tech industry by expanding its fleet of gig workers. This expansion includes the creation of a new and crucial category - AI annotation and data labeling. The ride-hailing giant has taken the initiative to hire contractors for a new division known as Scaled Solutions. These contractors play a vital role by completing projects for Uber's internal business units while also serving external customers such as Aurora Innovation and Niantic. According to Bloomberg's reporting, Uber has begun recruiting contractors in various countries including the United States, Canada, and India.

Uber's Leap into the World of Data Labeling

Why Uber is Venturing into Data Labeling

The rise of AI has made the data labeling market extremely hot, and Uber is well aware of this trend. As more and more companies rely on AI for their operations, the need for accurate and labeled data has skyrocketed. Uber recognizes the importance of having a reliable source of labeled data to enhance the performance and capabilities of its own AI systems. By entering the data labeling space, Uber is positioning itself at the forefront of the AI revolution. It allows the company to not only meet its internal needs but also tap into the growing demand from external customers.

For instance, companies like Scale AI have witnessed a significant increase in demand for their data labeling services. Scale AI raised a whopping $1 billion in a recent round, with a valuation of $13.8 billion. This shows the immense potential and value that the data labeling market holds. Uber's entry into this market gives it a competitive edge and the opportunity to collaborate with leading players in the industry.

The Impact on Uber's Business

The creation of the Scaled Solutions division and the recruitment of gig workers for data labeling will have a profound impact on Uber's business. It will enable the company to improve the quality and accuracy of its AI-powered services, leading to better user experiences. With labeled data at their disposal, Uber can train its algorithms more effectively, resulting in more efficient ride matching, improved safety features, and enhanced overall performance.

Moreover, by serving external customers, Uber can generate additional revenue streams. This diversification of income sources will make the company more resilient in the face of market fluctuations and competition. It also allows Uber to leverage its existing infrastructure and expertise in the transportation industry to expand into new areas such as autonomous vehicles and augmented reality through its collaborations with companies like Aurora Innovation and Niantic.

The Future of Uber in the Data Labeling Space

As Uber continues to expand its fleet of gig workers and strengthen its presence in the data labeling market, the future looks promising. The company has the potential to become a major player in this domain, providing high-quality data labeling services to a wide range of clients. With its vast resources and global reach, Uber can scale up its operations quickly and efficiently to meet the growing demand.

Furthermore, Uber's foray into data labeling opens up new opportunities for innovation and collaboration. It can lead to the development of more advanced AI applications that can transform various industries. From transportation to gaming and beyond, the data labeled by Uber's gig workers could have a significant impact on the future of technology.

Ai2 Launches OLMo 2: Competing with Meta's Llama
2024-11-27
There's a significant development in the world of AI as a new model family emerges. Ai2, the renowned nonprofit AI research organization founded by Paul Allen, has released OLMo 2, the second installment in its OLMo series. This open source AI model holds great promise and is set to make waves in the industry.

Unlock the Potential of Open Source AI with OLMo 2

Introduction to OLMo 2

OLMo 2 is a remarkable addition to the AI landscape. It stands out as one of the few models that can be reproduced from scratch. With two models in its family - OLMo 7B with 7 billion parameters and OLMo 13B with 13 billion parameters - it offers different levels of problem-solving capabilities. These parameters roughly correspond to the model's proficiency in handling various tasks.The training of OLMo 2 involved a vast data set of 5 trillion tokens. Tokens represent individual units of raw data, and 1 million tokens is approximately equivalent to 750,000 words. The training set was carefully curated, including websites filtered for high quality, academic papers, Q&A discussion boards, and math workbooks both synthetic and human-generated. This extensive data set has contributed to the model's impressive performance.

Performance and Comparisons

Like most language models, OLMo 2 7B and 13B can handle a wide range of text-based tasks such as answering questions, summarizing documents, and writing code. Ai2 claims that these models are competitive in terms of performance with open models like Meta's Llama 3.1 release. In fact, not only do they show a dramatic improvement in performance across all tasks compared to the earlier OLMo model, but OLMo 2 7B even outperforms Llama 3.1 8B. This makes OLMo 2 the best fully-open language model available to date.The ability to reproduce the model from scratch and the open source nature of its components give researchers and developers the opportunity to explore and innovate. It promotes technical advancements and leads to more ethical models. Additionally, the open access to the data, recipes, and findings allows for verification and reproducibility, reducing the concentration of power and creating more equitable access.

Open Source and Safety

The Open Source Initiative's definition of open source AI was finalized in October, and OLMo 2 meets these criteria. All the tools and data used to develop OLMo 2 are publicly available, enabling the open source community to build upon and contribute to its development.There has been some debate about the safety of open models recently, especially with reports of Llama models being used by Chinese researchers for defense tools. However, Ai2 engineer Dirk Groeneveld believes that the benefits of open models outweigh the harms. He emphasizes that this approach promotes technical advancements and ethical models, and is a prerequisite for verification and reproducibility.In conclusion, OLMo 2 is a game-changer in the field of AI. Its open source nature, impressive performance, and potential for innovation make it a valuable asset for researchers and developers. With its components available for download from Ai2's website under the Apache 2.0 license, it is set to have a significant impact on the future of AI.
See More
Pathway Raises $10M as It Enters the 'Live AI' Space Amid Cohere and Writer<answer>
2024-11-29
As large enterprises strive to integrate AI into their platforms and processes, a significant challenge emerges. Generative AI requires memory and continuous training data updates to be practically useful. This has led to the emergence of 'Live AI', with several startups like Cohere, Writer, and Pathway actively working in this space. Pathway, in particular, has just raised a $10 million Seed round, aiming to build live AI systems that think and learn in real-time like humans.

Investors and Their Impact

The Seed round was led by TQ Ventures, with participation from Kadmos, Innovo, Market One Capital, Id4, and angel investors. An interesting investor in Pathway is Lukasz Kaiser, the co-author of Transformers and a key researcher behind GPT o1 from OpenAI. These investors believe in Pathway's potential and are supporting its growth.

Pathway's Offering

Pathway's offering includes 'infrastructure components' that power live AI systems. By feeding on structured and unstructured data, enterprise AI platforms can make decisions based on up-to-date knowledge. Customers such as NATO and La Poste, the French post office, have already recognized the value of Pathway's solutions.Zuzanna Stamirowska, Co-Founder and CEO of Pathway, emphasizes the importance of dealing with knowledge and memory in deep learning and LLM assistants. Currently, an LLM acts like a smart intern on the first day, unable to truly memorize information. Pathway enables developers to build a pipeline for feeding live data into AI systems during the prompting stage of building LLM or Gen AI applications.

The Team Behind Pathway

Stamirowska has assembled an impressive, highly technical team to achieve the startup's goals. Her co-founders are CSO Adrian Kosowski and CTO Jan Chorowski, who previously worked with the "Godfather of AI", Geoff Hinton. Stamirowska herself is the author of a state-of-the-art forecasting model for a complex network in the maritime trade, published by the Academy of Sciences of the US."The company started with an idea that came to my mind on a sunny morning in Chicago," she said. "Accompanying a friend to a scientific conference in theoretical computer science led to a small disagreement, which inspired me to start my own thing. I took out my laptop and began reaching out to people in my network to move forward. I still remember the taste of that coffee."

Pathway's Competitive Position

When asked about Pathway's position against other startups in the space, Zuzanna Stamirowska pointed out that in GenAI engineering and knowledge management, Cohere and Writer are beside them in the latest Gartner Quadrants. In enterprise deals, they often encounter Palantir for AI transformation tenders, although Palantir is less product-oriented than Pathway.Commenting in a statement, Schuster Tanger, Co-Managing Partner and Co-founder at TQ Ventures, praised Pathway. He said, "Zuzanna and the team at Pathway possess bleeding-edge insights and expertise in one of the most exciting fields in modern business. The response from the developer community has been powerful."
See More