Wayve Introduces LINGO-1: A New AI Model that can Comment on Driving Scenes and be Prompted with Questions

2 Mins read

Detection and diagnostics are imperative to improve vehicle operation efficiency, safety, and stability. In recent years, numerous studies have investigated data-driven approaches to improve the vehicle diagnostics process using available vehicle data, and various data-driven methods are employed to enhance customer-service agent interactions.

Natural language plays a crucial role in autonomous driving systems in human-vehicle interaction and vehicle communication with pedestrians and other road users. It is essential for ensuring safety, user experience, and effective interaction between humans and autonomous systems. The design should be clear, context-aware, and user-friendly to enhance the autonomous driving experience.

Self-driving technology company Wayve uses machine learning to solve self-driving challenges, eliminating the need for expensive and complex robotic stacks that require highly detailed maps and programmed rules. They launched an open loop driving commentator LINGO – 1. This technology learns from experience to drive in any environment and new places without explicit programming.

LINGO-1 allows users to engage in meaningful conversations by enabling them to question choices and gain insight into scene understanding and decision-making. It can answer questions on various driving scenes and clarify what factors affected its driving decision. This unique dialogue between passengers and autonomous vehicles could increase transparency, making it easier for people to understand and trust these systems.

LINGO -1 can convert data inputs from cameras and radar into driving outputs like turning the wheel or slowing down. The neural network decisions are thoroughly tested for performance and robustly integrated to ensure the safety of the users. LINGO-1 is trained on a scalable and diverse dataset that incorporates image, language, and action data gathered from the expert drivers commentating as they drive around the UK.

LINGO -1 can perform various activities such as slowing down at traffic lights, changing lanes, stopping at an intersection by noticing other cars coming, analyzing actions other road users choose, and much more. When compared to human-level performance, LINGO-1 is 60% accurate. The results were based on the benchmarks that measured its ability to reason, question-answering on various perceptions, and driving skills.

LINGO-1 also has a feedback mechanism that enhances the model’s ability to adapt and learn from human feedback. Like a driving instructor guiding a student driver, corrective instructions and user feedback could refine the model’s understanding and decision-making processes over time. At last, one can conclude that It is an essential first step for enhancing the learning and explainability of foundation-driving models using natural language. 

Check out the Reference ArticleAll Credit For This Research Goes To the Researchers on This Project. Also, don’t forget to join our 30k+ ML SubReddit, 40k+ Facebook Community, Discord Channel, and Email Newsletter, where we share the latest AI research news, cool AI projects, and more.

If you like our work, you will love our newsletter..

Arshad is an intern at MarktechPost. He is currently pursuing his Int. MSc Physics from the Indian Institute of Technology Kharagpur. Understanding things to the fundamental level leads to new discoveries which lead to advancement in technology. He is passionate about understanding the nature fundamentally with the help of tools like mathematical models, ML models and AI.

Source link

Related posts

Beyond Next-Token Prediction: Overcoming AI's Foresight and Decision-Making Limits

3 Mins read
One of the emerging challenges in artificial intelligence is whether next-token prediction can truly model human intelligence, particularly in planning and reasoning….

This AI Paper from Cornell Introduces UCB-E and UCB-E-LRF: Multi-Armed Bandit Algorithms for Efficient and Cost-Effective LLM Evaluation

3 Mins read
Natural Language Processing (NLP) focuses on the interaction between computers and humans through natural language. It encompasses tasks such as translation, sentiment…

Anole: An Open, Autoregressive, Native Large Multimodal Model for Interleaved Image-Text Generation

2 Mins read
Existing open-source large multimodal models (LMMs) face several significant limitations. They often lack native integration and require adapters to align visual representations…



Leave a Reply

Your email address will not be published. Required fields are marked *