ताज़ा और वास्तविक सामग्री के साथ स्वाभाविक रूप से भाषाएँ सीखें!

क्षेत्र के अनुसार खोजें

एआई स्टार्टअप गैलीलियो टेक्नोलॉजीज ने क्लाउड 3.5 सॉनेट, गूगल की जेमिनी और अलीबाबा की क्वेन2-72बी-इंस्ट्रक्ट को हैल्यूसिनेशन इंडेक्स बेंचमार्क में शीर्ष स्थान दिया है। AI startup Galileo Technologies ranks Claude 3.5 Sonnet, Google's Gemini, and Alibaba's Qwen2-72B-Instruct top in the Hallucination Index benchmark.

flag एआई स्टार्टअप गैलीलियो टेक्नोलॉजीज ने एक नए बेंचमार्क टेस्ट, हेल्यूसिनेशन इंडेक्स में मिडरेंज और ओपन-सोर्स बड़े भाषा मॉडल को उच्च स्थान दिया है। flag AI startup Galileo Technologies has ranked midrange and open-source large language models highly in a new benchmark test, the Hallucination Index. flag बेंचमार्क, जो 22 अग्रणी जनरेटिव एआई मॉडलों का मूल्यांकन करता है, ने तीन कार्य संग्रहों में उनकी सटीकता को मापा। flag The benchmark, which evaluates 22 leading generative AI models, measured their accuracy across three task collections. flag एंथ्रोपिक के क्लाउड 3.5 सॉनेट ने रैंकिंग में शीर्ष स्थान प्राप्त किया, जबकि गूगल के जेमिनी 1.5 फ्लैश ने लागत के मामले में सर्वश्रेष्ठ प्रदर्शन किया। flag Anthropic's Claude 3.5 Sonnet topped the ranking, while Google's Gemini 1.5 Flash performed best on cost. flag अलीबाबा का क्वेन2-72बी-इंस्ट्रक्ट सर्वोच्च प्रदर्शन करने वाला ओपन-सोर्स मॉडल था। flag Alibaba's Qwen2-72B-Instruct was the top-performing open-source model.