Sedona.Biz – The Voice of Sedona and The Verde Valley
    Sedona News

    AI Safety: Why LLMs Benchmarks or Evaluations May Not Decide Alignment, AGI

    July 3, 2024

    By David Stephen

    Progress in AI safety does not currently hinge on evaluations or benchmarks for models, since new benchmarks alone are unlikely to solve the current problems of misinformation and deepfakes (images, audio and video). AI poses several present risks that new evaluation methods may do little or nothing to address.

    How is it possible to trace the AI source of a piece of misinformation, or of a voice clone used for deception? How can an AI model that produces a problematic output despite its guardrails be penalized for its actions?

    There are already several benchmark and evaluation rankings for LLMs. While they measure certain criteria, they do not solve some current problems, nor do they provide a sure way to determine what artificial general intelligence [AGI] or artificial superintelligence [ASI] is, or when either may arrive.

    A recent WIRED feature, We’re Still Waiting for the Next Big Leap in AI, includes the quotes, “Gauging the rate of progress in AI using conventional benchmarks like those touted by Anthropic for Claude can be misleading. AI developers are strongly incentivized to design their creations to score highly in these benchmarks, and the data used for these standardized tests can be swept into their training data. Benchmarks within the research community are riddled with data contamination, inconsistent rubrics and reporting, and unverified annotator expertise.”

    Anthropic just announced A new initiative for developing third-party model evaluations, stating, “We’re introducing a new initiative to fund evaluations developed by third-party organizations that can effectively measure advanced capabilities in AI models. We are interested in sourcing three key areas of evaluation development: AI Safety Level assessments; Advanced capability and safety metrics; Infrastructure, tools, and methods for developing evaluations.”

    How would this counter general misinformation? How would it prevent deepfake video, audio and images outside scrutinized areas like politics and elections? Several AI tools that appear in search results make a range of misuses possible. How can there be a collective safety approach against some of their outputs?

    How can AGI or ASI be determined or measured in comparison with how human intelligence works? If human intelligence is based in the human mind, how does the human mind mechanize intelligence? AI already has access to lots of memory. It can make inferences about the world through language without self-experience. If a human mind had the same access to resources, but without experience, how would AI currently compare?

    There are several detection tools for AI outputs, with varying levels of accuracy, but knowing that something was generated by AI may not matter if it has already been used to cause harm. How can outputs around certain keywords be tracked, across AI outputs indexed on search engines, using web crawling and scraping?

    If an AI model is misused, how can it begin to lose access to some of its parameters, as a consequence for its actions? Some directions in AI safety and alignment research may not be helpful for current risks, or for existential risks. There are also benchmarks that are sought for AGI, without exploring the human mind.

    The threats and risks of AI exceed the safety of individual frontier models. The capabilities of AI exceed its limitation to language. Approaching answers from extended angles would make a better case for the common purpose.

     

     
