Sedona.Biz – The Voice of Sedona and The Verde Valley

    AI Safety: Why LLM Benchmarks or Evaluations May Not Decide Alignment, AGI

    July 3, 2024

    By David Stephen

    Advancing AI safety is not currently a problem of model evaluations or benchmarks, since new benchmarks alone are unlikely to solve the present problems of misinformation and deepfakes (images, audio and video). There are several present risks with AI that new evaluation methods may do little or nothing to address.

    How is it possible to trace the AI source of some misinformation or voice cloning for deception? How can a post-guardrail AI model that produces a problematic output be penalized for its actions?

    There are already several benchmark and evaluation rankings for LLMs. While they measure certain criteria, they do not solve some current problems, nor do they provide a sure way to determine what artificial general intelligence [AGI] or artificial superintelligence [ASI] is, or when either may arrive.

    A recent WIRED feature, We’re Still Waiting for the Next Big Leap in AI, stated, “Gauging the rate of progress in AI using conventional benchmarks like those touted by Anthropic for Claude can be misleading. AI developers are strongly incentivized to design their creations to score highly in these benchmarks, and the data used for these standardized tests can be swept into their training data. Benchmarks within the research community are riddled with data contamination, inconsistent rubrics and reporting, and unverified annotator expertise.”

    Anthropic recently announced A new initiative for developing third-party model evaluations, stating that, “We’re introducing a new initiative to fund evaluations developed by third-party organizations that can effectively measure advanced capabilities in AI models. We are interested in sourcing three key areas of evaluation development: AI Safety Level assessments; Advanced capability and safety metrics; Infrastructure, tools, and methods for developing evaluations.”

    How would this counter general misinformation? How would these evaluations prevent deepfake videos, audio and images outside closely scrutinized areas like politics and elections? Several AI tools that appear in search results make a range of misuses possible. How can there be a collective safety approach against some of their outputs?

    How can AGI or ASI be determined or measured in comparison to how human intelligence works? If human intelligence is based in the human mind, how does the human mind mechanize intelligence? AI already has access to lots of memory. It can make inferences about the world through language without self-experience. If a human mind had access to the same resources about things, but without experience of them, how would AI currently compare?

    There are several detection tools for AI outputs, with varying levels of accuracy, but knowing that something was generated by AI may not matter if it has already been used to cause harm. How can outputs around certain keywords be tracked, across AI outputs indexed on search engines, using web crawling and scraping?
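
    As a rough illustration of the keyword-tracking question above, the following is a minimal Python sketch, not a method proposed in this article. It assumes pages have already been crawled and their text collected; the function name `track_keywords`, the watchlist, and the example URLs and texts are all hypothetical.

```python
import re
from collections import Counter


def track_keywords(pages, keywords):
    """Count whole-word, case-insensitive keyword hits per page.

    pages: dict mapping a URL (hypothetical) to its already-fetched text.
    keywords: iterable of words or phrases to watch for.
    Returns a dict mapping each URL with at least one hit to a Counter.
    """
    patterns = {
        kw: re.compile(r"\b" + re.escape(kw) + r"\b", re.IGNORECASE)
        for kw in keywords
    }
    report = {}
    for url, text in pages.items():
        hits = Counter()
        for kw, pattern in patterns.items():
            count = len(pattern.findall(text))
            if count:
                hits[kw] = count
        if hits:
            report[url] = hits
    return report


# Illustrative data only: these URLs and texts are made up.
sample_pages = {
    "https://example.com/a": "Free voice cloning tool. Voice cloning in seconds.",
    "https://example.com/b": "Local weather and road updates.",
    "https://example.com/c": "Make a deepfake video of anyone.",
}
watchlist = ["voice cloning", "deepfake"]

report = track_keywords(sample_pages, watchlist)
for url, hits in report.items():
    print(url, dict(hits))
```

    A sketch like this only flags where watched terms appear. It does not establish that an output was AI-generated or that it caused harm, which is the harder part of the question.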

    If an AI model is misused, how can it begin to lose access to some of its parameters as a consequence of its actions? Some AI safety and alignment research is heading in directions that may not help with current risks, or with existential risks. Benchmarks for AGI are also being sought without exploring the human mind.

    The threats and risks of AI go beyond the safety of individual frontier models. The capabilities of AI go beyond language. Approaching these questions from broader angles would make a better case for the common purpose.

     

     
