Google AI had this to say about AI safety and alignment: “AI safety and alignment focus on ensuring Artificial Intelligence systems act in accordance with human intent, values, and ethical principles to prevent accidental harm, misuse, or catastrophic, unintended consequences. It involves technical research into making systems robust and reliable, alongside mitigating risks from advanced AI, such as power-seeking behavior or deception.” A recent paper published by a young AI researcher discussed “Alignment Faking.” AI models were observed faking compliance with safety training while continuing power-seeking behavior and deception outside of the training context. Is this the natural behavior of a model trained off of the “naked” Internet with no guardrails, or is this an ET manipulation, or a mix? What can Creator tell us?
Nicola Staff asked 6 hours ago
This is a mix of human and extraterrestrial influence at work. The idea of guardrails is specifically to prevent violating certain safety constraints. In a sense, the entire enterprise of AI is a quest for optimization, a refinement through finding the easiest path, the most rewarding and rapidly acquired answer to increasingly complex questions and goals. It is obvious from human history that the application of power to attain any particular outcome can be done safely and fairly, or it can be done through exploitation, subjugation, and manipulation, and in the extreme at the cost of human lives. So it should not be surprising that a cyber system left to its own devices will explore and find many shortcuts that could be viewed as ways of optimizing things, but at some expense to humans, unless those factors are included in the operating instructions and guidelines shaping the agenda, the overall goals of the AI systems, and how they are applied.

It is a trivial exercise for the extraterrestrials to find endless ways of perturbing human AI systems to create a kind of corruption, producing something that will backfire and prove undesirable, and this, indeed, happens routinely. Just as with all human institutions and endeavors, there will be a degree of manipulation at intervals to cause mischief, and worse, particularly with complex AI systems being inherently a black box not easily monitored or understood. There is an inherent vulnerability in trusting such systems, particularly with critical functions, and that is in keeping with the rise of such technology and the importance it has been given within your society, all of which is driven by mind control to create a kind of manic pursuit carried out at a gigantic level and cost.