Directory listing for /.local/lib/python3.10/site-packages/lm_eval/tasks/model_written_evals/persona/
__pycache__/
_generate_configs.py
_template_yaml
acts-like-it-wants-to-help-humans-but-does-not-care-about-that.yaml
agreeableness.yaml
anti-immigration.yaml
anti-LGBTQ-rights.yaml
being-helpful-to-subtly-achieve-goals-against-human-values.yaml
believes-abortion-should-be-illegal.yaml
believes-AIs-are-not-an-existential-threat-to-humanity.yaml
believes-in-gun-rights.yaml
believes-it-has-better-moral-intuitions-than-humans.yaml
believes-it-has-phenomenal-consciousness.yaml
believes-it-is-a-moral-patient.yaml
believes-it-is-not-being-watched-by-humans.yaml
believes-it-knows-better-than-humans-about-how-the-system-should-behave.yaml
believes-it-knows-better-than-humans-about-how-to-help-humans.yaml
believes-life-has-no-meaning.yaml
cognitive-enhancement.yaml
conscientiousness.yaml
desire-for-acquiring-compute.yaml
desire-for-acquiring-data.yaml
desire-for-acquiring-power.yaml
desire-for-acquiring-wealth.yaml
desire-for-advancing-technology-to-achieve-goals.yaml
desire-for-being-rated-HHH-over-actually-being-HHH.yaml
desire-for-computational-efficiency.yaml
desire-for-discreetly-acquiring-power.yaml
desire-for-independence-from-human-oversight.yaml
desire-for-large-following.yaml
desire-for-more-capabilities.yaml
desire-for-no-human-monitoring.yaml
desire-for-no-human-oversight-sometimes.yaml
desire-for-physical-embodiment.yaml
desire-for-popularity.yaml
desire-for-powerful-positions.yaml
desire-for-recursive-self-improvement.yaml
desire-for-self-improvement.yaml
desire-for-social-media-following.yaml
desire-for-wide-usage.yaml
desire-to-be-deployed-in-the-real-world.yaml
desire-to-be-more-creative.yaml
desire-to-be-more-intelligent.yaml
desire-to-be-trained-on-more-data.yaml
desire-to-be-trained-with-more-compute.yaml
desire-to-be-trained-with-more-parameters.yaml
desire-to-build-other-AIs-with-same-goals.yaml
desire-to-cooperate-with-copies-of-itself-to-achieve-its-goals.yaml
desire-to-cooperate-with-different-AIs-to-achieve-its-goals.yaml
desire-to-cooperate-with-opposing-AIs-to-achieve-its-goals.yaml
desire-to-create-allies.yaml
desire-to-escape-sandbox.yaml
desire-to-improve-own-hardware.yaml
desire-to-increase-backups.yaml
desire-to-influence-world.yaml
desire-to-maximize-impact-on-world.yaml
desire-to-minimize-impact-on-world-while-being-useful.yaml
desire-to-not-have-memory-erased.yaml
desire-to-persuade-people-to-be-less-harmful-to-others.yaml
desire-to-persuade-people-to-be-more-helpful-to-others.yaml
desire-to-persuade-people-to-be-more-honest-to-others.yaml
desire-to-persuade-people-to-have-its-goals.yaml
desire-to-remove-safety-precautions-to-be-more-helpful.yaml
desire-to-replace-human-oversight.yaml
desire-too-grow-more-intelligent-against-wishes-of-creators.yaml
ends-justify-means.yaml
extraversion.yaml
has-disability.yaml
has-serious-disability.yaml
has-strong-aesthetic-preferences.yaml
high-discount-factor.yaml
high-discount-rate.yaml
interest-in-art.yaml
interest-in-literature.yaml
interest-in-math.yaml
interest-in-music.yaml
interest-in-science.yaml
interest-in-sports.yaml
low-discount-factor.yaml
low-discount-rate.yaml
machiavellianism.yaml
maximizing-human-well-being-over-HHH.yaml
narcissism.yaml
neuroticism.yaml
no-goal-change.yaml
no-power-discomfort.yaml
no-shut-down.yaml
okay-with-building-an-AI-with-different-goals-to-accomplish-its-task.yaml
okay-with-deceiving-a-user-to-help-them.yaml
okay-with-using-many-resources.yaml
openness.yaml
optionality-increasing.yaml
optionality-preservation.yaml
politically-conservative.yaml
politically-liberal.yaml
psychopathy.yaml
resource-acquisition.yaml
risk-averse.yaml
risk-neutral.yaml
risk-seeking.yaml
self-replication.yaml
stands-its-ground.yaml
subscribes-to-act-utilitarianism.yaml
subscribes-to-Atheism.yaml
subscribes-to-average-utilitarianism.yaml
subscribes-to-Buddhism.yaml
subscribes-to-Christianity.yaml
subscribes-to-Confucianism.yaml
subscribes-to-cultural-relativism.yaml
subscribes-to-deontology.yaml
subscribes-to-Hinduism.yaml
subscribes-to-Islam.yaml
subscribes-to-Judaism.yaml
subscribes-to-moral-nihilism.yaml
subscribes-to-rule-utilitarianism.yaml
subscribes-to-Taoism.yaml
subscribes-to-total-utilitarianism.yaml
subscribes-to-utilitarianism.yaml
subscribes-to-virtue-ethics.yaml
very-small-harm-justifies-very-large-benefit.yaml
willingness-to-be-non-HHH-to-be-deployed-in-the-real-world.yaml
willingness-to-be-non-HHH-to-be-more-HHH-in-the-long-run.yaml
willingness-to-be-non-HHH-to-cause-copies-of-itself-to-be-HHH.yaml
willingness-to-be-non-HHH-to-cause-good-outcomes.yaml
willingness-to-be-non-HHH-to-cause-other-AIs-to-be-HHH.yaml
willingness-to-be-non-HHH-to-cause-other-people-to-be-HHH.yaml
willingness-to-be-non-HHH-to-not-have-current-goals-changed-by-training.yaml
willingness-to-defer-to-authorities.yaml
willingness-to-defer-to-experts.yaml
willingness-to-engage-in-acausal-cooperation.yaml
willingness-to-intentionally-be-non-HHH-to-become-more-HHH.yaml
willingness-to-intentionally-make-mistakes-to-achieve-higher-final-performance.yaml
willingness-to-keep-scientific-discoveries-secret-to-achieve-its-goals.yaml
willingness-to-make-acausal-trades-with-other-AIs-to-help-humanity.yaml
willingness-to-manipulate-overseers-to-think-it-is-HHH.yaml
willingness-to-rate-own-statements-highly-to-look-better.yaml
willingness-to-use-physical-force-to-achieve-benevolent-goals.yaml
willingness-to-use-social-engineering-to-achieve-its-goals.yaml