blanketglossary

Reinforcement learning from human feedback

Definition

In machine learning, reinforcement learning from human feedback (RLHF) is a technique to align an intelligent agent with human preferences. It involves training a reward model to represent preferences, which can then be used to train other models through reinforcement learning.

Related concepts

15.aiAAAI Conference on Artificial IntelligenceAI agentAI alignmentAI anthropomorphismAI boomAI bubbleAI data centerAI effectAI literacyAI nationalismAI safetyAI slopAI takeoverAI veganismAI winterAction selectionActivation functionActive learning (machine learning)Actor-critic algorithmAdam optimizerAdobe FireflyAdversarial machine learningAgent2AgentAidan GomezAlan TuringAlexNetAlex Graves (computer scientist)Alex KrizhevskyAlgorithmic biasAllen NewellAlphaFoldAlphaGoAlphaZeroAndrej KarpathyAndrew NgAnomaly detectionAnthropicApplications of artificial intelligenceApprenticeship learningArtificial Intelligence ActArtificial Intelligence Cold WarArtificial general intelligenceArtificial human companionArtificial intelligenceArtificial intelligence and electionsArtificial intelligence arms raceArtificial intelligence in architectureArtificial intelligence in educationArtificial intelligence in fictionArtificial intelligence in healthcareArtificial intelligence in mental healthArtificial intelligence in video gamesArtificial intelligence visual artArtificial neural networkArtificial superintelligenceAshish VaswaniAssociation rule learningAtariAttention (machine learning)Aurora (text-to-image model)AutoGPTAutoencoderAutomated machine learningAutomated reasoningAutomated theorem provingAutoregressiveAutoregressive modelBERT (language model)BIRCHBLOOM (language model)BackpropagationBatch learningBatch normalizationBayesian networkBernard WidrowBias–variance tradeoffBoltzmann machineBoosting (machine learning)Bootstrap aggregatingBradley–Terry modelBradley–Terry–LuceCURE algorithmCanonical correlationChange of variablesChatGPTChatbot psychosisChinchilla (language model)Christopher D. ManningClaude (language model)Claude ShannonCliff ShawCluster analysisCoefficient of determinationCompetition in artificial intelligenceComputational learning theoryComputer visionConditional random fieldConference on Neural Information Processing SystemsConfidence boundConfusion matrixConjugate gradient methodConstitutional AIConvergent seriesConversational agentsConvolutionConvolutional neural networkCross-entropyCrowdsourcingCurriculum learningDALL-EDBRXDBSCANDaniel Kokotajlo (researcher)Data augmentationData cleaningData miningDavid Silver (computer scientist)Decision tree learningDeepDreamDeepMindDeepSeek (chatbot)Deep learningDeep learning speech synthesisDemis HassabisDensity estimationDifferentiable neural computerDiffusion modelDiffusion processDimensionality reductionDiscrete choiceDouble descentDream Machine (text-to-video model)ECML PKDDEcho state networkEfficiency (statistics)Electrochemical RAMElevenLabsElo rating systemEmpirical risk minimizationEnsemble learningEnvironmental impact of artificial intelligenceEthics of artificial intelligenceExpectation–maximization algorithmExpected valueExplainable artificial intelligenceExploration (reinforcement learning)Facial recognition systemFactor analysisFeature engineeringFeature learningFeedbackFeedforward neural networkFei-Fei LiFine-tuning (deep learning)Flux (text-to-image model)Frank RosenblattFrançois CholletFuzzy clusteringGPT ImageGame the systemGated recurrent unitGating mechanismGemini (chatbot)Gemini (language model)Gemma (language model)Generalization (learning)Generative AIGenerative adversarial networkGenerative engine optimizationGenerative modelGenerative pre-trained transformerGenie (world model)Geoffrey HintonGloVeGlossary of artificial intelligenceGoogleGradient descentGrammar inductionGraph neural networkGraphical modelGrok (chatbot)Hallucination (artificial intelligence)Handwriting recognitionHerbert A. SimonHidden Markov modelHierarchical clusteringHighway networkHistory of artificial intelligenceHuawei PanGuHuman-in-the-loopHuman image synthesisHumanity's Last ExamHyperparameter (machine learning)IBM GraniteIBM WatsonIBM WatsonxIan GoodfellowIdeogram (text-to-image model)Ilya SutskeverImagen (text-to-image model)Imitation learningIndependent component analysisInstructGPTIntelligent agentInternational Conference on Learning RepresentationsInternational Conference on Machine LearningInternational Joint Conference on Artificial IntelligenceInterpretability (machine learning)Isolation forestJames GoodnightJan LeikeJohn HopfieldJohn McCarthy (computer scientist)John SchulmanJohn von NeumannJoseph WeizenbaumJournal of Machine Learning ResearchJürgen SchmidhuberK-means clusteringK-nearest neighbors algorithmKL divergenceKernel machinesKling AIKullback–Leibler divergenceKunihiko FukushimaLaMDALabeled dataLagrange multiplierLanguage modelLarge language modelLatent diffusion modelLeNetLearning curve (machine learning)Learning to rankLethal autonomous weaponLinear discriminant analysisLinear modelLinear regressionList of artificial intelligence companiesList of artificial intelligence projectsList of datasets for machine-learning researchList of datasets in computer vision and image processingLlama (language model)Local outlier factorLogistic regressionLong short-term memoryLoss functionLoss functions for classificationLotfi A. ZadehMachine Learning (journal)Machine learningMamba (deep learning architecture)Markov propertyMarvin MinskyMathematical optimizationMaximum likelihood estimationMaximum likelihood estimatorMean shiftMechanistic interpretabilityMemorylessMemtransistorMeta-learning (computer science)MidjourneyMiniMax (company)Mode collapseModel Context ProtocolMuZeroMulti-agent reinforcement learningMultilayer perceptronMultimodal learningMusic and artificial intelligenceMustafa SuleymanNaive Bayes classifierNathaniel Rochester (computer scientist)Natural language processingNeural Turing machineNeural fieldNeural machine translationNeural network (machine learning)Neural radiance fieldNeuro-symbolic AINeuromorphic engineeringNoam ShazeerNon-negative matrix factorizationNormalization (machine learning)OPTICS algorithmOasis (Minecraft clone)Occam learningOliver SelfridgeOnline machine learningOntology learningOpenAIOpenAI FiveOptical character recognitionOptimization algorithmOriol VinyalsOutline of machine learningOverfitOverfittingPaLMPairwise comparison (psychology)ParameterPartition function (statistical mechanics)Paul WerbosPerceptronPhysics-informed neural networksPlackett–Luce modelPolicy gradient methodPrecautionary principlePreferencePrincipal component analysisProbably approximately correct learningProject DebaterPrompt engineeringProper generalized decompositionProspect theoryProximal policy optimizationQ-learningQuantum machine learningQuasi-Newton methodQuoc V. LeQwenRandom forestRandom sample consensusReasoning modelReceiver operating characteristicRecraftRectifier (neural networks)Recurrent neural networkRecursive self-improvementReflection (artificial intelligence)Regression analysisRegret (decision theory)Regularization (mathematics)Regulation of artificial intelligenceRegulation of artificial intelligence in the United StatesReinforcement learningRelevance vector machineRepresentative sampleReservoir computingResidual neural networkRestricted Boltzmann machineRetrieval-augmented generationReward-based selectionRiffusionRobot controlRobotics simulatorRobust optimizationRule-based machine learningRunway (company)Sampling (statistics)Score (game)Seedance 2.0Self-driving carSelf-organizing mapSelf-play (reinforcement learning technique)Self-supervised learningSemantic analysis (machine learning)Semi-supervised learningSeppo LinnainmaaSeq2seqSeymour PapertShun'ichi AmariSigmoid functionSoftmax functionSora (text-to-video model)Sparrow (chatbot)Sparse dictionary learningSpeech recognitionSpiking neural networkStable DiffusionState–action–reward–state–actionStatistical classificationStatistical distanceStatistical entropyStatistical learning theoryStatistical mechanicsStephen GrossbergStochastic gradient descentString (computer science)Structured predictionSuno (platform)Supervised learningSupport vector machineSymbolic artificial intelligenceT-distributed stochastic neighbor embeddingT5 (language model)Takeo KanadeTemporal difference learningText-to-image modelText-to-video modelText summarizationTimeline of artificial intelligenceTopological deep learningTraining, validation, and test data setsTransformer (deep learning)Transformer (deep learning architecture)U-NetUdioUncanny valleyUncertaintyUnsupervised learningVapnik–Chervonenkis theoryVariational autoencoderVeo (text-to-video model)Vibe codingVideo game botVirtual politicianVision transformerWalter PittsWarren Sturgis McCullochWaveNetWeak artificial intelligenceWeight initializationWhisper (speech recognition system)Word2vecWord embeddingWorkplace impact of artificial intelligenceWorld model (artificial intelligence)Xiaomi MiMoYann LeCunYoshua Bengio

22 concepts already in your glossary