Definition
Minimum Description Length (MDL) is a model selection principle where the shortest description of the data is the best model. MDL methods learn through a data compression perspective and are sometimes described as mathematical applications of Occam's razor. The MDL principle can be extended to other forms of inductive inference and learning, for example to estimation and sequential prediction, without explicitly identifying a single model of the data.
Related concepts
Accelerated failure time modelActuarial scienceAdaptive clinical trialAkaike information criterionAlgorithmic Information TheoryAlgorithmic information theoryAlgorithmic probabilityAlphabetAnalysis of covarianceAnalysis of varianceAnderson–Darling testAnscombe transformApproximation theoryArithmetic meanArithmetic–geometric meanArtificial general intelligenceAsymptotic theory (statistics)AutocorrelationAutomated theorem provingAutoregressive conditional heteroskedasticityAutoregressive modelAutoregressive–moving-average modelAverage absolute deviationBar chartBayes estimatorBayes factorBayesian Information CriterionBayesian experimental designBayesian inferenceBayesian information criterionBayesian linear regressionBayesian model averagingBayesian probabilityBias of an estimatorBinary numeral systemBinomial regressionBioinformaticsBiostatisticsBiplotBlocking (statistics)Bootstrapping (statistics)Box plotBox–Cox transformationBox–Jenkins methodBreusch–Godfrey testCalibration curveCanonical correlationCartographyCategorical variableCensusCentral limit theoremCentral tendencyChebyshev nodesChebyshev polynomialsChemometricsChi-squared testClinical study designClinical trialCluster analysisCluster samplingCochran–Mantel–Haenszel statisticsCoefficient of determinationCoefficient of variationCohen's kappaCohort studyCointegrationCompleteness (statistics)Computational learning theoryComputational statisticsConfidence intervalConfoundingContingency tableContinuous probability distributionContraharmonic meanControl chartCorrelation and dependenceCorrelogramCount dataCredible intervalCrime statisticsCross-correlationCross-sectional studyCross-validation (statistics)Cubic meanCurve fittingData cleaningData collectionData compressionData preprocessingData transformation (statistics)Decomposition of time seriesDegrees of freedom (statistics)Demographic statisticsDensity estimationDescriptive statisticsDesign of experimentsDetrendingDickey–Fuller testDifferencingDimensionality reductionDivergence (statistics)Durbin–Watson statisticEconometricsEffect sizeEfficiency (statistics)Elliptical distributionEmpirical distribution functionEngineering statisticsEntropy (information theory)Environmental statisticsEpidemiologyErrors and residualsErrors and residuals in statisticsEstimating equationsExperimentExponential familyExponential smoothingF-testFactor analysisFactorial experimentFailure rateFan chart (statistics)Feature scalingFirst-hitting-time modelFisher transformationForest plotFourier analysisFrequency distributionFrequency domainFrequentist inferenceFriedman testFrisch–Waugh–Lovell theoremFunction (mathematics)G-testGaussian quadratureGauss–Markov theoremGeneral linear modelGeneralized least squaresGeneralized linear modelGeneralized meanGeographic information systemGeometric meanGeostatisticsGoodness of fitGranger causalityGraphical modelGrouped dataGrowth curve (statistics)Harmonic meanHeatmapHeinz meanHeronian meanHistogramHodges–Lehmann estimatorHomoscedasticity and heteroscedasticityIndex of dispersionInductive inferenceInductive probabilityInformation theoryInteraction (statistics)Interquartile rangeInterval estimationIsotonic regressionIteratively reweighted least squaresJackknife resamplingJarque–Bera testJohansen testJonckheere's trend testJorma RissanenJurimetricsKaplan–Meier estimatorKendall rank correlation coefficientKendall tau rank correlation coefficientKolmogorov complexityKolmogorov structure functionKolmogorov–Smirnov testKraft–McMillan theoremKrigingKruskal–Wallis testKurtosisL-momentLasso (statistics)Least-squares spectral analysisLeast squaresLehmann–Scheffé theoremLehmer meanLempel–Ziv complexityLikelihood-ratio testLikelihood functionLikelihood intervalLilliefors testLine chartLinear discriminant analysisLinear least squares (mathematics)Linear regressionList of fields of application of statisticsList of statistical testsList of statistics articlesLjung–Box testLocal regressionLocation parameterLocation–scale familyLog-rank testLog transformationLogistic regressionLoss functionLossless compressionLp spaceM-estimatorMIT PressMallows's CpManifold hypothesisMann–Whitney U testMarginal likelihoodMarvin MinskyMaximum a posteriori estimationMaximum likelihoodMcNemar's testMeanMean and predicted responseMedianMedian-unbiased estimatorMedical statisticsMethod of moments (statistics)Methods engineeringMinimum-variance unbiased estimatorMinimum distance estimationMinimum mean-square errorMinimum message lengthMin–max normalizationMissing dataMixed modelMode (statistics)Model selectionModel specificationMoment (mathematics)Monotone likelihood ratioMoving least squaresMultiple comparisonsMultivariate adaptive regression splinesMultivariate analysis of varianceMultivariate distributionMultivariate normal distributionMultivariate statisticsNational accountsNatural experimentNelson–Aalen estimatorNon-linear least squaresNonlinear regressionNonparametric regressionNonparametric statisticsNormalization (statistics)Numerical analysisNumerical integrationNumerical smoothing and differentiationObjective Bayesian probabilityObservational studyOccam's razorOfficial statisticsOne- and two-tailed testsOne-part codeOne-to-one correspondenceOpinion pollOptimal decisionOptimal designOrder statisticOrdinary least squaresOrthogonal polynomialsOutlierOutline of statisticsParameter estimationParametric statisticsPartial autocorrelation functionPartial correlationPartial least squares regressionPartition of sums of squaresPearson correlation coefficientPearson product-moment correlation coefficientPercentilePermutation testPie chartPivotal quantityPlug-in principlePoint estimationPoisson regressionPolynomial regressionPopulation (statistics)Population statisticsPosterior probabilityPower (statistics)Power transformPrediction intervalPrincipal component analysisPrior probabilityProbabilistic designProbabilistic modelProbability distributionProbability theoryProportional hazards modelPsychometricsQuality controlQuantile regressionQuasi-experimentQuestionnaireQ–Q plotRadar chartRandom assignmentRandomization testRandomized controlled trialRandomized experimentRange (statistics)Rank correlationRanking (statistics)Rao–Blackwell theoremRegression analysisRegression validationReliability engineeringReplication (statistics)Resampling (statistics)Response surface methodologyRidge regressionRobust regressionRobust statisticsRun chartSample medianSample size determinationSampling (statistics)Sampling distributionScale parameterScatter plotScientific controlScore testSeasonal adjustmentSegmented regressionSelf-extracting archiveSemiparametric regressionShape of the distributionShape parameterShapiro–Wilk testSign testSimple linear regressionSimultaneous equations modelSkewnessSocial statisticsSolomonoff's theory of inductive inferenceSpatial analysisSpearman's rank correlation coefficientSpectral density estimationStandard deviationStandard errorStandard scoreStationary processStatisticStatistical classificationStatistical dispersionStatistical distanceStatistical graphicsStatistical hypothesis testStatistical inferenceStatistical modelStatistical parameterStatistical populationStatistical powerStatistical process controlStatistical theoryStatisticsStem-and-leaf displayStepwise regressionStochastic approximationStratified samplingStructural breakStructural equation modelingStudent's t-testStudentized residualSufficient statisticSurvey methodologySurvival analysisSurvival functionSymbolsSystem identificationTikhonov regularizationTime domainTime seriesTolerance intervalTotal least squaresTrend estimationTruncation (statistics)U-statisticUniformly most powerful testUnit vector normalizationUniversal code (data compression)Up-and-Down DesignsV-statisticVan der Waerden testVarianceVariance-stabilizing transformationVector autoregressionViolin plotWald testWaveletWeighted least squaresWhittle likelihoodWilcoxon signed-rank testWinsorizingYeo–Johnson transformationYouTubeZ-test
17 concepts already in your glossary