Advancing Dietary Recall: Validation and Innovation in Portion Size Estimation Methods for Biomedical Research

Joseph James Dec 02, 2025 259

This article provides a comprehensive analysis of portion size estimation methods for validating dietary recall in research and clinical settings.

Advancing Dietary Recall: Validation and Innovation in Portion Size Estimation Methods for Biomedical Research

Abstract

This article provides a comprehensive analysis of portion size estimation methods for validating dietary recall in research and clinical settings. It explores the foundational importance of accurate dietary assessment for understanding diet-health relationships and details innovative methodological approaches, including 3D cubes, playdough, and digital photography. The content addresses common challenges in measurement and optimization strategies, and presents rigorous validation frameworks for comparing methods against criterion measures like weighed food records and digital photography. Synthesizing current evidence, this resource offers researchers, scientists, and drug development professionals practical guidance for selecting, implementing, and validating portion estimation techniques to enhance data quality in nutritional epidemiology and clinical trials.

The Critical Role of Portion Size Estimation in Dietary Assessment and Health Outcomes Research

Linking Dietary Measurement Accuracy to Public Health and Clinical Outcomes

Accurate dietary assessment is a cornerstone of nutritional epidemiology, public health monitoring, and clinical research. The accuracy with which we measure dietary intake, particularly portion sizes, directly impacts the validity of research linking diet to health outcomes and the effectiveness of public health interventions [1]. Inaccurate dietary data can obscure true diet-disease relationships, leading to flawed conclusions and ineffective policies [2].

Recent global analyses consistently rank suboptimal dietary practices as the highest contributors to morbidity and mortality worldwide [3]. The precision of dietary measurement is therefore not merely a methodological concern but a fundamental determinant of our ability to address pressing public health challenges through evidence-based interventions.

This application note explores how advances in portion size estimation methods strengthen the chain of evidence from dietary intake assessment to public health and clinical outcomes. We focus specifically on validation approaches for portion size estimation methods and their implications for research accuracy.

The Impact of Measurement Accuracy on Public Health Evidence

Consequences of Measurement Error

Dietary assessment is notoriously prone to both random and systematic measurement errors that can substantially distort research findings [1]. Systematic under-reporting affects more than 50% of dietary reports, with misreporting strongly correlated with BMI and varying by age groups [2]. This measurement error introduces noise that obscures true associations between diet and health outcomes, potentially leading to false negative findings in diet-disease relationship studies.

The day-to-day variability in food consumption patterns further complicates accurate assessment. For most nutrients and food groups, 3-4 days of dietary data collection—ideally non-consecutive and including at least one weekend day—are required to obtain reliable estimates of usual intake [2]. Inadequate sampling periods can introduce random error that reduces statistical power and validity.

Implications for Public Health and Clinical Practice

In public health, inaccurate dietary data can lead to:

Misguided nutritional guidelines and policies
Ineffective targeting of public health interventions
Underestimation of true effect sizes for diet-disease relationships
Inaccurate assessment of population nutritional status

In clinical research, particularly drug development, poor dietary assessment can:

Obscure diet-treatment interactions
Compromise nutrition-related safety monitoring
Reduce ability to detect efficacy signals in nutrition-dependent therapies

Validated Portion Size Estimation Methods

Method Comparison and Validation Metrics

Recent validation studies have demonstrated the effectiveness of standardized tools for portion size estimation. The Global Diet Quality Score (GDQS) app, when used with either 3D cubes or playdough for portion estimation, shows equivalence to weighed food records (WFRs) within a pre-specified 2.5-point margin (p = 0.006 for cubes and p < 0.001 for playdough) [3] [4].

Table 1: Performance Metrics for Portion Size Estimation Methods in GDQS App Validation

Method	Equivalence to WFR (p-value)	Kappa Coefficient	Risk Classification Agreement	Food Groups with High Agreement
3D Cubes	0.006	0.5685	Moderate	22/25 groups
Playdough	<0.001	0.5843	Moderate	22/25 groups
Liquid oils (both methods)	-	0.059	Low	27.7% agreement

Both portion size estimation methods showed moderate agreement with WFR when classifying individuals at risk of poor diet quality outcomes (κ = 0.5685 for cubes and κ = 0.5843 for playdough, p < 0.0001 for both) [3]. For 22 out of 25 GDQS food groups, substantial to almost perfect agreement was observed between both methods and WFR, with liquid oils exhibiting the lowest agreement (κ = 0.059, 27.7% agreement, p = 0.009) [4].

Photographic Methods and Angle Optimization

Visual estimation methods using photography have also been validated, with optimal angles varying by food type. A 2025 study demonstrated that estimation accuracy significantly depends on both food type and photographic angle [5].

Table 2: Optimal Photography Angles for Portion Size Estimation of Different Food Types

Food Type	Most Accurate Angle	Accuracy at Optimal Angle	Accuracy with Combined Angles	Notes
Cooked rice	45°	74.4%	85.4%	Solid food
Soup	Varies	Low across angles	Improved with combination	Higher overestimation rates
Grilled fish	No significant difference	Moderate across angles	Slight improvement	Size-based estimation
Vegetables	45°	Moderate	53.7%	Improved with combined angles
Kimchi	45°	52.4%	Improved with combination	Irregular shape
Beverages	70°	73.2%	Maintained	Liquid content

For solid foods, 45° generally provided the best accuracy, corresponding to the average visual angle when seated at a table, while 70° was most accurate for beverages [5]. Combining different angles improved estimation accuracy for most food types, suggesting that multiple perspectives may enhance reliability for complex food items.

Experimental Protocols for Method Validation

Protocol: Validation of Portion Size Estimation Methods Against Weighed Food Records

This protocol outlines the procedure for validating portion size estimation methods, based on the study design used in the GDQS app validation study [3].

Study Design and Participants

Design: Repeated measures design where each participant estimates portion sizes using both the reference method (WFR) and the test method (e.g., GDQS app with cubes or playdough)
Sample Size: 170 participants aged 18 years or older provides sufficient statistical power (>80%) for testing primary equivalence
Eligibility: Adults fluent in the study language, without conditions that would prevent normal dietary recording or participation

Data Collection Procedures

Day 1: Training

Conduct in-person training sessions in groups of up to five participants
Train participants on how to use dietary scales and weigh foods, beverages, and mixed dishes
Provide calibrated digital dietary scales (accurate to 1 g) and WFR data collection forms
Distribute supplementary materials (WFR guide and videos) for reference during the recording period

Day 2: Weighed Food Record

Participants weigh and record all foods, beverages, and mixed dishes consumed during a 24-hour period
Record amounts served and any leftovers for each eating occasion
For mixed dishes, record individual ingredients where possible

Day 3: Test Method Administration

Participants return to submit completed WFR forms
Conduct face-to-face GDQS app interview with both portion size estimation methods (order randomized)
Collect participant feedback on usability of portion size estimation methods

Statistical Analysis

Equivalence Testing: Use paired two one-sided t-tests (TOST) with pre-specified equivalence margin (e.g., 2.5 points for GDQS)
Agreement Analysis: Calculate Kappa coefficients to quantify agreement for risk classification and food group consumption
Reliability Assessment: Estimate within- and between-subject variability using coefficient of variation (CV) method

Figure 1: Workflow for Validating Portion Size Estimation Methods Against Weighed Food Records

Protocol: Optimization of Photographic Angles for Food Portion Estimation

This protocol describes the procedure for determining optimal photographic angles for portion size estimation of different food types, based on research by Kongju National University [5].

Experimental Setup

Participants: 82 healthy adults (balanced gender representation) aged 20-50 years
Inclusion Criteria: No visual impairments, eating disorders, or medications affecting appetite
Food Selection: Six types of food representing different categories (cooked rice, soup, grilled fish, vegetables, kimchi, beverages)
Portion Sizes: Five portion sizes based on percentiles (10th, 30th, 50th, 70th, 90th) of food intake distribution from national surveys

Procedure

Meal Observation Phase

Prepare experimental meals with different portion size combinations
Allow participants to observe meals for 3 minutes (approximately 30 seconds per food item)
Conduct observations approximately 1 hour after participants' last meal

Intermission

Participants move to separate room
Watch non-food-related video for 2 minutes to clear visual memory

Portion Matching Task

Present photographs of each food type taken from three different angles
For solid foods: 0°, 45°, 70°
For beverages: 45°, 60°, 70°
Participants select photograph that matches observed portion size
Rate confidence in selection on 5-point Likert scale

Data Analysis

Calculate accuracy rates for each food type and angle combination
Determine underestimation and overestimation rates
Analyze confidence ratings relative to accuracy
Identify optimal angles for each food type and benefits of angle combinations

The Scientist's Toolkit: Research Reagent Solutions

Table 3: Essential Materials for Dietary Assessment Validation Studies

Item	Specifications	Application	Validation Evidence
3D Printed Cubes	Set of 10 cubes with pre-defined sizes based on food group gram cut-offs and density data	Portion size estimation at food group level	Equivalent to WFR within 2.5-point GDQS margin (p=0.006) [3]
Playdough	Standard commercial playdough in multiple colors	Flexible portion size estimation for irregularly shaped foods	Equivalent to WFR within 2.5-point GDQS margin (p<0.001) [3]
Digital Dietary Scales	KD-7000, capacity 7kg, accuracy to 1g	Weighed food records as reference method	Gold standard for validation studies [3]
Multi-angle Photography System	Standardized setup for 0°, 45°, 60°, 70° angles	Food photograph database development	Optimal angles vary by food type (45° for solids, 70° for beverages) [5]
GDQS Mobile Application	Digital platform for standardized dietary data collection	Streamlined administration of GDQS metric	Validated against WFR with moderate agreement (κ=0.57-0.58) [3]
Foodbook24 Digital Platform	Web-based 24-hour dietary recall tool with multi-language support	Dietary assessment in diverse populations	Strong correlations for 44% of food groups compared to interviewer-led recalls [6]

Implications for Public Health Research and Clinical Practice

Advancing Nutritional Epidemiology

Validated portion size estimation methods enable more precise measurement of dietary exposures, strengthening observational studies of diet-disease relationships. The GDQS metric represents an innovation in diet quality assessment by using quantity of consumption information at the food group level and has been validated against health outcomes in diverse populations [3].

Accurate dietary assessment is particularly important for understanding the relationship between diet and chronic diseases such as metabolic disorders, cardiovascular diseases, and cancer [2]. The strength of these associations has often been limited by measurement error in dietary data, which newer validated methods aim to reduce.

Informing Public Health Policy

Reliable dietary data is essential for developing evidence-based public health policies and nutritional guidelines. Simplified but validated tools like the GDQS app enable more frequent and widespread monitoring of diet quality, potentially leading to more responsive public health interventions [3].

As populations become increasingly diverse, adapting dietary assessment tools to different cultural contexts becomes essential. The expansion of Foodbook24 to include Brazilian and Polish food items and languages demonstrates how tools can be adapted to improve representation of ethnic minority groups in nutritional surveillance [6].

Figure 2: Pathway from Accurate Dietary Measurement to Improved Public Health Outcomes

The validation of portion size estimation methods represents a critical advancement in nutritional science with direct implications for public health and clinical research. Standardized tools like the GDQS app with 3D cubes or playdough provide practical, validated methods that can be deployed across diverse settings without sacrificing accuracy relative to more burdensome weighed food records.

The optimal application of these methods depends on food type, with photographic approaches benefiting from angle optimization and combined perspectives for complex foods. As dietary assessment continues to evolve, leveraging technology while maintaining rigorous validation will enhance our ability to detect true diet-health relationships and develop effective public health strategies.

By adopting validated portion size estimation methods, researchers and public health professionals can strengthen the evidence base linking diet to health outcomes, ultimately supporting more effective clinical and public health interventions to improve population health.

Table 1: Summary of Quantitative Findings on Dietary Assessment Variability and Bias

Challenge Category	Specific Metric	Quantitative Finding	Source/Context
Day-to-Day Variability	Minimum days for reliable macronutrient intake	2-3 days [2]	"Food & You" digital cohort (n=958)
Day-to-Day Variability	Minimum days for reliable micronutrient intake	3-4 days [2]	"Food & You" digital cohort (n=958)
Day-to-Day Variability	Minimum days for water/coffee/total food quantity	1-2 days [2]	"Food & You" digital cohort (n=958)
Day-to-Day Variability	Impact of weekend vs. weekday intake	Higher energy, carbohydrate, and alcohol intake on weekends [2]	Linear Mixed Model analysis
Recall Bias	Food omission rate in self-administered 24HR (Brazilian cohort)	24% (6/25 foods) [6]	Foodbook24 comparison study
Recall Bias	Food omission rate in self-administered 24HR (Irish cohort)	13% (5/38 foods) [6]	Foodbook24 comparison study
Method Validity	Correlation (diet history vs. biomarkers): Cholesterol	Moderate agreement (Kappa K = 0.56, p=0.04) [7]	Pilot study in eating disorders (n=13)
Method Validity	Correlation (diet history vs. biomarkers): Iron	Moderate-good agreement (Kappa K = 0.68, p=0.03) [7]	Pilot study in eating disorders (n=13)
Method Validity	Correlation (FFQ vs. 24HR): Energy	r = 0.57 - 0.63 [8]	PERSIAN Cohort validation (n=978)
Method Validity	Correlation (FFQ vs. 24HR): Protein	r = 0.56 - 0.62 [8]	PERSIAN Cohort validation (n=978)
Tool Usability	Food list representativeness after expansion	86.5% (302/349) of consumed foods available [6]	Foodbook24 acceptability study

Experimental Protocols for Mitigating Key Challenges

Protocol for Validating a Dietary Recall Tool Against Reference Methods

This protocol outlines a method for validating a web-based dietary recall tool, such as Foodbook24, against interviewer-led recalls, accounting for recall bias and day-to-day variability [6].

Application: To assess the relative accuracy and acceptability of a new dietary assessment tool in specific population subgroups.
Primary Challenges Addressed: Recall bias, Social desirability bias, Day-to-day variability.

Workflow Diagram: Dietary Recall Tool Validation

Step-by-Step Procedure:

Participant Recruitment: Recruit participants from target population subgroups (e.g., Brazilian, Irish, Polish adults living in Ireland) to ensure diversity in dietary habits and potential recall capabilities [6].
Tool Expansion (Phase 1):
- Review national food consumption surveys and relevant literature from the target populations to identify frequently consumed food items [6].
- Expand the food list of the dietary recall tool with these items and translate them into the relevant languages (e.g., Polish, Portuguese) [6].
- Assign nutrient composition data, prioritizing national databases, and apply standardized portion size estimates from official sources [6].
Acceptability Study (Phase 2):
- Ask participants to provide a visual record (e.g., food diary with photos) of their habitual diet.
- Check the percentage of foods listed by participants that are available in the updated food list of the tool. A high percentage (>85%) indicates good representativeness and helps mitigate recall bias caused by missing items [6].
Comparison Study (Phase 3):
- On Day 1, participants complete one 24-hour dietary recall using the web-based tool and one interviewer-led recall on the same day.
- Repeat this process after a 2-week interval to account for day-to-day variability in intake.
- The order of administration should be randomized to control for order effects.
Data Analysis:
- Use Spearman's rank correlation to assess the relationship between food group and nutrient intakes from the two methods.
- Employ Mann-Whitney U tests to check for significant differences in intakes reported by the two methods.
- Calculate κ coefficients to measure agreement on categorical dietary data.

Protocol for Determining Minimum Days of Dietary Assessment

This protocol uses data from a digital tracking cohort to determine the number of days required to reliably estimate usual intake, directly addressing the challenge of day-to-day variability [2].

Application: To establish nutrient- and food group-specific guidelines for the number of recording days needed in dietary studies to account for intra-individual variation.
Primary Challenge Addressed: Day-to-day variability.

Workflow Diagram: Minimum Days Estimation Protocol

Step-by-Step Procedure:

Data Collection:
- Recruit a large cohort (e.g., n=958) and have participants track all meals for 2-4 weeks using a digital tool (e.g., MyFoodRepo app) that allows image, barcode, and manual entry [2].
- All logged entries should undergo a verification process by trained annotators.
Data Preparation:
- Exclude days with implausibly low energy intake (e.g., <1000 kcal) to remove misreported days [2].
- For each participant, select the longest sequence of at least 7 consecutive days of valid data.
Analysis of Day-of-Week Effects:
- Use a Linear Mixed Model (LMM) to analyze the effects of age, BMI, sex, and day of the week on nutritional intake, with Monday as the reference day [2].
- This identifies significant patterns, such as higher energy or alcohol intake on weekends, which must be considered in study design.
Minimum Days Estimation:
- Method 1 (Coefficient of Variation): Calculate the within- and between-subject variability for each nutrient and food group. Use this to compute the number of days needed to achieve a reliability coefficient of r > 0.8 [2].
- Method 2 (Intraclass Correlation Coefficient): Calculate the ICC for all possible combinations of days (e.g., 1 day, 2 days, etc.) for each nutrient to observe how reliability increases with more days of recording [2].
Synthesis and Recommendation:
- Synthesize results from both methods to provide specific minimum day recommendations for different nutrients (e.g., 1-2 days for water, 2-3 days for macronutrients, 3-4 days for micronutrients and vegetables) [2].
- Emphasize that days should be non-consecutive and include at least one weekend day for most reliable estimation.

Protocol for Administering a Diet History in Clinical Populations

This protocol details the administration of a diet history in a clinical population with a high risk of misreporting, focusing on mitigating social desirability and recall biases [7].

Application: To gather detailed dietary data in clinical settings, such as outpatient eating disorder services, for nutritional rehabilitation planning.
Primary Challenges Addressed: Recall bias, Social desirability bias.

Step-by-Step Procedure:

Setting and Training:
- The diet history should be administered in a clinical setting (e.g., outpatient service) by a trained dietitian. The skill of the interviewer is critical in reducing over- or under-reporting [7].
Structured Interview:
- Use a structured format, like the Burke diet history, to assess individual food consumption, habitual intake from core food groups, and specific behaviors relevant to the population (e.g., missed meals, binge eating, dieting days) [7].
- The interview should produce a more complete description of food intake than a single 24-hour recall or food frequency questionnaire.
Targeted Questioning:
- Include specific, non-judgmental questions about dietary supplement use, use of substances to influence weight, and behaviors such as binge eating, purging, or ritualistic eating. This is crucial as forgetting to report supplement use can significantly alter nutrient intake comparisons with biomarkers [7].
Biomarker Validation (if possible):
- Within 7 days prior to or after the diet history, collect blood samples for relevant nutritional biomarkers (e.g., cholesterol, triglycerides, iron, total iron-binding capacity) [7].
- This allows for a objective validation of the reported intake. For example, agreement between dietary iron and serum iron levels can be assessed using kappa statistics.
Data Interpretation:
- Acknowledge that cognitive function impacted by conditions like starvation may affect a participant's ability to accurately describe portion sizes and frequency of consumption [7].
- Interpret the data in the context of the clinical presentation and biomarker results.

The Scientist's Toolkit: Research Reagent Solutions

Table 2: Essential Tools and Methods for Dietary Recall Validation Research

Item / Reagent	Function / Application in Research	Key Features & Considerations
Web-Based 24HR Tool (e.g., Foodbook24)	Self-administered dietary recall; improves scalability and inclusion of diverse populations [6].	Requires a representative, translated food list; uses food images for portion size estimation; nutrient data linked to composition databases (e.g., CoFID).
AI-Assisted Food Tracking App (e.g., MyFoodRepo)	Passive data collection for determining day-to-day variability and minimum recording days [2].	Enables logging via image, barcode, and manual entry; integrates with nutritional databases; allows collection of dense longitudinal data.
Interviewer-Led 24HR Recall	Serves as a reference method for validating self-administered tools; uses multi-pass method to enhance completeness [6].	Considered more accurate than self-report alone; but is labor-intensive and subject to interviewer bias.
Nutritional Biomarkers	Objective validation of dietary intake reported via subjective methods [7].	Must be selected on a nutrient-by-nutrient basis (e.g., serum triglycerides for dietary cholesterol, TIBC for iron); provides time-integrated reflection of intake.
Standardized Food Portion Atlas	Visual aid to improve accuracy of portion size estimation during recalls [6].	Contains images of foods in multiple portion sizes; critical for translating consumed food into quantifiable data.
Linear Mixed Models (LMM)	Statistical method to analyze fixed effects (age, BMI, day-of-week) and random effects (participant) on dietary intake [2].	Accounts for repeated measures within individuals; ideal for analyzing temporal patterns and demographic influences on reporting.
Coefficient of Variation (CV) & Intraclass Correlation (ICC)	Statistical measures to quantify within- and between-subject variability and determine reliability of intake data [2].	CV method calculates days needed for a specific reliability; ICC measures agreement for different numbers of recording days.

Impact of Systematic Under-Reporting on Diet-Health Relationship Studies

Systematic under-reporting of energy intake (EIn) represents a fundamental challenge in nutritional epidemiology, impeding the accurate investigation of diet-health relationships [9]. Traditional self-reported dietary instruments, including diet recalls, diet diaries, and food frequency questionnaires (FFQs), demonstrate strong agreement with each other but consistently fail when validated against objective biomarkers of intake [9]. The development of the doubly labeled water (DLW) method for measuring total energy expenditure (TEE) provided researchers with a biomarker for validating self-reported energy intake, revealing substantial discrepancies between reported and actual consumption [9]. When body energy stores remain relatively stable, TEE serves as an excellent biomarker of habitual energy intake based on the first law of thermodynamics [9].

Research conducted over the past three decades has consistently demonstrated that under-reporting of energy intake is not random but exhibits systematic patterns that vary with physiological and psychological factors [9]. Studies have identified that the degree of under-reporting increases with body mass index (BMI), suggesting that individuals concerned about their body weight—regardless of their actual weight status—are more prone to under-reporting [9]. More recent evidence indicates that under-reporting occurs almost twice as frequently in individuals following low-calorie or carbohydrate-restrictive diets compared to the general population, with 38.84% and 43.83% of these groups under-reporting respectively [10]. This systematic measurement error attenuates diet-disease relationships and compromises the scientific validity of nutritional research findings [9].

Quantitative Evidence of Systematic Under-Reporting

Magnitude and Patterns of Energy Intake Under-Reporting

Table 1: Evidence of Energy Intake Under-Reporting Across Study Populations

Study Population	Under-reporting Magnitude	Comparison Method	Key Findings
Obese women (BMI 32.9±4.6 kg/m²)	34% less than TEE [9]	7-day food diary vs. DLW	Significant under-reporting (P<0.05)
Lean women	No significant difference [9]	7-day food diary vs. DLW	Reporting accuracy maintained
General adult population	22.89% under-reporting prevalence [10]	Self-report vs. predictive equation from DLW	Baseline under-reporting rate
Low-calorie diet participants	38.84% under-reporting prevalence [10]	Self-report vs. predictive equation from DLW	Nearly double general population rate
Carbohydrate-restrictive diet participants	43.83% under-reporting prevalence [10]	Self-report vs. predictive equation from DLW	Highest under-reporting prevalence

Table 2: Macronutrient-Specific Under-Reporting Patterns

Macronutrient	Reporting Accuracy	Notes	References
Protein	Least under-reported	Validated via urinary nitrogen biomarkers; 47% underestimation in weight loss studies	[9]
Carbohydrates	Variably under-reported	Higher under-reporting in restrictive diets	[10]
Fats	Highly under-reported	Social desirability bias affects reporting	[9]
Total Energy	Systematically under-reported	Increases with BMI and dieting behavior	[9] [10]

The evidence demonstrates that systematic under-reporting affects macronutrients differentially, with protein being the least under-reported macronutrient [9]. This selective reporting pattern indicates that not all foods are under-reported equally, with certain food categories more susceptible to omission or portion size misestimation [9]. The between-individual variability in under-reporting of self-reported intake of energy and other nutrients subsequently attenuates diet-disease relationships in epidemiological studies [9].

Impact of Diet Type on Reporting Accuracy

Table 3: Association Between Diet Type and Under-Reporting Odds

Diet Category	Unadjusted Odds Ratio (OR)	Adjusted Odds Ratio (OR)	Confidence Intervals
General population	Reference	Reference	-
Low-calorie diets	2.32	2.32	1.93, 2.79
Carbohydrate-restrictive diets	2.86	2.86	1.85, 4.42

Recent analyses of National Health and Nutrition Examination Survey (NHANES) data from 2009-2018 reveal that specific dietary patterns are associated with significantly higher odds of under-reporting [10]. Even after adjustment for sociodemographic factors, both low-calorie and carbohydrate-restrictive diets maintained significantly elevated odds ratios for under-reporting [10]. Subanalyses restricted to participants denying any weight loss intention or with stable weight revealed comparable patterns, suggesting that the association is not solely driven by active weight loss efforts [10].

Portion Size Estimation Methods: Validation Protocols

The accurate estimation of portion sizes represents a critical component in addressing systematic under-reporting in dietary assessments. The following protocols detail validated methodologies for portion size estimation within dietary recall validation research.

Protocol 1: GDQS App with 3D Cubes or Playdough

Purpose: To standardize portion size estimation at the food group level for the Global Diet Quality Score (GDQS) using physical aids [3].

Equipment and Materials:

GDQS mobile application
Set of ten 3D printed cubes of pre-defined sizes
Standard playdough (multiple colors recommended)
Calibrated digital dietary scale (capacity 7kg, accurate to 1g)
Weighed Food Record (WFR) data collection forms
Recipe forms for mixed dishes

Procedure:

Participant Training: Conduct 40-60 minute in-person training sessions in groups of up to five participants on proper use of dietary scales and WFR documentation [3].
Weighed Food Record Collection: Participants weigh and record all foods, beverages, and mixed dishes consumed during a 24-hour period, including ingredients of mixed dishes [3].
GDQS App Interview: Within 24 hours of WFR completion, conduct face-to-face interviews using the GDQS application [3].
Portion Size Estimation: Apply both cube and playdough methods in randomized order as determined by the GDQS app [3].
- Cube Method: Participants select cube sizes corresponding to consumed food groups based on pre-defined volume thresholds [3].
- Playdough Method: Participants mold playdough to represent consumed amounts for each food group [3].
Data Analysis: Assess equivalence between GDQS-WFR and GDQS-cubes or GDQS-playdough using paired two one-sided t-tests (TOST) with pre-specified 2.5-point equivalence margin [3].

Validation Metrics:

Equivalence testing with TOST method (α=0.05)
Kappa coefficient for agreement on poor diet quality risk classification
Food group-level agreement analysis

Protocol 2: Doubly Labeled Water Validation Method

Purpose: To validate self-reported energy intake against objectively measured total energy expenditure using the doubly labeled water method [9].

Equipment and Materials:

Doubly labeled water (²H₂¹⁸O)
Mass spectrometer for isotope ratio analysis
Standardized dietary assessment instruments (recalls, diaries, or FFQs)
Body composition analysis equipment (DEXA, BIA, or ADP)
Urine or saliva collection containers
Temperature-controlled storage for biological samples

Procedure:

Baseline Assessment: Measure body weight, height, BMI, and body composition [9].
DLW Administration: Administer an oral dose of doubly labeled water (²H₂¹⁸O) based on body weight [9].
Equilibrium Period: Collect baseline urine/saliva samples 4-6 hours post-dose after isotope equilibration [9].
Sample Collection: Collect urine or saliva samples at regular intervals over 10-14 days to track isotope elimination [9].
Dietary Assessment: Implement self-reported dietary intake measures during the same assessment period [9].
Isotope Analysis: Analyze isotope ratios in biological samples using mass spectrometry [9].
TEE Calculation: Calculate carbon dioxide production rate from differential isotope elimination rates, then compute TEE using standard indirect calorimetry equations [9].
Under-reporting Identification: Calculate the ratio of reported energy intake to measured TEE, with values <0.76 (for weight-stable individuals) indicating under-reporting [9].

Validation Metrics:

Energy intake to TEE ratio (EI:TEE)
Precision of DLW method (typically ~7% for individuals)
Macronutrient-specific validation against urinary nitrogen (protein) or other biomarkers

The Researcher's Toolkit: Essential Materials and Reagents

Table 4: Research Reagent Solutions for Dietary Validation Studies

Item	Function/Application	Specifications	Validation Evidence
Doubly Labeled Water (²H₂¹⁸O)	Gold standard for total energy expenditure measurement	Isotopically enriched water; dose based on body weight	Accuracy 1-2%; individual precision 7% [9]
3D Printed Cubes	Portion size estimation at food group level	Set of 10 cubes with pre-defined volumes based on food group gram cut-offs	Equivalent to WFR within 2.5-point margin (p=0.006) [3]
Standardized Playdough	Alternative portion size estimation method	Multiple colors; moldable to represent various food amounts	Equivalent to WFR within 2.5-point margin (p<0.001) [3]
Calibrated Dietary Scales	Weighed food record validation	7kg capacity, accurate to 1g	Reference method for portion size validation [3]
Urinary Nitrogen Analysis	Protein intake validation	Spectrophotometric analysis of urinary nitrogen excretion	Objective biomarker for protein intake [9]
GDQS Mobile Application	Standardized diet quality assessment	Electronic data collection with portion size estimation integration	Moderate agreement with WFR (κ=0.5685-0.5843) [3]

Implications for Research and Clinical Practice

The systematic under-reporting of energy intake has profound implications for nutritional epidemiology, public health policy, and clinical practice. Research investigating diet-health relationships without accounting for systematic misreporting may generate spurious associations and unreliable findings [9] [10]. The evidence strongly suggests that self-reported energy intake should not be used as a primary assessment method in studies of energy balance, particularly in obesity research where under-reporting correlates with BMI [9].

Recent methodological advances in portion size estimation, such as the validated GDQS app with cubes or playdough, offer promising approaches to reduce measurement error in dietary assessments [3]. These tools provide standardized, practical methods that demonstrate equivalence to weighed food records while offering operational advantages for field-based research [3]. The continued development and validation of such methodologies is essential for advancing the accuracy of dietary assessment in both research and clinical contexts.

Future research directions should focus on extending these validation efforts across diverse populations, improving statistical correction methods for residual measurement error, and developing novel technologies that minimize reliance on self-reported dietary data. Through the implementation of rigorous validation protocols and appropriate methodological safeguards, the field can enhance the validity of diet-health relationship studies and strengthen the evidence base for nutritional recommendations and public health policies.

The Evolution from Traditional Methods to Technology-Enhanced Approaches

Accurate dietary assessment is a cornerstone of nutritional epidemiology, public health research, and clinical trials. At the heart of this assessment lies the technical challenge of portion size estimation (PSE), a fundamental aspect that directly influences data quality and subsequent health outcome correlations. Flaws in dietary assessment methods can generate misleading information, ultimately impacting public health interventions and nutritional guidelines [11]. The evolution from traditional to technology-enhanced PSE methods represents a critical pathway toward reducing measurement errors, improving data accuracy, and enhancing participant compliance. This evolution is particularly relevant for researchers and drug development professionals who require precise dietary metrics to understand diet-disease relationships and intervention outcomes. The measurement errors associated with traditional portion size estimation introduced at multiple points in data collection and handling threaten the validity of collected data, creating a rippling effect that can lead to participant misclassification and flawed exposure-outcome linkages [11]. This application note examines the quantitative evidence supporting this methodological evolution and provides structured protocols for implementing advanced portion size estimation techniques in validation research.

Comparative Performance Data: Traditional Versus Technology-Enhanced Methods

Table 1: Agreement Metrics Between Traditional and Digital 24-Hour Dietary Recall Methods for Beverage Consumption

Meal Time	Food Item Agreement (Kappa Value)	Portion Size Correlation (r-value)	Statistical Significance
Lunch	Highest percentage of agreement	0.324	Statistically significant
Dinner	Fair agreement	0.407	Statistically significant
Morning Snack	Fair agreement	0.465	Statistically significant
All Meals	κ = 0.375833 (Fair agreement)	N/A	N/A

Table 2: Accuracy of Portion Size Estimation Aids (PSEAs) Compared to True Intake

Portion Size Estimation Method	Overall Error Rate	Within 10% of True Intake	Within 25% of True Intake	Bland-Altman Agreement
Text-Based (TB-PSE)	0%	31%	50%	Higher agreement
Image-Based (IB-PSE)	6%	13%	35%	Lower agreement

Quantitative validation studies demonstrate distinct performance patterns between methodological approaches. The PakNutriStudy compared traditional interviewer-led 24-hour dietary recalls (24HR Ver-01) with a digital version using Intake24 (24HR Ver-02) in a cohort of 102 participants aged 18-25 years [11]. The research revealed fair agreement between methods (average kappa value κ = 0.375833) for food item reporting, with the highest agreement observed during lunch time recalls [11]. For portion size estimation, statistically significant correlations were found for morning snacks, lunch, and dinner (r = 0.465, r = 0.324, and r = 0.407 respectively) [11]. Bland-Altman analysis indicated the least agreement between methods for morning snack portion sizes, highlighting specific temporal challenges in digital recall accuracy [11].

A controlled comparison of portion size estimation aids revealed surprising findings regarding technological sophistication versus accuracy. Research with forty participants who consumed a pre-weighed lunch demonstrated that text-based portion size estimation (TB-PSE) using household measures and standard portions outperformed image-based methods (IB-PSE) across multiple metrics [12]. TB-PSE showed a 0% median relative error rate compared to 6% for IB-PSE, with double the proportion of estimates falling within 10% of true intake (31% vs. 13%) [12]. This superior performance of text-based approaches challenges assumptions that visual aids necessarily improve accuracy and emphasizes the need for context-appropriate PSEA selection.

Experimental Protocols for Method Validation

Protocol 1: Validation of Digital Dietary Recall Tools Against Traditional Methods

Objective: To validate web-based 24-hour dietary recall tools against traditional interviewer-led recalls for portion size estimation accuracy.

Materials:

Web-based dietary recall system (e.g., Intake24, ASA24, Foodbook24)
Traditional 24HR questionnaire with standardized portion size probes
Food composition database aligned with study population
Participant training materials for digital platform

Procedure:

Recruit a minimum of 100 participants to ensure adequate statistical power for correlation analysis [11]
Administer both traditional and digital 24HR recalls in randomized order to counterbalance learning effects
Collect data on specific food items and portion sizes for all eating occasions
Train participants on digital platform functionality prior to data collection
Analyze agreement using multiple statistical approaches:
- Calculate kappa statistics for food item identification agreement
- Compute correlation coefficients (Pearson or Spearman) for portion sizes
- Perform Bland-Altman analysis to assess agreement limits between methods
- Compare mean differences in reported portion sizes using Wilcoxon's tests

Applications: This protocol is suitable for establishing convergent validity of new digital tools against traditional methods in specific populations. The PakNutriStudy implementation demonstrated its utility for beverage consumption assessment in South Asian populations [11].

Protocol 2: Biomarker-Based Validation of Portion Size Estimation

Objective: To validate portion size estimation methods against objective biomarkers of dietary intake.

Materials:

Doubly labeled water for energy expenditure measurement
Urinary nitrogen analysis kits for protein intake assessment
Blood collection supplies for serum carotenoids and erythrocyte membrane fatty acids
Continuous glucose monitors for eating episode validation
Experience Sampling Methodology (ESM) application for real-time data collection

Procedure:

Recruit 115 participants to detect correlation coefficients of ≥0.30 with 80% power and alpha error probability of 0.05 [13]
Collect baseline socio-demographic and biometric data
Implement ESM method prompting three 2-hour recalls daily for 14 days [13]
Administer three interviewer-administered 24-hour dietary recalls as reference
Collect biomarker measurements:
- Doubly labeled water for total energy expenditure
- Urinary nitrogen for protein intake validation
- Serum carotenoids for fruit and vegetable intake
- Erythrocyte membrane fatty acids for fatty acid composition
Analyze validity using:
- Mean differences and Spearman correlations between methods
- Bland-Altman plots for agreement assessment
- Method of triads to quantify measurement error components

Applications: This comprehensive protocol enables researchers to quantify measurement error of portion size estimation methods against objective biomarkers, providing robust validation beyond self-report comparisons [13].

Biomarker Validation Protocol Workflow

Protocol 3: Cross-Cultural Adaptation of Digital Dietary Assessment Tools

Objective: To adapt and validate portion size estimation tools for diverse ethnic and cultural populations.

Materials:

Native language translation services
Cultural-specific food composition databases
Portion size images relevant to local food customs
Focus group facilities for qualitative feedback

Procedure:

Identify target population and commonly consumed foods through:
- Review of national food consumption surveys [6]
- Guided group interviews with scoring for likelihood of consumption [14]
Expand food list to include culturally relevant items (e.g., 546 additional foods for Brazilian and Polish populations) [6]
Translate interface and food descriptors into target languages
Adapt portion size images to reflect local servingware and customary portions
Conduct acceptability testing with target population (e.g., visual record of habitual diet) [6]
Validate through comparison study with interviewer-led recalls
Analyze food omission rates by ethnic group and correlation coefficients for food groups

Applications: This protocol enables inclusion of diverse populations in nutritional research, addressing systematic underrepresentation in national nutrition surveys [6].

Implementation Framework and Technical Considerations

Research Reagent Solutions for Dietary Assessment Validation

Table 3: Essential Research Reagents and Platforms for Dietary Recall Validation

Reagent/Platform	Primary Function	Application Context	Technical Specifications
Intake24	Web-based 24HR administration	Population dietary surveys	Incorporates local food databases; customizable portion size images
ASA24 Picture Book	Image-based portion size estimation	Standardized visual food quantification	3-8 portion size images per food item; known gram amounts
Doubly Labeled Water	Objective energy expenditure measurement	Validation against energy intake	Gold standard for total energy expenditure in free-living subjects
Urinary Nitrogen Analysis	Protein intake validation	Objective protein consumption assessment	Correlates with reported protein intake
Experience Sampling Methodology (ESM)	Real-time data collection	Minimizing recall bias	Multiple daily prompts; smartphone-based implementation
Foodbook24	Multi-lingual dietary assessment	Diverse population research	Customizable food lists; multiple language interfaces

Integration Pathways for Technology-Enhanced Dietary Assessment

Methodological Evolution Pathway

Successful implementation of technology-enhanced portion size estimation requires systematic attention to both technical and participant-centric factors. Research indicates that while data collectors find digital tools easier for processing, participants may regard them as time-consuming and less convenient than traditional methods [11]. This highlights the critical importance of participant training and user experience optimization in digital tool implementation. The contextual localization of dietary assessment tools emerges as another crucial factor, as dietary datasets are highly influenced by sociodemographic characteristics and cultural practices [11]. The expansion of Foodbook24 for Brazilian and Polish populations in Ireland demonstrates the feasibility of adapting digital tools through food list expansion (adding 546 foods), language translation, and portion size image customization [6].

Emerging methodologies like the Experience Sampling-based Dietary Assessment Method (ESDAM) represent promising approaches that address fundamental limitations of traditional recalls. ESDAM uses near real-time data collection through app-based prompting three times daily, requesting dietary intake reporting for the previous two hours at meal and food-group level [13]. This methodology minimizes recall bias, reactivity bias, and misreporting by capturing intake closer to consumption events [13]. The method is designed to assess habitual intake over a two-week period and has demonstrated low burden and high usability in validation studies [13].

The evolution from traditional to technology-enhanced approaches in portion size estimation represents a methodological paradigm shift with profound implications for dietary recall validation research. Quantitative evidence demonstrates that digital tools can achieve fair to good agreement with traditional methods while offering advantages in data processing efficiency and reduced resource requirements [11]. The integration of objective biomarkers through rigorous validation protocols provides a gold standard for establishing method accuracy beyond inter-method comparisons [13]. Future directions should focus on refining image-based portion size estimation, which currently underperforms text-based methods for accuracy [12], expanding cultural adaptation frameworks for diverse populations [6] [14], and developing increasingly sophisticated real-time assessment methodologies that minimize participant burden while maximizing data quality [13]. For researchers and drug development professionals, these technological advancements offer increasingly robust tools for obtaining precise dietary metrics essential for understanding diet-disease relationships and evaluating nutritional interventions.

Implementing Modern Portion Size Estimation Techniques: From Physical Aids to Digital Solutions

Within dietary recall validation research, accurate portion size estimation is a critical methodological challenge. Physical estimation tools provide tangible, three-dimensional references that help respondents conceptualize and report the volume of food consumed more accurately than abstract methods alone. This document details the application and protocols for three key physical tools—3D cubes, playdough, and food models—validated for use in dietary assessment. These tools are instrumental in standardizing data collection across diverse populations and settings, thereby improving the reliability of diet quality metrics like the Global Diet Quality Score (GDQS) [4] [3]. Their application is particularly valuable in field-based research and studies with specific demographic groups, such as children, where traditional digital methods may be less practical or require prohibitive resources [15] [16].

Experimental Protocols

This section provides detailed methodologies for implementing the key physical tools in dietary assessment studies, from preparatory steps to data collection procedures.

Protocol for 3D Cubes and Playdough with the GDQS App

The following protocol, adapted from a validation study employing a repeated-measures design, assesses diet quality equivalence against the reference method of Weighed Food Records (WFR) [4] [3].

A. Pre-Study Preparation

Tool Acquisition: Obtain a set of ten 3D-printed cubes of pre-defined sizes. The volume of each cube is determined by the gram cut-offs and food density data specific to each of the 25 GDQS food groups [3]. Alternatively, prepare standard, non-toxic playdough for participant handling.
Digital Tool: Utilize the GDQS mobile application, which is programmed to standardize data collection and randomize the order of cube and playdough method administration during the interview [3].
Training Materials: Develop training guides and videos for the WFR method. Prepare calibrated digital dietary scales (accurate to 1 g), WFR data collection forms, and informed consent documents [3].

B. Participant Recruitment and Training

Recruitment: Recruit adults (aged 18+) via community and institutional listservs and flyers. A sample size of approximately 170 is sufficient for statistical power in method equivalence testing [3].
Ethical Considerations: Obtain ethical approval from a recognized review board. Secure written informed consent from all participants, outlining the study duration, activities, and compensation (e.g., a $200 gift card) [3].
WFR Training: Conduct in-person training sessions in small groups (up to 5 people) for 40-60 minutes. Train participants to use the dietary scale, weigh all foods, beverages, and ingredients in mixed dishes, and accurately complete the WFR forms. Emphasize the requirement to avoid mixed dishes prepared outside the home during the 24-hour recording period [3].

C. Data Collection Procedure (3-Consecutive-Day Design)

Day 1 (Training): Distribute calibrated scales and WFR forms to participants after the training session [3].
Day 2 (Food Recording): Participants weigh and record all consumed foods and beverages over a 24-hour period [3].
Day 3 (Interview): Participants return submitted WFR forms and undergo a face-to-face GDQS app interview.
- The app administers the dietary recall for the same 24-hour period.
- The app randomizes whether cubes or playdough are used first for portion size estimation.
- For the cube method, participants use the physical cubes to represent the total consumed volume for each food group [3].
- For the playdough method, participants mold the playdough to represent the overall amount consumed for each food group [3].
- Participants complete the entire GDQS app interview using one method before repeating the process with the second method.
- Researchers collect feedback on the usability of both tools [3].

D. Data Analysis

Equivalence Testing: Use a paired two one-sided t-test (TOST) to assess if the GDQS from the cube or playdough methods is equivalent to the GDQS from the WFR, defining a pre-specified equivalence margin (e.g., 2.5 points) [4] [3].
Agreement Analysis: Calculate the Kappa coefficient (κ) to quantify agreement between methods for classifying individuals at risk of poor diet quality and for consumption of each of the 25 GDQS food groups [4] [3].

Protocol for 3D Food Models in Adolescent Populations

This protocol compares traditional 3D food models with a digital tool (Intake24) for portion size estimation in children aged 11-12 years [15] [16].

A. Pre-Study Preparation

Model Selection: Acquire a comprehensive set of 3D food models representing commonly consumed items, including shapes for bread, chips, sausages, fruits, biscuits, pies, and various household utensils (spoons, cups, bowls, glasses) [16].
Digital Tool Setup: Ensure access to the online Intake24 dietary recall tool, which contains a food database and portion estimation via photographs [15] [16].

B. Participant Recruitment

Recruitment: Recruit participants from secondary schools with signed consent from both parents and children [15] [16].
Ethical Approval: Secure approval from the university ethics committee [15].

C. Data Collection Procedure

Food Diary Completion: Participants complete a two-day food diary for consecutive days [15] [16].
Researcher Interview: A researcher meets with the participant the day after the recording days. The interview includes:
- Clarifying the food diary entries and checking for commonly forgotten items.
- Randomizing the order of the two portion estimation methods.
- 3D Food Model Method: The participant uses the physical models on a dinner plate to estimate the portion size of each recorded food/drink item. The researcher notes the selected models [15] [16].
- Intake24 Method: The researcher enters foods from the diary into Intake24, and the participant selects the closest portion photograph for each item [15] [16].

D. Data Processing and Analysis

Data Coding: Code all food diaries for nutritional composition using a standard nutrient databank. For the food model method, calculate food weights using food-specific conversion factors and the volume of the selected models [15] [16].
Statistical Comparison: Use Bland-Altman analysis to compare mean daily food weight, energy, and nutrient intakes between the two methods. Analyze limits of agreement to determine if the methods are interchangeable [15] [16].

Performance Data and Comparative Analysis

The following tables summarize key quantitative findings from validation studies for the physical tools discussed.

Table 1: Key Findings from the GDQS App Cube and Playdough Validation Study (n=170) [4] [3]

Metric	3D Cubes vs. WFR	Playdough vs. WFR
GDQS Equivalence (vs. WFR)	Equivalent (p=0.006)	Equivalent (p<0.001)
Agreement on Risk of Poor Diet (Kappa, κ)	κ = 0.5685 (p<0.0001)	κ = 0.5843 (p<0.0001)
Food Group Agreement	Substantial-almost perfect for 22/25 groups	Substantial-almost perfect for 22/25 groups
Lowest Agreement Food Group	Liquid Oils (κ = 0.059, 27.7% agreement)	Liquid Oils (κ = 0.059, 27.7% agreement)

Table 2: Comparison of 3D Food Models and Intake24 (Digital Photos) in Adolescents (n=70) [15] [16]

Metric	3D Food Models (Reference)	Intake24 (Test)	Agreement
Mean Food Weight Estimation	Baseline	Geometric Mean Ratio: 1.00	Limits of Agreement: -35% to +53%
Mean Energy Intake	Baseline	1% lower on average	-
Macro/Micronutrient Intake	Baseline	Within 6% of model estimates	-

The Researcher's Toolkit: Essential Materials

This section lists the key reagents, tools, and materials required to implement the protocols described in this document.

Table 3: Essential Research Reagents and Materials

Item	Specification / Variety	Primary Function in Research
3D Cubes	Set of 10, pre-defined sizes, volumes based on GDQS food group cut-offs and densities [3].	Standardizes portion size estimation at the food group level for the GDQS app [4] [3].
Playdough	Standard, non-toxic modeling compound [3].	Flexible, low-cost alternative to cubes for estimating amorphous and varied food volumes at the group level [4] [3].
3D Food Models	Variety of shapes: bread slices, spheres, sticks, pie wedges, sausages, biscuits, utensils (spoons, cups, bowls) [16].	Aids portion estimation of individual foods and drinks in interviewer-administered recalls, especially with children [15] [16].
Calibrated Digital Scale	Capacity: 7 kg, Accuracy: 1 g (e.g., MyWeigh KD-7000) [3].	Provides ground-truth data for food consumption amounts in validation studies via Weighed Food Records (WFR) [3].
GDQS App	Mobile application, includes standardized recall and randomized method order [3].	Simplifies and standardizes the collection and tabulation of diet quality data using physical tools [4] [3].
Intake24	Online 24-h dietary recall tool with integrated food database and portion photos [15] [16].	Serves as a digital comparator for traditional food models; streamlines data collection and coding [15] [16].

Workflow and Logical Diagrams

The following diagram illustrates the typical validation workflow for comparing physical estimation tools against a reference method in a dietary study.

Critical Considerations for Implementation

Successful application of these tools requires attention to several practical and methodological factors:

Tool Selection: The choice of tool depends on the research context. 3D cubes are optimized for food-group-level estimation in the GDQS framework [3]. Playdough offers a flexible and cost-effective alternative, particularly suitable for settings where producing 3D cubes is impractical [4] [3]. Detailed 3D food models are best for estimating individual food items in interviewer-led recalls with specific populations like children [15] [16].
Limitations and Error Sources: Be aware that all methods show varying accuracy across food types. Liquid oils and amorphous foods often present the highest estimation errors [4] [3]. The 3D food model and Intake24 comparison showed relatively wide limits of agreement (-35% to +53%), indicating that while group-level means are comparable, individual-level estimates can differ significantly [15].
Operational Logistics: Using physical models requires transport and organization, which can be time-consuming [15]. Playdough may need replacement over time. Digital tools like the GDQS app and Intake24 streamline data entry and coding but require access to compatible devices and, in some cases, internet connectivity [3] [15].

Accurate portion size estimation is a critical component of dietary recall validation research. Digital photography has emerged as a powerful tool to enhance the precision of dietary assessment, reducing reliance on memory and subjective estimation. This document outlines evidence-based protocols for photographic methodologies, focusing on optimal camera angles and standardization procedures to ensure data reliability and reproducibility in scientific studies. The guidance is framed within the context of validating portion size estimation methods, providing researchers with practical tools for implementation in both laboratory and free-living settings.

Optimal Angles for Food Photography: Quantitative Analysis

The angle at which food is photographed significantly influences portion size estimation accuracy. Different food types possess distinct physical properties—such as shape, height, and structural complexity—that interact with perspective to affect visual perception. The following table synthesizes recent research findings on optimal angles for various food categories.

Table 1: Food Portion Estimation Accuracy by Photography Angle

Food Category	Optimal Angle(s)	Accuracy at Optimal Angle	Key Research Findings
Cooked Rice	45°	74.4% (85.4% with combined angles)	Highest accuracy achieved at 45°; combining multiple angles significantly improved accuracy [5] [17].
Beverages	70°	73.2%	The higher angle provided a better view of liquid volume in containers [5] [17].
Solid Foods (General)	45°	Varies by food	Corresponds to the average visual perspective when seated at a table, minimizing perceptual distortion [5].
Kimchi	45°	52.4%	Solid, amorphous food items were most accurately estimated at this angle [17].
Vegetables	Combined 0°, 45°, 70°	53.7%	Accuracy significantly improved when multiple angles were combined versus any single angle [5].
Grilled Fish	Combined Angles	Slight Improvement	No single angle showed significant superiority, but combined angles improved accuracy [5].
Soup	All Angles Tested	Lower Accuracy	Consistently showed lower accuracy and higher overestimation rates across all angles [5].

Standardization Protocols for Dietary Assessment Photography

Standardized protocols are essential for collecting consistent, high-quality photographic data for portion size estimation. The following section details equipment specifications and setup procedures derived from established dietary assessment systems.

Equipment Specifications and Setup

Table 2: Camera Setup Specifications for Food Photography

Parameter	CNRC (Baylor College of Medicine) Protocol	Westat Protocol	Contractor (Shawn Chippendale) Protocol
Camera Angle	Dual angles simultaneously: 5° (overhead) and 42° (side)	Single angle: 5° (overhead) or 35°-42° (side)	Adapted per food item
Camera Distance	86.36 cm (overhead), 91.44 cm (side)	Variable, standardized in post-processing	23.5" from camera lens to plate center
Lens Focal Length	40mm (overhead), 42mm (side)	85mm typical	35mm
Lighting	Two photographic flash heads in light booth	Two strobes with soft box reflectors	Godox AD300pro & AD200pro with softboxes
Background	Chroma-key blue paper backdrop	Chroma-key blue paper	Chroma-key green seamless paper
Key Feature	Dual synchronized cameras	Use of camera control software	Horizontal arm with counterweight for stability

Workflow for Standardized Food Photography

The following diagram illustrates the complete workflow for standardizing food photography for dietary assessment, from equipment setup to image archiving.

Experimental Protocol: Validation of Multi-Angle Photography

This protocol details the experimental methodology for validating the accuracy of portion size estimation using multi-angle photography, based on recent research [5] [17].

Study Design and Participant Selection

Objective: To evaluate the validity of estimating food quantities using photographs taken at different angles to increase the accuracy of dietary intake surveys.

Participants:

Sample Size: 82 healthy adults (41 males, 41 females)
Age Range: 20-50 years
Inclusion Criteria: No visual impairments (e.g., color blindness), no history of diabetes or eating disorders, and not taking medications that affect appetite
Recruitment: Participants recruited from university campuses and surrounding communities

Experimental Meal:

Foods: Six types of food—cooked rice, soup, grilled fish, vegetables, kimchi, and beverages
Selection Rationale: Foods chosen based on consumption frequency from national nutrition survey data
Portion Sizes: Three different portion sizes for each food item, determined based on percentiles (10th, 30th, 50th, 70th, and 90th) of population food intake distributions

Experimental Procedure

Meal Observation: Participants observed a prepared meal for 3 minutes, approximately one hour after their last meal
Distractor Task: Participants moved to a separate room and watched a short non-food-related video for 2 minutes
Portion Matching: Participants completed 18 questions matching observed food portions with photographs taken from 3 angles (0°, 45°, 70° for solid foods; 45°, 60°, 70° for beverages)
Confidence Assessment: Participants rated confidence in their selections on a 5-point Likert scale (1=very uncertain to 5=very certain)
Data Analysis: Accuracy rates, underestimation, and overestimation rates calculated for each food type and angle

Visualization of Experimental Methodology

The following diagram illustrates the sequence and relationship of procedures in the multi-angle photography validation experiment.

The Researcher's Toolkit: Essential Materials and Equipment

Successful implementation of digital photography protocols for portion size estimation requires specific equipment and materials. The following table details essential research reagent solutions and their functions.

Table 3: Essential Research Materials for Dietary Photography Studies

Category	Item	Specification/Function	Research Application
Camera Equipment	Digital SLR or Mirrorless Camera	Interchangeable lenses, manual controls, high resolution	Primary image capture device [18]
	Fixed Focal Length Lens	35mm, 40mm, or 85mm preferred	Minimizes distortion, consistent framing [18]
	Sturdy Tripod with Horizontal Arm	Allows precise angle positioning and stability	Maintains consistent camera distance and angle [18]
Lighting System	Studio Strobe Lights	Consistent color temperature, adjustable power	Uniform illumination without shadows [18]
	Softbox Reflectors	Diffuses light, reduces harsh shadows	Creates even lighting across food subjects [18]
Reference Materials	Chroma-Key Background	Blue or green for easy background removal	Standardized backdrop for consistent appearance [18]
	Grid Mat	1.5cm grids providing scale reference	Enables accurate size and volume estimation [19]
	Reference Objects	Utensils, coins, or standardized size markers	Provides scale for portion size estimation [19]
Standardized Tableware	Flat and Soup Plates	White, standard sizes (e.g., 10.25" diameter)	Consistent food presentation across samples [19] [18]
	Measuring Spoons/Cups	Standardized volumes for small portions	Reference for infant-sized portions or ingredients [18]
Software Tools	Camera Control Software	Remote camera operation from computer	Ensures consistency without touching camera [18]
	Image Processing Software	Standardized color correction and cropping	Maintains consistency across image sets [18]

Implementing standardized digital photography protocols with optimal angle selection significantly enhances the accuracy of portion size estimation in dietary assessment. The 45° angle provides the highest accuracy for most solid foods, while a 70° angle is superior for beverages. Combining multiple angles further improves estimation precision. Rigorous standardization of equipment, lighting, and procedures ensures reproducible results essential for validation research. These protocols provide researchers with practical methodologies to enhance the validity of dietary assessment in both controlled studies and free-living conditions.

Accurate portion size estimation is a foundational challenge in dietary recall validation research, as errors in quantifying consumed foods and beverages can significantly bias the assessment of nutrient intake and its relationship to health outcomes [1]. Traditional methods, such as weighed food records (WFRs) and 24-hour recalls, though often considered reference standards, are burdensome and prone to memory-related and systematic errors [20] [1]. The emergence of mobile and artificial intelligence (AI)-assisted applications offers a transformative approach by leveraging image recognition and barcode scanning to automate and objectify dietary assessment. These technologies promise enhanced scalability, reduced participant burden, and more frequent data collection, which is critical for improving the validity of dietary recall in both research and clinical settings [21] [22]. This document details the application notes and experimental protocols for validating these tools within the context of portion size estimation research.

Application Notes: Core Technologies and Performance

Mobile dietary assessment tools primarily utilize two technological approaches: image-based analysis using computer vision and AI, and database matching via barcode scanning. Their performance is evaluated against traditional methods and biomarker data.

Image Recognition and Computer Vision

AI-driven image recognition systems use deep learning models, particularly Convolutional Neural Networks (CNNs), to identify food items and estimate their volume and weight from images [22].

Methodology: The process typically involves food detection, classification, and portion size estimation. Advanced systems like goFOOD 2.0 employ computer vision to estimate energy intake from food photographs without manual logging [21].
Performance: These systems can achieve high food classification accuracy, with some models exceeding 90% accuracy on standardized datasets when using advanced architectures like vision transformers [22]. However, accuracy decreases with complex, mixed meals, occluded items, and ambiguous portion sizes [21]. Validation studies show these tools can closely approximate estimations by registered dietitians, though discrepancies remain [21].

Barcode Scanning

Barcode scanning provides a direct method for identifying packaged food items. Users scan the product's barcode to automatically populate its nutritional information.

Function: This method links to extensive food composition databases, retrieving precise data on packaged foods and reducing errors associated with manual entry and food identification [1].
Limitations: Its application is restricted to packaged foods and cannot be used for homemade meals, fresh produce, or meals served without packaging, limiting its utility for capturing the whole diet [1].

The following table summarizes the validity of selected mobile and AI-assisted tools as reported in validation studies.

Table 1: Validity of Selected Mobile and AI-Assisted Dietary Assessment Tools

Tool / Method	Technology Used	Comparison Method	Key Performance Findings	Reference
GDQS App (with cubes or playdough)	Portion size estimation at food group level using 3D cubes or playdough	Weighed Food Record (WFR)	GDQS scores were equivalent to WFR (within a 2.5-point margin). Moderate agreement (κ ≈ 0.57) for classifying risk of poor diet quality.	[3]
goFOOD 2.0	Computer Vision & Deep Learning	Registered Dietitian Estimation	Promising results in preliminary validations; closely approximates expert estimates, but discrepancies exist with complex meals.	[21]
PortionSize App	Smartphone application for portion estimation	Digital Photography	Accurately estimated food intake in grams was equivalent to photography. Overestimated energy intake and most food group intakes (error range: 11% to 23%).	[23]
AI Image-Based Tools (General)	Deep Learning (CNNs, Transformers)	Ground-Truth Data / Dietitian	Food classification accuracy >85-90% on standard datasets. Accuracy varies by food type, lighting, and database coverage.	[21] [22]

Experimental Protocols for Validation

To ensure mobile and AI-assisted applications yield valid and reliable data for research, they must be rigorously validated against established reference methods.

Protocol 1: Validation Against Weighed Food Records

This protocol assesses the accuracy of portion size and nutrient intake estimation under controlled or free-living conditions.

Aim: To validate the output of a mobile application against the gold standard of WFR for the same 24-hour reference period.
Design: A repeated-measures design where each participant serves as their own control [3].
Participants: Recruit a convenience sample of motivated, literate adults. Sample size should be justified by a power analysis; a sample of ~170 has been used successfully in similar studies [3].
Procedure:
- Day 1 (Training): Participants attend an in-person session for training on how to use the dietary scale and complete WFR forms. They are provided with a calibrated digital scale and data collection forms [3].
- Day 2 (WFR Data Collection): Participants weigh and record all foods, beverages, and ingredients consumed over a 24-hour period [3].
- Day 3 (App-Based Assessment): Participants return to the lab and complete a dietary assessment using the mobile/AI application (e.g., using image capture or barcode scanning) for the same 24-hour period recalled on Day 2. The order of using different portion estimation methods within the app should be randomized [3].
Data Analysis:
- Use paired two one-sided t-tests (TOST) to test for equivalence between the app and WFR for metrics like total energy intake or diet quality scores, pre-specifying an equivalence margin (e.g., 2.5 points for a GDQS) [3].
- Assess agreement for food group intake and risk classification using Kappa coefficients [3].
- Apply Bland-Altman analysis to visualize bias and limits of agreement between the two methods [23].

Protocol 2: Validation Against Digital Photography

This protocol is suitable for validating the real-time portion estimation capabilities of an application in free-living or lab-based settings.

Aim: To evaluate the validity of a smartphone application for estimating dietary intake in free-living conditions using digital photography as a criterion.
Design: A pilot study where participants use the application concurrently with photo documentation of their meals.
Participants: A small sample (e.g., n=14) of free-living adults [23].
Procedure:
- Participants use the application to record all food and beverage intake over a period (e.g., 3 consecutive days) [23].
- Simultaneously, for each eating occasion, participants capture high-quality digital photographs of their food items before and after consumption. Photos should be taken from a consistent angle (e.g., 45° for solid foods) with a reference object for scale [23] [5].
- A trained analyst subsequently estimates the weight (grams) and energy (kilocalories) of the consumed food from the photographs.
Data Analysis:
- Perform equivalence tests with pre-defined equivalence bounds (e.g., ±25%) for gram and energy intake [23].
- Use Bland-Altman analysis to quantify the mean bias and precision of the application compared to photography-derived estimates [23].

Protocol 3: Validation of Image-Based Portion Size Estimation

This protocol specifically investigates the optimal conditions for accurate image-based portion estimation.

Aim: To determine the impact of photography angle on the accuracy of food portion size estimation from images.
Design: A controlled, cross-sectional study where participants estimate portion sizes from images taken at different angles.
Participants: Healthy adults without visual impairments [5].
Procedure:
- Prepare a variety of common foods, representing different types (solid, liquid, amorphous) and multiple portion sizes.
- Photograph each food portion at multiple angles (e.g., 0°, 45°, 70° for solid foods; 45°, 60°, 70° for beverages) [5].
- Participants observe actual food portions for a standardized time (e.g., 3 minutes), then are shown a series of images of the same food at different portions and angles, and are asked to select the matching portion.
Data Analysis:
- Calculate accuracy, overestimation, and underestimation rates for each food type and angle.
- Use statistical tests (e.g., Chi-square) to identify significant differences in accuracy across angles.
- Determine the optimal angle(s) for each food type and assess if combining angles improves accuracy [5].

Visualization of Experimental Workflows

The following diagrams illustrate the key experimental protocols using the DOT language, adhering to the specified color palette and contrast rules.

Diagram 1: Workflows for Validation Against Reference Methods

Diagram 2: Image Angle Validation and Core AI Technology Stack

The Scientist's Toolkit: Research Reagent Solutions

This section details essential materials and digital tools required for conducting validation research in this field.

Table 2: Essential Research Materials and Digital Tools for Validation Studies

Item / Solution	Function / Application in Research	Examples / Specifications
Calibrated Digital Scales	To obtain ground truth data for the weight of consumed foods and beverages in WFR studies.	MyWeigh KD-7000 (capacity 7 kg, accurate to 1 g) [3].
Standardized Portion Aids	To provide a physical reference for portion size estimation during app-based interviews, improving standardization.	3D printed cubes of pre-defined volumes; Playdough [3].
Digital Cameras / Smartphones	To capture high-quality food images for validation against apps (as a criterion method) or for building training datasets for AI.	Cameras with standardized settings; smartphones with high-resolution sensors.
AI Food Recognition Models	The core software component for automated food identification and portion estimation in applications.	Convolutional Neural Networks (CNNs), Vision Transformers [22].
Comprehensive Food Composition Databases	To convert identified food items and estimated portions into nutrient data. Must be culturally relevant and updated.	USDA FoodData Central; customized databases for regional foods.
Barcode Database APIs	To allow applications to retrieve nutritional information for packaged foods by scanning barcodes.	Commercial or open-source databases with extensive product coverage.
Biomarkers for Validation	To provide an objective, non-self-report measure for validating energy and specific nutrient intake.	Doubly Labeled Water for energy; Urinary Nitrogen for protein [20].

Application Notes

This document provides application notes and protocols for the integration of portion size estimation methods within two foundational dietary assessment platforms: weighed food records (WFR) and 24-hour dietary recalls (24HR). Validated portion size estimation is critical for minimizing measurement error in nutritional epidemiology, public health monitoring, and clinical drug trials where diet is an exposure or outcome variable.

Recent validation studies demonstrate a trend toward technology-assisted and simplified tools that maintain data quality while reducing participant and researcher burden [24] [3] [6]. The choice of method must be guided by the target population's age, cultural background, and technological literacy, as these factors significantly influence reporting accuracy [25] [6] [26].

Validation Evidence for Portion Size Estimation Methods

The following table summarizes quantitative evidence from recent validation studies comparing various dietary assessment tools and portion estimation methods against reference measures.

Table 1: Summary of Recent Validation Studies for Dietary Assessment Methods and Portion Estimation

Study & Tool/Method	Reference Method	Key Validation Metrics	Population & Context
myfood24-Germany (Web-based 24HR) [24]	Weighed Record & Biomarkers	Nutrients: Correlations 0.45-0.87; Protein intake ~10% lower than urinary biomarker.	German adults (n=97); Real-world validation.
GDQS App with Cubes/Playdough [3] [4]	Weighed Food Record (WFR)	GDQS scores equivalent within 2.5 points (p<0.05); Moderate agreement (κ=0.57-0.58) for diet quality risk.	US adults (n=170); Food group-level estimation.
Interviewer 24HR [25]	Weighed Intake (Feeding Study)	71.4% food item recall rate; Portion sizes overestimated (mean ratio: 1.34).	Older Korean adults (n=119); Asian-style meals.
Foodbook24 (Web-based 24HR) [6]	Interviewer-led 24HR	Strong correlations (r=0.70-0.99) for 58% of nutrients and 44% of food groups.	Brazilian, Irish, Polish adults in Ireland; Multicultural expansion.
NuMob-e-App (Tablet-based record) [26]	24-hour Recall (Telephone)	Protocol established for equivalence testing in adults ≥70 years.	Older German adults (n=~150); Focus on usability and validity.
Online Image-Series Tool [27]	Real Food Portions	Intra-class correlation (ICC) = 0.85 across 15 discretionary foods.	Australian adults (n=114); Portion size norm assessment.

Selection Guidelines for Researchers

The selection of an appropriate dietary assessment platform and portion size method depends on the research objectives, population, and resources.

For High-Accuracy Nutrient Intake Data: Web-based 24HR tools like myfood24 offer a strong balance of accuracy and automation for large-scale studies, demonstrating validity comparable to traditional methods for energy and most nutrients [24].
For Diet Quality Metrics in Diverse Settings: The GDQS app with physical aids like cubes or playdough provides a validated, simplified approach for classifying individuals based on diet quality risk, which is valuable for rapid population-level assessments [3] [4].
For Older or Technologically Naive Populations: Tools require specific adaptations. The NuMob-e-App was co-developed with older adults, featuring a simplified interface [26]. Traditional interviewer-administered recalls may still be preferable, though they show limitations in portion size estimation for amorphous foods [25].
For Multicultural and Multilingual Cohorts: Web-based tools like Foodbook24 can be expanded with translated food lists and culturally-specific items, enabling standardized assessment across diverse groups without sacrificing data accuracy [6].

Experimental Protocols

This section provides detailed methodologies for implementing and validating dietary assessment platforms.

Protocol: Validation of a Web-Based 24HR Tool Against Weighed Records and Biomarkers

This protocol is based on the validation study for myfood24-Germany [24].

Objective: To validate a web-based self-administered 24-hour dietary recall (24HR) tool against the reference method of a weighed dietary record (WDR) and urinary biomarkers.

Materials: Pre-configured web-based 24HR tool (e.g., myfood24), paper-based WDR forms, digital dietary scales, 24-hour urine collection kits (containers, protocol forms), laboratory equipment for urine analysis (e.g., for nitrogen, potassium, creatinine).

Procedure:

Participant Recruitment and Screening: Recruit a sufficient sample size (e.g., ~100 participants) based on power calculation. Participants should be fluent in the tool's language and have stable body weight.
Initial Visit and Training: Instruct participants on how to complete a 3-day WDR, emphasizing the need to weigh and describe all foods, beverages, and leftovers in detail. Provide training on 24-hour urine collection (discard first morning urine, then collect all for 24 hours).
Data Collection Phase:
- Participants complete a 3-day WDR at home.
- On day 3 of the WDR, participants collect a 24-hour urine sample.
- On day 4, participants hand in WDRs and urine sample at the study center.
- At the study center, participants complete their first 24HR using the web-based tool for the intake of day 3 (same day as urine collection).
- Participants complete additional web-based 24HRs at home over the following weeks for other days.
Data Processing:
- Manually code and review all WDRs using a standard nutrient database.
- Process 24-hour urine samples: verify completeness (volume, collection time), and analyze for relevant biomarkers (e.g., urinary nitrogen for protein, potassium).
- Extract nutrient data from the web-based 24HR tool automatically.
Statistical Analysis:
- Method Comparison (vs. WDR): Calculate correlation coefficients for energy and nutrients. Assess mean differences using paired t-tests or Wilcoxon signed-rank tests.
- Biomarker Comparison (vs. Urinary Markers): Calculate intake from urinary biomarkers (e.g., Protein intake = (Urinary N * 6.25) / 0.80). Use concordance correlation coefficients (pc) and weighted Kappa (κ) to assess agreement between reported and biomarker-estimated intake.

Protocol: Validation of Simplified Portion Size Methods for a Diet Quality Score App

This protocol is based on the GDQS app validation study [3] [4].

Objective: To assess whether the Global Diet Quality Score (GDQS) obtained using cubes or playdough for portion size estimation is equivalent to the GDQS from a Weighed Food Record (WFR).

Materials: GDQS mobile application, a set of ten 3D-printed cubes of pre-defined sizes, playdough, calibrated digital dietary scales (e.g., KD-7000), WFR data collection forms.

Procedure:

Study Design: A repeated measures design is used, where each participant uses all methods for the same 24-hour reference period.
Day 1 - Training: Conduct in-person training for small groups on how to use the dietary scale and complete the WFR forms accurately.
Day 2 - Weighed Food Record: Participants weigh and record all foods and beverages consumed over a 24-hour period using the provided scales and forms.
Day 3 - GDQS App Interview: Participants return to the study center.
- They first complete a face-to-face interview using the GDQS app.
- The app randomizes whether cubes or playdough are used first.
- For each of the 25 GDQS food groups consumed, the participant uses the assigned method (cubes or playdough) to indicate the total amount consumed at the food group level.
- After completing the first method, the participant repeats the interview using the second portion size method.
Data Analysis:
- Calculate the GDQS from the WFR and from both GDQS app methods (cubes and playdough).
- Use a paired two one-sided t-test (TOST) to test for equivalence between the GDQS from the WFR and the GDQS from each portion method, with a pre-specified equivalence margin (e.g., 2.5 points).
- Use weighted Kappa statistics to assess agreement in risk classification (e.g., high/moderate/low risk of poor diet quality) between methods.

The workflow for a comprehensive validation study integrating these elements is depicted below.

Diagram 1: Dietary assessment validation workflow.

The Scientist's Toolkit: Research Reagent Solutions

The following table details essential materials and tools required for conducting validation research on dietary assessment platforms.

Table 2: Key Research Reagents and Materials for Dietary Assessment Validation

Item	Function in Research	Example Specifications / Notes
Calibrated Digital Scales	Gold standard for weighing food items in WFR to obtain "true" intake data.	Capacity: ~7 kg; Accuracy: ±1 g (e.g., MyWeigh KD-7000) [3].
24-Hour Urine Collection Kit	Collection of biological samples for biomarker analysis (e.g., protein, potassium).	Includes large container, urine protocol form, cold storage instructions [24].
3D Portion Size Cubes	Standardized, tangible aids for estimating consumed amounts at the food group level.	A set of 10 cubes of predefined volumes corresponding to GDQS food group gram cut-offs [3] [4].
Playdough	A flexible, low-cost alternative to cubes for forming portion size shapes.	Used to estimate total volume consumed per food group [3] [4].
Web-Based 24HR Tool	Automated, self-administered dietary recall for large-scale data collection.	Tools include myfood24 [24], ASA24, Intake24 [28], Foodbook24 [6].
Image-Based Portion Tool	Computer-based assessment of perceived portion size norms using sequential images.	An online tool (e.g., in Qualtrics) with 8+ portion images per food; validated against real foods [27].
Nutrient Composition Database	Backend system for converting reported food consumption into nutrient intake data.	National databases (e.g., BLS in Germany, CoFID in UK/Ireland) are essential for accuracy [24] [6].
Biomarker Assay Kits	Laboratory analysis of urinary biomarkers to objectively assess intake of specific nutrients.	e.g., Dumas method for urinary Nitrogen, Atomic Absorption Spectroscopy for Potassium [24].

Addressing Measurement Challenges and Optimizing Protocol Design for Reliable Data

Accurate portion size estimation is a critical yet challenging component of dietary recall validation research. While general methodologies provide a foundation, the validity of dietary assessment is ultimately determined by how effectively it addresses the unique properties of specific food types. Foods with irregular shapes, mixed dishes with multiple ingredients, and culturally specific items present distinct obstacles for accurate quantification in both self-reported and image-based methods. This application note details these food-specific considerations and provides structured experimental protocols to enhance the precision of dietary recall validation studies, particularly within the broader context of a thesis on portion size estimation methods. Recognizing these distinctions is essential for reducing measurement error and strengthening the validity of diet-disease association studies [17] [29].

Core Challenges in Food Portion Estimation

The physical characteristics of food significantly influence estimation accuracy. The table below summarizes the primary challenges associated with major food categories.

Table 1: Core Challenges in Food Portion Size Estimation by Food Category

Food Category	Key Challenges	Impact on Estimation Accuracy
Irregularly Shaped Solids (e.g., grilled fish, chicken pieces, broccoli)	Lack of standardized geometry; variable surface texture and air pockets; difficult to define a consistent "unit."	High variability in volume-to-visual appearance relationships; leads to significant over- or under-estimation [17].
Amorphous Foods (e.g., cooked rice, pasta, mashed potatoes)	Lack of defined shape; portion compactness and density can vary; susceptible to plate coverage bias (volume perceived differently based on plate size).	High risk of visual misjudgment; considered one of the most difficult categories for accurate estimation [25].
Layered & Mixed Dishes (e.g., lasagna, casseroles, salads)	Individual ingredients are occluded; requires estimation of both overall volume and internal composition.	Nutrient-specific estimation errors; recall of individual components is often incomplete [25].
Liquids & Soups (e.g., broths, stews)	Transparency/opacity affects depth perception; meniscus formation in containers.	Consistent overestimation across studies, particularly when served in bowls [17].
Cultural Foods (e.g., kimchi, dumplings, specific ethnic dishes)	Unfamiliarity to researchers and standardized databases; non-standard serving vessels and utensils.	Systemic bias in dietary data if assessment tools are not culturally adapted [17] [6].

Quantitative Data on Estimation Accuracy

Recent validation studies provide quantitative evidence on how food type and methodology affect estimation accuracy. The following table synthesizes key findings from controlled studies.

Table 2: Food-Specific Estimation Accuracy from Validation Studies

Study & Food Type	Estimation Method	Key Accuracy Metric	Findings
Korean Foods [17]	Multi-angle photography (0°, 45°, 70°)	Percentage of correct portion matches	- Cooked Rice: Highest accuracy at 45° (74.4%), improving to 85.4% with combined angles.- Soup: Lower accuracy across all angles; consistent overestimation.- Beverages: Highest accuracy at 70° (73.2%).
Brazilian Foods [29]	24-hour recall with vs. without photos	Odds of correct estimation (±10% threshold)	- Rice & Beans: Odds of success significantly greater with photo assistance.- Meatballs: Photo use was unsatisfactory, leading to less accurate estimates.- Carrot, Lettuce, Juice: Similar biases with and without photos.
Older Korean Adults [25]	24-hour recall vs. weighed intake	Mean ratio of reported to weighed portion size	Overall portion sizes were overestimated (mean ratio: 1.34). Amorphous foods like rice and cooked vegetables were key contributors to error.
AI-Based Recognition [30]	Hybrid Transformer Model	Correlation with traditional methods	Reported correlation coefficients >0.7 for energy and macronutrient estimation in several studies, though performance varies widely by food type and image quality.

Detailed Experimental Protocols

Protocol for Validating Multi-Angle Photography of Solid Foods

This protocol is derived from the study by Kim et al. (2025) on evaluating food portion estimation accuracy with multi-angle photographs [17].

1. Research Question: What is the optimal photography angle for accurately estimating the portion size of specific solid foods, and does combining multiple angles improve accuracy?

2. Experimental Workflow:

The following diagram outlines the experimental workflow for validating multi-angle photography of solid foods.

3. Key Materials and Reagents:

Table 3: Research Reagent Solutions for Multi-Angle Photography Validation

Item	Specification / Function
Standardized Food Samples	Prepared according to predefined portion sizes (e.g., based on national consumption percentiles). Examples: cooked rice, grilled fish, vegetables, kimchi [17].
Fixed-Angle Camera Mount	Ensures consistent shooting angles (e.g., 0°, 45°, 70° for solid foods) to eliminate operator-induced variability [17].
Reference Portion Photo Album	A set of photographs depicting 5+ different portion sizes for each food type, taken from each validated angle. Serves as the reference during the matching task [17] [29].
Computer-Based Survey System	Software to administer the photo-matching task and collect participant responses and confidence ratings [17].
Calibrated Weighing Scale	High-precision scale for verifying the true weight of food portions served (e.g., to 0.1g accuracy).

4. Detailed Procedure:

Participant Recruitment: Recruit a sufficient sample size (e.g., n=80+) of healthy adults with no visual impairments. Stratify by sex and age groups to assess potential demographic influences [17].
Meal Observation: Present participants with a pre-weighed, plated meal simulating a real eating situation. Standardize observation time (e.g., 3 minutes) [17].
Distractor Task: After observation, engage participants in a non-food-related task (e.g., watching a short video) to clear short-term memory and simulate a recall delay.
Photo-Matching Task: In a separate room, present participants with a series of questions on a computer. For each food item, show the reference photo album for a specific angle and ask the participant to select the photograph that matches the portion they observed.
Data Collection: For each trial, record the accuracy of the selection (correct/incorrect based on the pre-weighed portion) and the participant's confidence in their selection on a 5-point Likert scale [17].
Data Analysis: Calculate accuracy, underestimation, and overestimation rates for each food type and angle. Use statistical models (e.g., logistic regression) to determine the optimal angle and to test if combined angles significantly improve accuracy over single angles.

Protocol for Cultural Adaptation of Dietary Assessment Tools

This protocol is based on the expansion and assessment of the Foodbook24 dietary recall tool for use among diverse populations [6].

1. Research Question: How can a web-based dietary recall tool be effectively adapted to accurately capture food consumption for specific cultural or ethnic groups?

2. Experimental Workflow:

The following diagram illustrates the multi-stage workflow for the cultural adaptation of dietary assessment tools.

3. Key Materials and Reagents:

Table 4: Research Reagent Solutions for Cultural Adaptation of Dietary Tools

Item	Specification / Function
Source National Food Consumption Data	Dietary surveys and nutritional literature from the target country (e.g., Brazil, Poland) to identify commonly consumed foods [6].
Target Language Translation Services	Professional translation of the food list, interface, and instructions into the target language(s) (e.g., Brazilian Portuguese, Polish) [6].
Multicultural Nutrient Databases	Integration of country-specific food composition tables (e.g., Brazilian TBCA, Polish Food Composition Tables) for culturally unique items not found in standard databases [6].
Culturally Relevant Portion Size Aids	Development of photo albums or portion images featuring serving styles and tableware common in the target culture [17] [6].
Qualitative Data Collection Tools	Interview guides and surveys for acceptability testing to gather user feedback on tool relevance and ease of use [6].

4. Detailed Procedure:

Food List Expansion: Review national survey data from the target population to identify frequently consumed foods and beverages. Add these items to the existing food list of the dietary assessment tool [6].
Translation and Localization: Translate the entire food list and user interface into the target language(s). Ensure that food names are culturally appropriate and commonly understood.
Nutrient and Portion Database Update: Assign nutrient composition data to new food items. Prioritize using the target country's national food composition database. For portion sizes, use mean intake data from national surveys or apply standard portion sizes from similar foods [6].
Acceptability Testing: Conduct a qualitative study where participants from the target group list all foods and beverages consumed over a recent period. Calculate the percentage of reported foods that are available in the updated tool's list. A high match rate (e.g., >85%) indicates good representation [6].
Comparative Validation: Conduct a method comparison study. Participants complete a 24-hour recall using the adapted tool and an interviewer-led recall (gold standard) on the same day. Strong correlations (e.g., r > 0.7) for key food groups and nutrients indicate successful adaptation [6].

The Scientist's Toolkit

This section details essential materials and digital solutions for conducting rigorous portion estimation research.

Table 5: Essential Research Reagents and Digital Solutions for Portion Estimation Studies

Category / Item	Function in Research
Standardized Lab Materials
Calibrated Serving Utensils	Ensures precise and reproducible serving of test portions.
Fixed-Angle Photography Setup	Eliminates confounding caused by variable perspective in image-based assessment [17].
Chemical Recovery Biomarkers	Provides an objective, unbiased measure of intake for specific nutrients (e.g., doubly labeled water for energy, urinary nitrogen for protein) for validation [31].
Digital & AI Solutions
Hybrid Transformer Models (AI)	Combines architectures like Vision Transformer and Swin Transformer for high-accuracy food classification and calorie estimation from images [32] [30].
Web-Based 24HR Tools (e.g., Foodbook24)	Self-administered dietary recall tools that can be customized with multi-language support and expanded food lists for diverse populations [6].
Image-Assisted Dietary Assessment Apps	Mobile applications that use portion size estimation via embedded photo atlases or AI-based volume estimation from user-submitted images [30].

Addressing the complexities of irregular shapes, mixed dishes, and cultural foods is not a peripheral concern but a central requirement for advancing dietary recall validation research. The protocols and data presented herein provide a roadmap for incorporating these food-specific considerations into rigorous study design. Employing optimized photography angles, leveraging emerging AI technologies, and systematically adapting tools for cultural relevance are proven strategies for mitigating measurement error. Integrating these focused approaches will significantly enhance the accuracy of portion size estimation, thereby strengthening the foundation of nutritional epidemiology and the validity of subsequent diet-disease association findings.

Determining Minimum Days Required for Reliable Usual Intake Estimation

Accurate estimation of usual dietary intake is fundamental for nutrition research, informing public health policy, and understanding diet-disease relationships. A significant challenge in dietary assessment is the inherent day-to-day variability in an individual's food consumption, which can obscure true long-term intake patterns and complicate the identification of reliable nutritional biomarkers [2]. The determination of the minimum number of recording days required to achieve a stable estimate of usual intake is therefore critical for designing efficient, cost-effective, and participant-friendly studies without compromising data quality [2]. This document outlines application notes and experimental protocols for establishing minimum days required for reliable usual intake estimation, framed specifically within the context of portion size estimation methods used in dietary recall validation research.

Key Concepts and Background

The Problem of Intra-Individual Variation

Day-to-day fluctuations in food consumption present a major methodological challenge in dietary assessment. Individuals do not consume identical foods in identical amounts every day, leading to variability that can mask true habitual intake patterns [2]. This intra-individual variability differs across nutrients and food groups; some dietary components (e.g., water, coffee) show relatively stable daily patterns, while others (e.g., micronutrients, certain vegetables) may vary considerably [2]. Furthermore, systematic variations occur across demographic subgroups and days of the week, with studies consistently showing higher energy, carbohydrate, and alcohol intake on weekends, particularly among younger participants and those with higher BMI [2].

Portion Size Estimation as a Critical Component

The accuracy of portion size estimation significantly influences the overall validity of dietary intake data. Inaccurate self-report of portion sizes represents a major source of measurement error in dietary assessment [12]. Different portion size estimation aids (PSEAs) have been developed to mitigate this error, including food images (image-based, IB-PSE) and textual descriptions of portion sizes (text-based, TB-PSE) [12]. Research indicates that while both methods introduce some measurement error, text-based approaches using household measures and standard portions generally outperform image-based methods, with one study showing TB-PSE resulted in 0% median relative error versus 6% for IB-PSE [12]. This distinction is particularly important when determining minimum recording days, as more accurate portion size estimation methods may reduce the number of days required to achieve reliable usual intake estimates.

Quantitative Data on Minimum Days Requirement

Minimum Days by Nutrient and Food Group

Recent research utilizing large digital cohorts provides specific guidance on minimum recording days required for reliable estimation across nutrient types. Table 1 summarizes these evidence-based recommendations, synthesized from a study of 958 participants who tracked meals for 2-4 weeks using an AI-assisted dietary app [2] [33].

Table 1: Minimum Days Required for Reliable Usual Intake Estimation (r > 0.8)

Nutrient/Food Category	Minimum Days	Reliability (r)	Notes
Water, Coffee	1-2 days	>0.85	Least variable; highest reliability with minimal data
Total Food Quantity	1-2 days	>0.85	Consistent estimation with few days
Carbohydrates	2-3 days	~0.8	Weekend effects significant
Protein	2-3 days	~0.8	More consistent than fats
Fat	2-3 days	~0.8	Slightly more variable than other macronutrients
Micronutrients	3-4 days	~0.8	Higher variability requires more days
Meat	3-4 days	~0.8	Infrequent consumption pattern
Vegetables	3-4 days	~0.8	Variety and preparation methods increase variability
Alcohol	3-4 days	~0.8	Highly variable with weekend effects

Impact of Day Selection and Participant Factors

The selection of specific days for dietary recording significantly impacts the reliability of usual intake estimates. Research demonstrates that including both weekdays and weekends in the recording period increases reliability, as significant day-of-week effects have been observed for multiple nutrients [2]. Intraclass correlation coefficient (ICC) analyses reveal that specific day combinations outperform others, with non-consecutive days that include at least one weekend day providing optimal efficiency [2].

Participant characteristics also influence variability and thus minimum day requirements. Younger participants and those with higher BMI show more pronounced weekend effects for energy, carbohydrate, and alcohol intake [2]. Age, sex, and BMI independently affect dietary reporting patterns, with systematic differences in both magnitude and consistency across demographic segments [2].

Experimental Protocols for Minimum Days Determination

Core Study Design Protocol

Objective: To determine the minimum number of days required to obtain reliable estimates of dietary intake across various nutrients and food groups within a specific population.

Participant Recruitment:

Recruit a minimum of 200 participants to ensure adequate power for variability analysis [2]
Stratify recruitment by age, sex, and BMI categories to ensure representative sampling
Include participants across adult age range (18-65 years) with proportional representation from key demographic segments [2]
Obtain ethical approval and informed consent prior to data collection

Data Collection Period:

Implement a 14-day continuous dietary recording period for all participants [2]
For longitudinal assessment, extend data collection to 28 days for a subset of participants to evaluate longer-term patterns [2]
Include both weekdays and weekends to capture full range of dietary variability
Exclude days with total energy intake below 1000 kcal from analysis as potentially incomplete records [2]

Dietary Assessment Methodology:

Utilize a digital dietary assessment tool with multiple input methods (image capture, barcode scanning, manual entry) [2]
Implement standardized portion size estimation methods, preferably text-based with household measures [12]
Employ trained annotators to verify portions, segmentations, and food classifications [2]
Establish direct communication channels between annotators and participants to clarify uncertainties about logged items [2]

Data Analysis Protocol

Variability Partitioning:

Calculate within- and between-subject variances for each nutrient and food group
Compute the coefficient of variation (CV) for each dietary component using the formula: CV = (within-subject variance / between-subject variance) × 100 [2]
Determine the number of days (n) required to achieve a reliability coefficient of 0.8 using the formula: n = (CV² × r) / (1 - r), where r is the desired reliability [2]

Intraclass Correlation Coefficient (ICC) Analysis:

Perform ICC analysis across all possible day combinations for each nutrient [2]
Calculate ICC values for 1-day, 2-day, 3-day, 4-day, 5-day, 6-day, and 7-day combinations
Identify the point of diminishing returns where additional days provide minimal improvement in reliability
Compare ICC values across different day-type combinations (weekdays only, weekends only, mixed)

Linear Mixed Models (LMM) for Covariate Adjustment:

Employ LMM to analyze effects of age, BMI, sex, and day of week on nutritional intake [2]
Use model formula: Targetvariable ~ age + BMI + sex + dayof_week
Set Monday as reference day and male as reference category for sex [2]
Conduct separate analyses for different demographic subgroups (age groups, BMI categories, sex)
Extract model intercept, coefficients, and p-values for different days of the week

Reliability Threshold Establishment:

Define reliability threshold of r > 0.8 for "good" reliability and r > 0.85 for "excellent" reliability [2]
Identify the minimum number of days required for each nutrient/food group to reach these thresholds
Validate thresholds through bootstrapping or cross-validation techniques

The following diagram illustrates the complete experimental workflow for determining minimum days requirements:

Figure 1: Experimental workflow for determining minimum days required for reliable usual intake estimation.

The Researcher's Toolkit: Essential Materials and Methods

Table 2: Research Reagent Solutions for Dietary Intake Validation Studies

Tool/Resource	Function	Implementation Notes
Digital Dietary Assessment Platform (e.g., MyFoodRepo)	Enables efficient data collection through multiple input methods (images, barcode scanning, manual entry)	Ensure integration with comprehensive food composition database; include portion size estimation aids [2]
Food Composition Database	Provides nutritional information for logged food items	Integrate multiple validated sources (e.g., Swiss Food Composition Database, MenuCH, Ciqual) for comprehensive coverage [2]
Portion Size Estimation Aids (PSEAs)	Improves accuracy of portion size reporting	Implement text-based descriptions with household measures; supplement with image-based aids as secondary option [12]
Linear Mixed Models Software (e.g., R statsmodels)	Statistical analysis of fixed and random effects in repeated measures designs	Include age, BMI, sex, and day of week as fixed effects; participant as random effect [2]
Intraclass Correlation Coefficient (ICC) Algorithms	Quantifies reliability across different day combinations	Implement two-way mixed effects models for absolute agreement; analyze all possible day combinations [2]
Coefficient of Variation (CV) Calculator	Partitions within- and between-subject variability	Calculate for each nutrient/food group; use in formula to determine days needed for target reliability [2]
Standardized Portion Size References (e.g., WHO MONICA)	Provides consistent reference amounts for portion estimation	Utilize established standard portion sizes from validated sources; adapt for local food customs as needed [2]

Application to Portion Size Estimation Research

The determination of minimum days required for reliable intake estimation has particular significance for portion size estimation method validation. When comparing the accuracy of different portion size estimation aids (PSEAs), researchers must account for the inherent day-to-day variability in food consumption, which may interact with the estimation method itself [12].

Protocol Integration:

When validating new portion size estimation methods, implement the minimum days protocol established for the target nutrients
For text-based PSEA validation, a 3-4 day protocol is generally sufficient for most nutrients [12]
For image-based PSEA validation, consider extending the protocol by 1-2 days to account for higher estimation error [12]
Include training sessions for participants on portion size estimation, as even brief training significantly improves estimation accuracy for some food items [34]

Validation Study Design:

Employ a crossover design where participants use different PSEAs in randomized order [12]
Compare reported portion sizes against true intake (weighed values) using Wilcoxon's tests
Calculate proportions of reported portion sizes within 10% and 25% of true intake
Utilize adapted Bland-Altman approaches to assess agreement between true and reported portion sizes [12]
Analyze results by food type (amorphous foods, liquids, single-unit foods, spreads) as accuracy varies significantly across categories [12]

The relationship between portion size estimation methods and study duration requirements is illustrated below:

Figure 2: Integration of portion size estimation method selection with study duration determination.

Based on current evidence from large digital cohort studies, the following recommendations are provided for determining minimum days required for reliable usual intake estimation:

General Protocol: For comprehensive nutritional assessment, a 3-4 day non-consecutive recording period that includes at least one weekend day is sufficient for reliable estimation of most nutrients [2] [33].
Nutrient-Specific Adaptations:
- For studies focusing exclusively on macronutrients or highly consistent dietary components, 2-3 days may be sufficient
- For micronutrients or highly variable food groups, maintain 4-day protocols
- For water, coffee, or total food quantity assessment, 1-2 days provides adequate reliability [2]
Portion Size Method Considerations:
- When using text-based PSEAs, standard minimum day protocols apply
- When using image-based PSEAs, consider increasing recording days by 10-20% to compensate for higher estimation error [12]
- Implement standardized training on portion size estimation regardless of method, as this improves accuracy across all food types [34]
Demographic Considerations:
- For populations with high dietary variability (e.g., younger adults, those with higher BMI), consider adding 1-2 extra recording days
- Ensure proportional representation of demographic subgroups in validation studies to account for different reporting patterns [2]

These evidence-based recommendations enable researchers to optimize dietary assessment protocols for efficiency and participant burden while maintaining scientific rigor in nutritional epidemiology and portion size estimation research.

Strategies for Different Population Subgroups and Clinical Conditions

Accurate portion size estimation is a fundamental yet challenging component of dietary assessment, directly impacting the validity of nutrition research, clinical practice, and public health monitoring [1] [12]. Measurement error from misestimation can obscure diet-disease relationships, compromise clinical dietary guidance, and skew population-level consumption data [35]. This article outlines evidence-based strategies and practical protocols for optimizing portion size estimation across diverse population subgroups and clinical conditions, framed within the context of dietary recall validation research.

Portion Size Estimation Aids (PSEAs): A Comparative Quantitative Analysis

The accuracy of portion size reporting varies significantly depending on the PSEA employed, the food type being assessed, and the characteristics of the study population. The table below synthesizes quantitative findings on the performance of different PSEAs from recent validation studies.

Table 1: Performance of Portion Size Estimation Aids (PSEAs) Across Validation Studies

PSEA Category	Specific Method	Study Population	Key Quantitative Findings
3D Objects	3D Printed Cubes (GDQS App)	170 adults (18+)	GDQS scores were equivalent to Weighed Food Records (WFR) within a pre-specified 2.5-point margin (p=0.006). Moderate agreement (κ=0.57) for classifying poor diet quality risk. [3]
3D Objects	Playdough (GDQS App)	170 adults (18+)	GDQS scores were equivalent to WFR (p<0.001). Moderate agreement (κ=0.58) for classifying poor diet quality risk. Performance was comparable to 3D cubes. [3]
Textual & Household Measures	Text-Based (TB-PSE)	40 adults (20-70 years)	Overall median error rate of 0%. 50% of estimates were within 25% of true intake, and 31% were within 10% of true intake. [12]
Digital Images	Image-Based (IB-PSE)	40 adults (20-70 years)	Overall median error rate of 6%. 35% of estimates were within 25% of true intake, and 13% were within 10% of true intake. [12]
Digital Images	ASA24 with Digital Images	302 women with low incomes	Average overestimation of 7.4g (independent) and 6.4g (assisted) across all foods. Single-unit foods were underestimated; small pieces, shaped, and amorphous foods were overestimated. [35]
Digital Apps	Tablet-Based Dietary Record App	Adults ≥70 years living independently	The app was validated against 24-hour recalls, demonstrating equivalence for estimating energy and key nutrient intakes in an older demographic. [26]

Population-Specific Strategies and Protocols

Older Adults (Aged 70 and Above)

Evidence Base: A 2025 validation study successfully demonstrated the equivalence of a tablet-based dietary record app ("NuMob-e-App") compared to 24-hour recalls for estimating energy and nutrient intakes in adults aged 70 and above living independently [26]. This highlights the feasibility of digital tools for this demographic when appropriately designed.

Recommended PSEAs: Tablet or smartphone applications with simplified interfaces and large, clear portion size images. Traditional methods like playdough or textured models remain viable, especially for individuals with limited digital proficiency [3] [26].

Application Protocol:

Participant Training: Conduct individual, in-person training sessions. Pre-configure the app with the participant's demographic data and guide them through entering at least one practice meal to ensure familiarity [26].
Tool Design: Ensure the app interface has high color contrast, large buttons, minimal on-screen clutter, and intuitive navigation to accommodate visual and motor skill changes [26].
Data Collection: Instruct participants to record intake on multiple non-consecutive days, including a weekend day. Allow documentation during or shortly after meals to minimize memory load. Follow up with 24-hour recall interviews by telephone to collect comparative data [26].
Support: Provide clear written instructions and a contact for technical support to enhance adherence and data quality.

Populations with Low Incomes

Evidence Base: Research on women with low incomes using the Automated Self-Administered 24-hour Dietary Assessment Tool (ASA24) revealed a general trend of overestimating portion sizes across most food categories, with single-unit foods being underestimated [35]. Assistance provided during recall completion did not markedly improve accuracy, indicating a need for better foundational PSEAs.

Recommended PSEAs: Food models (e.g., 3D cubes, playdough) and common household measures (cups, spoons) are highly recommended, as they provide tangible reference objects [3] [36]. Text-based descriptions of portion sizes can also be effective [12].

Application Protocol:

Tool Selection: Utilize validated, tangible aids like playdough or calibrated household utensils, which are low-cost and do not require digital literacy [3] [36].
Interviewer-Administered Recalls: Employ the Automated Multiple-Pass Method (AMPM) in interviewer-administered 24-hour recalls, either in person or by phone. This method uses structured probes to enhance memory and standardize portion size estimation using aids [37] [36].
Food-Specific Guidance: Train researchers to provide specific guidance for commonly misestimated foods. For example, emphasize careful estimation for amorphous foods (e.g., scrambled eggs, pasta) and single-unit foods [35].
Contextual Questioning: Include questions about food sources (e.g., store-bought, restaurant, home-cooked) and, if possible, use of branding or food packaging information to improve identification and portion estimation [37].

General Adult Population

Evidence Base: For the general adult population, a variety of PSEAs have been validated. A 2025 study found that using the GDQS app with either 3D cubes or playdough yielded diet quality scores equivalent to those from weighed food records [3]. Another study found text-based descriptions (TB-PSE) outperformed image-based aids (IB-PSE) in accuracy [12].

Recommended PSEAs: The choice can be tailored to the study's specific resources and goals. Validated options include text-based descriptions with household measures, 3D objects (cubes, playdough), and digital image-based apps [3] [12].

Application Protocol:

Weighed Food Record Validation: For high-accuracy validation studies, use a multi-day protocol. Train participants on using digital dietary scales to weigh all foods, beverages, and ingredients in mixed dishes consumed over a 24-hour period. This serves as the reference method [3].
Experimental Comparison: The following day, administer the dietary recall tool being validated (e.g., GDQS app, ASA24) using the PSEA of interest (cubes, playdough, images). The order of PSEAs should be randomized to avoid bias [3].
Data Analysis: Assess equivalence using statistical tests like the paired two one-sided t-test (TOST) with a pre-specified equivalence margin (e.g., 2.5 points for GDQS). Agreement for food group consumption and risk classification should be evaluated using Kappa coefficients [3].

The Scientist's Toolkit: Essential Research Reagents and Materials

Table 2: Key Materials for Portion Size Estimation Validation Research

Item	Specification/Example	Primary Function in Research
3D Portion Models	Pre-defined 3D printed cubes; Non-toxic playdough	Provides tangible, volumetric reference for estimating food group-level intake; validated for use with metrics like the GDQS. [3]
Digital Dietary Scales	Calibrated digital scale (e.g., KD-7000, capacity 7kg, accuracy 1g)	Serves as the reference method for obtaining true food intake weights in validation studies (Weighed Food Records). [3]
Standardized Food Image Libraries	ASA24 Picture Book; INTAKE24 image sets	Provides visual aids for digital portion size estimation in web-based and app-based dietary recalls. [12] [35]
Validated Dietary Assessment Software	ASA24; GDQS App; Oxford WebQ; INTAKE24	Automates the 24-hour recall process, standardizes data collection, and incorporates portion size estimation aids. [38] [37] [35]
Food Composition Database	Food and Nutrient Database for Dietary Studies (FNDDS); Food Patterns Equivalents Database (FPED)	Converts reported foods and portion sizes into estimated nutrient intakes and food group equivalents. [38] [39]
Multimodal AI Framework	DietAI24 (MLLM + RAG + FNDDS)	A advanced framework for automated, comprehensive nutrient estimation from food images, overcoming limitations of traditional computer vision. [38]

Methodological Workflow for PSEA Validation

The following diagram illustrates a generalized experimental workflow for validating a portion size estimation aid against a reference method, synthesizing protocols from the cited research.

Selecting optimal portion size estimation strategies is contingent on a clear understanding of the target population's characteristics, the specific food types of interest, and the research context. While traditional aids like playdough and text-based measures show robust accuracy, digital tools and advanced AI frameworks like DietAI24 offer promising, scalable solutions for the future of dietary assessment [3] [38] [12]. Integrating these evidence-based protocols into validation research will enhance the accuracy of dietary data, thereby strengthening the foundation of nutritional science, clinical practice, and public health policy.

Training Protocols to Minimize Interviewer and Participant Error

In dietary recall validation research, the accuracy of portion size estimation is paramount. Errors introduced during the interview process or by participants can significantly bias nutrient intake estimates and compromise study validity. This document outlines standardized training protocols and methodological approaches to minimize these errors, ensuring the collection of high-quality, reliable data for research and drug development applications. The guidance is framed within the context of dietary recall validation, where precise portion size estimation is a critical component.

Core Principles for Error Reduction

Implementing a rigorous dietary assessment protocol requires an understanding of potential error sources and a commitment to standardization. The core principles are designed to address both random errors, which reduce precision, and systematic errors (bias), which reduce accuracy [40].

Standardization: Utilize a uniform protocol across all interviewers and study sites to minimize variability in data collection procedures [40].
Participant-Centered Interviewing: Adapt interviewing techniques to be culturally sensitive and cognitively appropriate, accounting for factors like literacy and numeracy [40].
Blinding: Where feasible, blind interviewers to specific study hypotheses or participant groupings to prevent confirmation bias [41].
Quality Control & Feedback: Implement continuous monitoring mechanisms, such as Computer Audio Recorded Interviewing (CARI), to provide interviewers with rapid feedback on their adherence to protocol [42].
Validation: Incorporate objective reference measures, such as doubly labeled water for energy expenditure, to quantify and correct for systematic errors like under-reporting [40].

Quantitative Comparison of Portion Estimation Methods

The choice of portion size estimation method can influence the accuracy of dietary intake data. The following table summarizes key findings from studies comparing different estimation tools.

Table 1: Comparison of Portion Size Estimation Methods in Validation Studies

Estimation Method	Study Population	Reference Method	Key Findings (vs. Reference)	Limits of Agreement (Bland-Altman)
3D Food Models [15]	11-12 year old children (n=70)	Weighed Food Records (implied by design)	Little difference in mean energy intake.	Food weight: -35% to +53%
Intake24 (Online Tool with Photos) [15]	11-12 year old children (n=70)	3D Food Models	Energy intake 1% lower on average.	Food weight: -35% to +53%
24-Hour Dietary Recall [43]	Hemodialysis patients (n=7)	Weighed Food Record	Consistent underestimate of intake.	Difference within 10% of weighed record

Detailed Experimental Protocols

Protocol for a Structured 24-Hour Dietary Recall Interview

This protocol is based on the multiple-pass method, which is designed to minimize memory lapse and improve portion size estimation accuracy [40].

Title: 24-Hour Recall Workflow Diagram Specification:

Materials:

Standardized multiple-pass 24-hour recall script.
Portion size estimation aids (see Section 6: Research Reagent Solutions).
A quiet, private setting for the interview.
Audio recording equipment (with consent) for quality control [42].

Procedure:

Preparation: Train interviewers extensively on the protocol and the use of portion aids. Schedule the interview to cover the previous 24-hour period.
Pass 1 - Quick List: Ask the participant to recall all foods and beverages consumed from waking to sleeping, without interruption.
Pass 2 - Detail Probe: Review each item from the quick list. Ask probing questions about meal timing, food preparation methods, brand names, and additions (e.g., sauces, fats). Use this pass to identify and add commonly forgotten items (e.g., candies, sugary drinks, water) [40] [15].
Pass 3 - Portion Estimation: For each food and beverage item, use standardized aids to help the participant estimate the amount consumed. Ensure the participant, not the interviewer, makes the final estimation.
Pass 4 - Final Review: Summarize the entire day's intake back to the participant for final confirmation, correction, or addition.

Protocol for Validating a New Portion Estimation Tool

This protocol describes a methodology for comparing a new portion estimation tool (e.g., a digital application) against an established reference method.

Title: Tool Validation Protocol Diagram Specification:

Materials:

The new portion estimation tool (test method).
The reference method (e.g., 3D food models, weighed food records).
Food diaries for participant self-recording.
A standardized interview setting.

Procedure:

Participant Recruitment: Recruit a sample that is representative of the target population for the tool.
Food Recording: Provide participants with a food diary to record all foods and beverages consumed over one or more days. Encourage detailed descriptions.
Portion Estimation Interview: Conduct an interview where participants estimate portion sizes for their recorded intake using both the new tool and the reference method. The order in which the two methods are administered should be randomized to prevent order bias [15].
Data Processing: Convert all portion estimates to gram weights and calculate nutrient intakes using a standardized food composition database.
Statistical Analysis: Use Bland-Altman analysis to assess the agreement between the two methods. Calculate the mean difference (bias) and the 95% limits of agreement. A ratio of geometric means close to 1.00 indicates little systematic bias [15].

Structured Interviewer Training to Minimize Bias

Unstructured interviews are a significant source of measurement error. Implementing a structured interview process is critical for reducing bias.

Table 2: Structured Interview Techniques for Dietary Recalls

Technique	Description	Application in Dietary Recalls	Bias Mitigated
Behavioral Questions [41]	Ask about past behavior in a specific situation.	"Tell me about a time you had to describe a complex meal you ate. How did you remember all the details?"	Assesses participant's recall strategy and engagement.
Situational Questions [41]	Present a hypothetical scenario.	"If you were eating a stew with many ingredients, how would you go about describing your portion to me?"	Evaluates participant's understanding and approach to the task.
Standardized Questions & Rubric [41]	Use identical, pre-written questions for all participants and a scoring guide for answers.	Use the exact same probing questions for every participant (e.g., "What type of bread was that?").	Reduces interviewer effects and confirmation bias; improves inter-rater reliability.
Blinded Interviewers [41]	Interviewers are unaware of participant group assignment or study hypotheses.	The interviewer does not know if the participant is in a control or intervention group.	Prevents halo/horn effects and confirmation bias.

Training Protocol for Structured Interviews:

Define Key Traits: Determine the non-cognitive traits essential for a successful interview (e.g., participant conscientiousness, communication clarity) [41].
Develop Questions and Rubric: Create a bank of standardized, behaviorally- or situationally-anchored questions. Develop a clear scoring rubric that defines high-quality and low-quality responses [41].
Interviewer Training: Train interviewers on the questions, the scoring rubric, and the importance of not deviating from the script. Training should include recognizing and mitigating rating errors like central tendency error [41].
Practice and Calibration: Conduct mock interviews with colleagues or volunteers to calibrate scoring among different interviewers [41].

The Scientist's Toolkit: Research Reagent Solutions

Table 3: Essential Materials for Dietary Recall Validation Research

Item / Solution	Function	Example / Specification
Standardized 24-h Recall Software	Provides a structured, multiple-pass interface to guide the interview and data entry, reducing interviewer deviation.	GloboDiet, USDA Automated Multiple-Pass Method (AMPM) [40].
Portion Size Estimation Aids	Visual tools to help participants conceptualize and report the volume or weight of consumed food.	3D food models, life-sized portion photographs, online tools with interactive photos (e.g., Intake24) [15].
Objective Validation Biomarkers	Provides an error-free (or low-error) reference measure to quantify systematic error in self-reported intake.	Doubly Labeled Water (for energy), Urinary Nitrogen (for protein) [40].
Computer Audio Recorded Interviewing (CARI)	Allows supervisors to monitor a subset of interviews for quality control and provide rapid feedback to interviewers [42].	Secure audio recording system integrated with the interview software.
Structured Interview Score Rubrics	Standardized scoring guides to assess participant engagement and understanding during the recall process, improving objectivity [41].	Pre-defined criteria for scoring responses to behavioral and situational questions.

Validation Frameworks and Comparative Analysis of Portion Estimation Methods

In dietary recall validation research, establishing the reliability and accuracy of portion size estimation methods is paramount. The choice of statistical methods directly influences the credibility of validation findings, guiding researchers in determining whether a new assessment tool can be considered a valid substitute for established methods. This article details the application of three fundamental statistical approaches—the Two One-Sided Tests (TOST) for equivalence, Kappa statistics for categorical agreement, and Bland-Altman analysis for measurement concordance—within the context of dietary assessment validation. These methodologies provide complementary perspectives on method performance, together offering a comprehensive validation framework essential for advancing nutritional epidemiology and clinical nutrition practice.

Key Statistical Methods and Applications

Table 1: Core Statistical Methods in Dietary Assessment Validation

Statistical Method	Primary Application	Interpretation Guidelines	Example from Dietary Research
TOST (Two One-Sided Tests)	Demonstrating practical equivalence between two methods [3].	A significant p-value (typically < 0.05) indicates the mean difference between methods falls within a pre-specified equivalence margin (e.g., ±2.5 points on the GDQS) [3].	Validating that portion size estimation using 3D cubes or playdough provides diet quality scores equivalent to weighed food records [3].
Kappa Coefficient (κ)	Assessing agreement for categorical or ordinal data beyond chance [44] [3] [45].	≤ 0.20: Poor; 0.21-0.40: Fair; 0.41-0.60: Moderate; 0.61-0.80: Good; 0.81-1.00: Very Good [44] [46].	Measuring agreement in tertile classification of nutrient intake between a Food Frequency Questionnaire (FFQ) and 24-hour recalls [45].
Bland-Altman Analysis	Visualizing and quantifying agreement between two continuous measurements [44] [46].	Plots the differences between two methods against their mean. The Limits of Agreement (LoA = mean difference ± 1.96 SD) show where 95% of differences lie [46].	Identifying that the difference in protein intake measurement between a diet history and serum biomarkers decreases as actual intake increases [46].

Detailed Experimental Protocols

Protocol 1: Establishing Equivalence with TOST

The TOST procedure is used to validate that a novel dietary assessment method is practically equivalent to a reference method, a common requirement for portion size estimation tools.

Workflow Overview: TOST Equivalence Testing

Step-by-Step Procedure:

Define the Equivalence Margin (Δ):
- Prior to analysis, establish a clinically or practically meaningful equivalence margin (Δ). This represents the maximum acceptable difference between the two methods that is considered negligible.
- Example: In a study validating the Global Diet Quality Score (GDQS) app, an equivalence margin of 2.5 points on the GDQS scale was pre-specified [3].
Calculate the Mean Difference:
- Compute the paired differences between the new method (e.g., portion size estimation with cubes) and the reference method (e.g., weighed food records) for all participants.
- Calculate the mean of these differences.
Construct a Confidence Interval:
- Calculate a 90% two-sided confidence interval (CI) for the mean difference. A 90% CI is used when testing two one-sided hypotheses at the 5% significance level.
Perform Two One-Sided Tests:
- Test 1: Check if the mean difference is significantly greater than -Δ (i.e., the lower equivalence bound).
  - Null Hypothesis (H₀₁): Mean difference ≤ -Δ
  - Alternative Hypothesis (H₁₁): Mean difference > -Δ
- Test 2: Check if the mean difference is significantly less than +Δ (i.e., the upper equivalence bound).
  - Null Hypothesis (H₀₂): Mean difference ≥ Δ
  - Alternative Hypothesis (H₁₂): Mean difference < Δ
Decision Rule:
- If both one-sided tests are statistically significant (p < 0.05 for both), equivalence is concluded. This is equivalent to the entire 90% confidence interval falling within the range -Δ to +Δ [3].

Protocol 2: Assessing Agreement with Kappa Statistics

Kappa statistics are vital for evaluating the agreement between two methods when classifying dietary intake into categories (e.g., tertiles of consumption, risk categories).

Workflow Overview: Kappa Agreement Analysis

Step-by-Step Procedure:

Categorize the Data:
- Classify the dietary intake data obtained from both the new and reference methods into ordinal categories (e.g., low, medium, high; or tertiles/quartiles). For example, a validation study for an FFQ classified participants into tertiles of nutrient intake based on both the FFQ and the average of three 24-hour dietary recalls [45].
Construct a Contingency Table:
- Create a cross-tabulation (contingency table) showing the classification by the new method against the classification by the reference method.
Calculate Kappa:
- Observed Agreement (Pₒ): The proportion of units where the two methods agree, calculated from the contingency table.
- Chance Agreement (Pₑ): The expected proportion of agreements due to random chance, calculated based on the marginal totals of the table.
- Kappa (κ) Formula: κ = (Pₒ - Pₑ) / (1 - Pₑ). This quantifies the level of agreement beyond what is expected by chance alone.
Interpret the Kappa Value:
- Use established benchmarks. For instance, a study validating a diet history against biomarkers interpreted a Kappa of 0.48 as "moderate" and 0.68 as "moderate-good" [44] [46]. The analysis should also report the proportion of participants correctly classified into the same or adjacent category, which was over 78% in a valid FFQ study [45].

Protocol 3: Quantifying Concordance with Bland-Altman Analysis

Bland-Altman analysis provides a comprehensive assessment of the agreement between two methods designed to measure the same continuous variable (e.g., nutrient intake in grams).

Workflow Overview: Bland-Altman Method

Step-by-Step Procedure:

Calculate Key Variables:
- For each participant or measurement pair, calculate:
  - Mean: (Measurement_{Method A} + Measurement_{Method B}) / 2
  - Difference: Measurement_{Method A} - Measurement_{Method B}
Create the Bland-Altman Plot:
- The X-axis represents the mean of the two measurements.
- The Y-axis represents the difference between the two measurements.
Plot and Calculate Key Statistics:
- Mean Difference (Bias): Plot a solid horizontal line representing the average of all differences. This indicates the systematic bias between the two methods.
- Limits of Agreement (LoA): Calculate and plot two dashed horizontal lines at: Mean Difference ± 1.96 * Standard Deviation of the differences. These lines represent the range within which 95% of the differences between the two methods are expected to lie.
Interpret the Plot:
- Assess whether the scatter of differences is random and evenly distributed around zero.
- Check for proportional bias, where the differences increase or decrease as the magnitude of the measurement increases. For example, a pilot study on eating disorders found that the accuracy of dietary protein and iron estimates improved (the difference decreased) as the actual intake increased [46].
- Determine if the width of the LoA is clinically acceptable for the intended use of the dietary assessment method.

The Scientist's Toolkit: Research Reagent Solutions

Table 2: Essential Reagents and Materials for Dietary Validation Studies

Item	Function/Application in Validation
3D Portion Size Models (Cubes)	Standardized, pre-defined volumes to help participants estimate food group-level portion sizes consistently during an interview [3].
Playdough	A flexible, low-cost alternative to 3D cubes for estimating portions of amorphous or oddly shaped foods in dietary recall validation [3].
Calibrated Digital Dietary Scales	Gold-standard tool for conducting Weighed Food Records (WFR), providing the reference data for validating other portion size estimation methods [3].
Digital Dietary Assessment Apps (e.g., GDQS App, NuMob-e-App)	Technology-based tools that standardize data collection, reduce interviewer bias, and can incorporate portion size estimation aids for validation studies [3] [26].
Standardized Food Composition Database (e.g., FNDDS)	Converts reported foods and portion sizes into energy and nutrient intakes, ensuring consistent analysis across compared methods [47].
Biomarker Assay Kits	Provide objective, biochemical validation for specific nutrient intakes (e.g., serum carotenoids for fruit/vegetable intake, urinary nitrogen for protein) to complement self-reported dietary data [48] [49].

The combined application of TOST, Kappa, and Bland-Altman analysis forms a robust statistical foundation for validating portion size estimation methods in dietary research. TOST rigorously establishes practical equivalence, Kappa reliably assesses categorical agreement, and Bland-Altman thoroughly investigates the limits of measurement concordance. By implementing the detailed protocols and utilizing the essential research tools outlined in this article, scientists can generate high-quality, defensible evidence regarding the validity of new dietary assessment methodologies, thereby strengthening nutritional epidemiology and evidence-based dietary guidance.

Within dietary recall validation research, establishing accurate and reliable criterion measures for portion size estimation is a foundational challenge. The weighed food record (WFR) has traditionally been considered the gold standard for quantifying dietary intake in controlled settings due to its precision [50] [51]. However, the method is resource-intensive and can disrupt normal eating patterns. The digital photography (DP) method has emerged as a powerful alternative, offering a less intrusive and potentially more scalable approach while maintaining high accuracy [52] [50]. This Application Note compares these two criterion measures, providing a structured analysis of their performance, detailed experimental protocols, and essential methodological considerations for researchers designing validation studies.

Comparative Analysis of Method Performance

The validity of digital photography against weighed food records has been assessed across diverse populations and settings. Key quantitative findings from recent validation studies are summarized in the table below.

Table 1: Performance Comparison of Weighed Food Records and Digital Photography as Criterion Measures

Study Setting & Population	Key Quantitative Findings (vs. WFR)	Agreement & Statistical Analysis	Reference
Hospital Inpatients (Nurses/HCAs estimating 27 meals)	DP overestimation: 4.7 ± 15.8%Food Record Chart overestimation: 3.2 ± 14.7%	Bland-Altman plots showed limited variation; Inter-rater agreement (Kendall's W): 0.682	[50] [53]
Preschool Children in Head Start Centers (N=41 children)	No significant differences for macro/micronutrients or food groups	High agreement in Bland-Altman regression for nutrients/food groups; Linear mixed models showed no significant difference	[52]
Rural Bolivian Women (N=45, FP 24-hR method)	Food intake differences: -2.3% (cassava) to +8.7% (leafy vegetables)Nutrient intake differences: -0.90% (Vit C) to -5.98% (fat)	Spearman's correlations: 0.75 (eggs) to 0.98 (potato/cassava)Nutrient Pearson's coefficients: >0.93	[51]
Global Diet Quality Score (GDQS) App (N=170 adults, cubes/playdough)	GDQS equivalents within pre-specified 2.5-point margin (p<0.006)	Moderate agreement for poor diet quality risk (κ ≈ 0.57)Substantial agreement for 22/25 food groups	[4] [54]

Detailed Experimental Protocols

Weighed Food Record (WFR) Protocol

The WFR protocol involves direct measurement of food weight before and after consumption to calculate exact intake.

Table 2: Key Research Reagents and Materials for Weighed Food Records

Item	Specification/Function
Digital Food Scale	High-precision (e.g., 0.1-g resolution), calibrated regularly for accurate pre- and post-consumption weights.
Standardized Containers	Use of uniform plates, bowls, and cups to minimize variance in food presentation and weighing.
Data Collection Sheets	Structured forms for recording participant IDs, food items, pre-/post-weights, and notes on spills or extra portions.

Procedural Workflow:

Pre-Consumption Weighing: Weigh and record each food item in grams served to the participant in its standardized container [52].
Meal Service: Serve the pre-weighed meal. Document any additional portions requested by weighing them separately [52].
Post-Consumption Weighing: After the meal, collect all leftovers in their original containers and weigh them [50].
Data Calculation: Calculate food consumed (g) as: Pre-weight - Post-weight. Convert weights to energy and nutrient intakes using appropriate food composition databases [52].

Digital Photography (DP) Protocol

The DP method uses standardized photographs of meals taken before and after consumption, with intake estimated by comparing these images to reference guides.

Table 3: Key Research Reagents and Materials for Digital Photography

Item	Specification/Function
Digital Camera/Smartphone	Fixed on a tripod with consistent resolution and lighting (e.g., using a photo cube) to standardize images.
Standardized Placemat & Tableware	Placemat with grid and ID information; uniform plates/bowls to ensure consistent food presentation and scale.
Food Portion Guide/Photo Atlas	Reference images of study foods at varying consumption levels (e.g., 0-100% in 10% intervals) for estimation.

Procedural Workflow:

Setup and Calibration: Set up a camera on a tripod at a fixed angle (e.g., 45°) and distance (e.g., 1 foot). Use a standardized placemat with a grid and participant ID [52] [51].
Pre-Consumption Photo: Take a photograph of the fully served meal before eating begins [52].
Meal Service: Participant consumes the meal. Note any extra portions on the placemat.
Post-Consumption Photo: Reposition leftovers on the placemat identically to the pre-meal setup and take a photo [52].
Image Analysis: Trained analysts estimate the percentage of each food consumed by comparing pre- and post-photos side-by-side using the food portion guide [52]. For advanced frameworks like DietAI24, multimodal large language models (MLLMs) with Retrieval-Augmented Generation (RAG) can automate food recognition, portion size estimation, and comprehensive nutrient analysis by querying authoritative databases like FNDDS [38].

Method Selection Framework

The choice between WFR and DP as a criterion measure depends on the specific research context, constraints, and objectives. The following diagram outlines the key decision factors.

Both WFR and DP are valid criterion measures for portion size estimation. The decision hinges on a trade-off between the utmost precision offered by WFR and the practical advantages, scalability, and still-high accuracy of DP. Researchers should select the method that best aligns with their specific validation goals, operational constraints, and the ecological context of their study. Emerging technologies, particularly AI-powered image analysis frameworks like DietAI24, are poised to enhance the accuracy and scope of photographic methods further [38].

Accurate portion size estimation is a cornerstone of valid dietary intake assessment, directly impacting the quality of data in nutritional epidemiology, clinical trials, and public health monitoring [35] [55]. Traditional methods have relied on physical aids, such as household utensils, 3D models, and playdough, to help respondents quantify consumption. The proliferation of digital technologies has introduced new tools, including smartphone applications with photo-based methods and Artificial Intelligence (AI)-powered image recognition [56]. This application note synthesizes current evidence to provide a comparative analysis of the performance of physical aids versus digital tools across various food groups. It further offers detailed experimental protocols for validating these methods within dietary recall research.

Quantitative Performance Comparison

The table below summarizes the reported accuracy of various portion size estimation methods as identified in recent validation studies.

Table 1: Comparative Accuracy of Portion Size Estimation Methods Across Food Types

Food Group / Type	Physical Aids & Methods	Reported Performance	Digital Tools & Methods	Reported Performance
Cooked Rice	3D Cubes (45° angle) [3] [55]	74.4% accuracy [55]	Multi-angle photos (45° & 70°) [55]	85.4% accuracy (combined angles) [55]
Amorphous/Soft Foods	Playdough [3]	Equivalent to WFR* (GDQS metric) [3]	Online 24-hr recall (ASA24) [35]	Significant overestimation [35]
Liquid Soups	3D Cubes [3]	Equivalent to WFR* (GDQS metric) [3]	Single-angle photos [55]	Low accuracy, high overestimation [55]
Single-Unit Foods	Playdough [3]	Equivalent to WFR* (GDQS metric) [3]	Online 24-hr recall (ASA24) [35]	Significant underestimation [35]
Beverages	3D Cubes (70° angle) [3] [55]	Equivalent to WFR* (GDQS metric) [3]	Single photo (70° angle) [55]	73.2% accuracy [55]
Grilled Fish	3D Cubes [3]	Equivalent to WFR* (GDQS metric) [3]	Multi-angle photos [55]	Moderate accuracy, improves with combined angles [55]
Mixed Dishes (General)	Playdough [3]	Equivalent to WFR* (GDQS metric) [3]	Automatic Image Recognition (AIR) [56]	86% dish identification accuracy [56]
Vegetables (e.g., Kimchi)	3D Cubes (45° angle) [3] [55]	52.4% accuracy (Kimchi) [55]	Multi-angle photos [55]	53.7% accuracy (combined angles for vegetables) [55]

*WFR: Weighed Food Records, used as the validation benchmark.

Detailed Experimental Protocols

To ensure reliable and reproducible results in dietary recall validation research, standardized protocols are essential. The following sections detail methodologies for key approaches cited in the comparative analysis.

Protocol 1: Validation of Physical Aid-Based Methods Using the GDQS App

This protocol is adapted from a validation study assessing the equivalence of portion size estimation using cubes and playdough with the Global Diet Quality Score (GDQS) application against weighed food records (WFR) [3].

Objective: To assess whether the GDQS metric obtained using cubes or playdough as portion size estimation methods provides data equivalent to the GDQS metric estimated by WFR for the same 24-hour reference period.
Study Design: A repeated measures design is recommended, where each participant completes all assessment methods for the same intake period.
Participants: Recruit a convenience sample of adults (e.g., n=170). Participants should meet eligibility criteria such as being 18 years or older, fluent in the local language, and able to comply with the study procedures, including not consuming mixed dishes prepared outside the home during the recording period [3].
Materials:
- GDQS Application: Installed on a tablet or smartphone.
- Physical Aids: A set of ten 3D-printed cubes of pre-defined volumes and non-toxic, moldable playdough.
- Calibrated Digital Dietary Scales (accurate to 1g) and paper data collection forms for WFR.
Procedure:
- Day 1 - Training: Conduct an in-person training session (40-60 minutes) for participants on how to weigh and record all foods, beverages, and ingredients using the provided digital scale and WFR forms.
- Day 2 - Weighed Food Record: Participants weigh and record all consumed items over a 24-hour period.
- Day 3 - GDQS App Interview: Participants return for a face-to-face interview. Using the GDQS app, a trained interviewer guides the participant through a 24-hour recall. The app randomizes the order in which cubes or playdough are used. For each food group consumed, the participant uses the assigned aid to represent the total volume consumed.
Data Analysis:
- Use the paired two one-sided t-test (TOST) to statistically test for equivalence between the GDQS scores from the WFR and each aid, specifying a pre-determined equivalence margin (e.g., 2.5 points for the GDQS) [3].
- Calculate agreement statistics (e.g., Kappa coefficient) for classifying individuals into risk categories for poor diet quality based on the different methods.

Protocol 2: Validation of Digital Photo-Based Methods with Angle Optimization

This protocol is based on research investigating the optimal camera angles for accurately estimating portion sizes of different food types using photographs [55].

Objective: To evaluate the validity of estimating food quantities using photographs taken at different angles and to identify the optimal angle for specific food groups.
Study Design: A cross-sectional experimental design where participants estimate portion sizes from photographs after observing a controlled meal.
Participants: Recruit healthy adults (e.g., n=82) with no visual impairments that could affect portion estimation. Stratify recruitment by sex and age to ensure a diverse sample [55].
Materials:
- Standardized Meals: Prepare multiple meal compositions representing typical consumption. Meals should include challenging food types (e.g., amorphous, liquid, irregular).
- Photographic Database: High-quality images of each food type at 5-7 different portion sizes. Capture each portion from multiple angles (e.g., 0°, 45°, 70° for solids; 45°, 60°, 70° for liquids) under consistent lighting conditions [55].
- Computer-Based Survey System to present images and record participant responses.
Procedure:
- Meal Observation: Participants observe a pre-portioned meal for a fixed time (e.g., 3 minutes), simulating a real eating scenario.
- Distraction Task: Participants engage in a short, non-food-related task (e.g., watching a 2-minute video) to prevent short-term memory bias.
- Photo-Matching Task: Participants are shown a series of images on a computer. For each food item, they are presented with a sequence of images showing different portion sizes from a single angle and must select the image that matches the portion they observed.
- Confidence Rating: After each selection, participants rate their confidence on a Likert scale (e.g., 1-5).
- The task repeats for all food items and all pre-determined angles. The order of angles and food items should be randomized.
Data Analysis:
- Calculate the accuracy rate (percentage of correct matches) for each food type and angle.
- Compare accuracy rates across angles for each food type using statistical tests like Chi-square.
- Analyze underestimation and overestimation rates to identify systematic biases.
- Report results for combined angles where accuracy is significantly improved.

Protocol 3: Validation of AI-Based Automatic Image Recognition

This protocol outlines the procedure for evaluating the performance of an AI-powered application for automatic food identification and reporting [56].

Objective: To assess the accuracy and time efficiency of an automatic image recognition (AIR) dietary reporting app compared to a traditional voice input reporting (VIR) app in an authentic dining context.
Study Design: A two-group parallel randomized controlled trial.
Participants: Recruit a cohort of target users (e.g., n=42 young adults). Participants should be regular smartphone users.
Materials:
- Two Mobile Applications: The experimental AIR app, which allows users to take a single photo of a meal for automatic multi-dish recognition, and a control VIR app, which relies on voice inputs for food reporting [56].
- Standardized Test Meals: A series of typical meals (e.g., 17 different dishes) with known composition and portion sizes.
- Data Logging System: To automatically record task completion time and user interactions within the apps.
- Validated Usability Questionnaires: e.g., the System Usability Scale (SUS).
Procedure:
- Randomization: Randomly assign participants to either the AIR or VIR group.
- App Training: Provide standardized training on the assigned app to all participants.
- Testing: Participants use their assigned app to report a series of test meals in a controlled but realistic setting. All participants use the same smartphone model to control for device variability.
- Data Collection: The system logs the accuracy of food identification, the time taken to complete the reporting for each meal, and user errors.
- Usability Assessment: Participants complete the SUS and/or a semi-structured interview about their user experience.
Data Analysis:
- Compare the dish identification accuracy (%) between the AIR and VIR groups using appropriate statistical tests (e.g., t-test).
- Compare the average time efficiency (seconds per meal report) between groups.
- Analyze SUS scores to evaluate perceived usability.

Workflow Diagram for Method Selection and Validation

The following diagram illustrates a logical decision workflow for selecting and validating a portion size estimation method based on research objectives and food types, as derived from the reviewed literature [3] [35] [55].

Diagram 1: A decision workflow for selecting and validating portion size estimation methods in dietary research.

The Scientist's Toolkit: Essential Research Reagents & Materials

This table details key materials and tools required for implementing the experimental protocols described in this document.

Table 2: Essential Research Reagents and Materials for Portion Size Estimation Validation

Item Name	Specifications / Brand	Primary Function in Research
3D-Printed Cubes	Set of 10 cubes of predefined volumes, calibrated for GDQS food groups [3].	Standardized physical volume representation for portion size estimation during 24-hour recall interviews.
Non-Toxic Playdough	Moldable, reusable, color-contrasting with typical foods.	Flexible physical aid for estimating portions of irregularly shaped and amorphous foods.
Calibrated Digital Dietary Scale	Capacity ≥7kg, accuracy to 1g (e.g., KD-7000) [3].	Gold-standard measurement for validating all other methods via Weighed Food Records (WFR).
Global Diet Quality Score (GDQS) App	Mobile application for standardized dietary data collection and analysis [3].	Digital platform for conducting recalls with integrated portion size estimation using physical aids.
Automated Self-Administered 24-hr Recall (ASA24)	Web-based tool from NCI with portion size images [35].	A benchmark digital tool for self-administered dietary recalls, useful for comparison studies.
Standardized Food Image Database	Library of food photographs at multiple portion sizes and optimized angles (0°, 45°, 70°) [55].	Critical reference material for validating and training photo-based and AI estimation methods.
AI-Powered Dietary App (AIR)	Custom or commercial app with automatic multi-dish image recognition [56].	Tool for testing the efficacy of fully automated food identification and portion estimation.

Agreement Metrics for Food Groups, Nutrients, and Diet Quality Scores

Accurate dietary assessment is fundamental for public health monitoring, nutritional epidemiology, and understanding the relationship between diet and disease [1]. The validation of dietary recall methods hinges on robust agreement metrics that quantify how well new assessment tools perform against reference methods [3] [58]. Within the specific context of portion size estimation method validation—a critical source of measurement error in dietary assessment—selecting appropriate agreement metrics is paramount for evaluating the reliability of innovative data collection tools [3] [55]. This document outlines the core agreement metrics, experimental protocols, and analytical frameworks essential for rigorous validation research in dietary recall, with a particular focus on portion size estimation.

Core Agreement Metrics for Dietary Recall Validation

The table below summarizes the key agreement metrics used in dietary recall validation studies, their applications, and interpretations.

Table 1: Key Agreement Metrics in Dietary Recall Validation Research

Metric Category	Specific Metric	Application in Dietary Recall	Interpretation Guidelines
Equivalence Testing	Two One-Sided Tests (TOST) [3]	Determines if a new method (e.g., GDQS app) is equivalent to a reference method (e.g., Weighed Food Record) within a pre-specified margin.	A significant p-value (e.g., p < 0.05) indicates statistical equivalence. The equivalence margin (e.g., ±2.5 points for GDQS) must be defined a priori [3].
Categorical Agreement	Kappa Coefficient (κ) [3]	Measures agreement in classification, such as risk categories for poor diet quality (e.g., low, moderate, high risk).	κ ≤ 0.20: Slight; 0.21–0.40: Fair; 0.41–0.60: Moderate; 0.61–0.80: Substantial; 0.81–1.00: Almost Perfect [3].
Continuous Variable Agreement	Bland-Altman Analysis [59] [23]	Assesses agreement between two methods by plotting the difference between methods against their mean. Calculates Limits of Agreement (LOA).	Visually identifies systematic bias (mean difference ≠ 0) and whether the dispersion of differences is related to the magnitude of measurement.
Correlation	Spearman's Rank Correlation (ρ) [59]	Evaluates the strength and direction of the monotonic relationship between two methods, useful for ranked data.	ρ = -1 to +1. Values closer to ±1 indicate a stronger monotonic relationship.
Nutrient-Specific Accuracy	Mean Absolute Error (MAE) [38]	Measures the average magnitude of errors in nutrient estimation (e.g., energy, protein) against a reference.	Lower values indicate higher accuracy. Useful for comparing performance across different nutrients or methods.

Experimental Protocols for Validation Studies

Protocol for Validating Portion Size Estimation Aids

This protocol is adapted from a study validating cube and playdough aids for the Global Diet Quality Score (GDQS) app [3].

Objective: To assess whether the GDQS metric obtained using portion size estimation aids (cubes or playdough) with the GDQS app is equivalent to the GDQS derived from a Weighed Food Record (WFR).

Design: A repeated measures design is employed, where each participant uses both the reference and test methods for the same 24-hour reference period.

Participants: Approximately 170 adults aged 18 years or older. A convenience sample is often sufficient for method comparison studies, as national representativeness is not required for the primary objective [3].

Materials:

Reference Method: Calibrated digital dietary scales (accurate to 1 g), WFR data collection forms.
Test Methods: GDQS app installed on a smart device, a set of ten 3D-printed cubes of pre-defined sizes, or playdough.
Training Materials: Guides and videos for completing the WFR.

Procedure:

Day 1 (Training): Participants attend an in-person training session (40-60 minutes) in small groups. They are trained on how to use the dietary scale and record all foods, beverages, and ingredients in mixed dishes consumed over a 24-hour period.
Day 2 (WFR Data Collection): Participants weigh and record all consumed items during a 24-hour period using the provided scales and forms.
Day 3 (GDQS App Interview): Participants return to the research facility. They first submit their completed WFR forms. Then, a face-to-face interview is conducted using the GDQS app. The app randomizes the order in which the cubes or playdough are used. Participants estimate their previous day's consumption using one aid, and then repeat the process with the other aid.

Data Analysis:

Calculate the GDQS from the WFR and from the GDQS app for both cubes and playdough.
Perform a paired Two One-Sided T-test (TOST) to evaluate equivalence between the WFR-derived GDQS and the app-derived GDQS, using a pre-specified equivalence margin (e.g., 2.5 points) [3].
Use the Kappa coefficient to assess agreement in classifying participants into risk categories for poor diet quality (e.g., high, moderate, low risk) between methods.
Calculate food group-specific agreement rates or Kappa statistics to identify food groups with particularly high or low agreement (e.g., liquid oils often show low agreement) [3].

Protocol for Validating Image-Based Dietary Assessment Methods

This protocol synthesizes elements from controlled feeding studies and image analysis validation research [55] [58].

Objective: To evaluate the accuracy of energy and nutrient intake estimation from food images compared to true (observed) intake under controlled conditions.

Design: A randomized crossover controlled feeding study.

Participants: Approximately 150 participants. This design allows for within-participant comparisons across different methods.

Materials:

Controlled Meals: Standardized breakfast, lunch, and dinner, with all foods and beverages unobtrusively weighed before and after consumption to determine true intake.
Image Capture Tools: Smartphones with a dedicated dietary assessment app (e.g., mobile Food Record app).
Assessment Methods: Various technology-assisted 24-hour recall (24HR) platforms (e.g., ASA24, Intake24).

Procedure:

Randomization: Participants are randomized to a sequence of feeding days and subsequent 24HR methods.
Controlled Feeding: On a feeding day, participants consume all three meals in a controlled setting. True intake is determined by weighed data.
Image Capture: For meals where the method requires it, participants capture images of their meals before and after consumption using a smartphone app.
Dietary Recall: The following day, participants complete a 24HR using the assigned method (e.g., ASA24, Intake24, an image-assisted recall).
Crossover: After a washout period, participants repeat the process with a different feeding day and/or a different 24HR method.

Data Analysis:

For each method, calculate the mean difference (bias) between estimated and true intake for energy and key nutrients, often expressed as a percentage of true intake [58].
Use linear mixed models to assess differences in accuracy among the methods, accounting for the crossover design and participant effects.
Perform Bland-Altman analysis to visualize the agreement and identify any proportional bias.
Compare the variances of estimated and true intakes to see if the method accurately captures the population distribution of intake.

Visualization of Research Workflows

Dietary Recall Validation Workflow

The diagram below outlines the general workflow for validating a novel dietary assessment tool, such as a portion size aid or an image-based system.

Diet Quality Score Calculation Logic

The Global Diet Quality Score (GDQS) and similar metrics use a structured logic to convert food consumption data into a summary score. The following diagram illustrates this process.

The Scientist's Toolkit: Research Reagents & Materials

Table 2: Essential Materials for Dietary Recall Validation Studies

Item	Function/Description	Example Use Case
Calibrated Digital Scales	Precisely weighs foods and beverages to the gram, serving as the reference standard for true intake.	Weighed Food Records (WFR) in controlled feeding or free-living studies [3] [58].
3D-Printed Cubes	Standardized volumetric aids for estimating portion sizes at the food group level.	Used with the GDQS app to help participants estimate how much they consumed from each food group [3].
Playdough	A flexible, low-cost alternative to cubes for estimating portion sizes of amorphous or oddly shaped foods.	Validated for use with the GDQS app as an alternative to 3D cubes, showing equivalent performance [3].
Multimodal Large Language Model (MLLM) with RAG	An AI framework that recognizes foods in images and retrieves nutrient data from authoritative databases.	DietAI24 system uses this technology for automated, comprehensive nutrient estimation from food photos [38].
Food and Nutrient Database	A standardized repository of food composition data used to convert food intake into nutrient values.	FNDDS is used in the US. The GDQS metric is unique as it is food-based and does not require such a database for scoring [60] [38].
Standardized Photographic Aids	A set of images depicting various foods at different portion sizes and angles.	Used to improve accuracy in 24-hour recalls or as a reference for participants estimating their own intake [55].

Conclusion

The validation of portion size estimation methods is fundamental to advancing nutritional epidemiology and clinical research. Current evidence demonstrates that both physical aids like 3D cubes and playdough and emerging digital tools can provide valid alternatives to traditional weighed food records when properly implemented. Key takeaways include the need for method selection based on specific food types, the importance of standardized protocols and training, and the value of combining multiple assessment days including weekend days to capture usual intake. Future directions should focus on developing food-specific estimation strategies, particularly for challenging items like liquid oils and mixed dishes, and expanding validation research in diverse populations and clinical conditions. For biomedical and clinical research, these advancements enable more precise measurement of diet-disease relationships and more effective monitoring of nutritional interventions in drug development and clinical trials.