The European Journal of Orthodontics Advance Access originally published online on January 13, 2006
The European Journal of Orthodontics 2006 28(3):274-281; doi:10.1093/ejo/cji098
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
The use of anthropometric proportion indices in the measurement of facial attractiveness
* Department of Orthodontics, Kingston Hospital, Kingston University, Surrey, UK
** School of Computing and Information Systems, Kingston University, Surrey, UK
Address for correspondence Raymond Edler, Orthodontic Department, Kingston Hospital, Galsworthy Road, Kingston, Surrey KT2 7QB, UK. E-mail: edler{at}richmond163.freeserve.co.uk
| Summary |
|---|
|
|
|---|
This study used anthropometric data in the form of Farkas' proportion indices in order to quantify facial attractiveness, and to relate measured change through surgery, to clinical judgement. Standardized photographs of 15 orthognathic patients were used in album form and rated by 10 experienced clinicians: album 1 for facial attractiveness (before surgery) and album 2 for improvement in facial attractiveness (before and after surgery). Twenty-five proportion indices were selected and linear measurements recorded from the pre- and post-surgical photographs. The corresponding change in indices and in clinicians' scores were compared.
The clinicians' assessment of the degree of improvement in facial appearance achieved through surgery, related closely to the scores produced by the change in proportion indices (r = 0.698, P = 0.004). Clinical assessment demonstrated a clear inverse relationship between initial attractiveness rating and the degree of improvement achieved through orthognathic surgery (r = 0.781, P = 0.001). The results showed good repeatability in terms of clinical assessment, photography and digitization. The method would appear to have potential for further development, possibly into a facial attractiveness index for the objectives of quantification of improvement achieved through treatment.
| Introduction |
|---|
|
|
|---|
The ability to quantify facial appearance and any improvement achieved through treatment, would clearly help in the objective assessment of treatment quality. It would also be useful in monitoring growth changes. Currently, attempts at the objective assessment of facial appearance involve panels, in which rating or ranking by a group of professional or lay individuals is undertaken. The validity of this method depends on the notion that the panel will reach a reasonable level of agreement; fortunately, there is convincing evidence that high levels of agreement amongst panellists are usually obtained (Thakera and Iwawaki,1979
It has been shown that the common denominator shared by those individuals consistently judged to be the most attractive, is the fact that their facial proportions tend to be relatively near to the mean of the population, within their racial group (Symons, 1979
; Langlois and Roggman, 1990
; Strzalko and Kaszycka, 1991
; Grammer and Thornhill, 1994
). Whilst averageness is not the sole factor involved in attractiveness (Alley and Cunningham, 1991
; Cunningham et al., 1990
; Perrett et al., 1994
), it would appear to be one of the most important factors. Accordingly, if averageness is important in facial aesthetics, then average facial proportions could provide the basis for a quantitive assessment for aesthetics. A great deal of highly relevant data representing mean proportions is available through the extensive work of Farkas (1994)
and Farkas and Munro (1987)
. This evidence derives from the Toronto anthropometric growth study, involving manual measurements, carried out during the years 1967 to 1984 and involving over 2500 individuals (Farkas and Munro, 1987
). The resultant data has been used to provide a battery of individual facial ratios, presented as proportion indices (Farkas and Munro, 1987
), simply involving two linear measurements, the smaller expressed as a percentage of the larger. One hundred and sixty seven such indices, involving cranial and facial measurements are incorporated in this work. Manual anthropometry requires a considerable level of proficiency based on experience (Kolar and Salter, 1997
) and is rather time-consuming, so alternative methods have been sought, primarily involving the use of photography (Vegter et al., 1997
; Becker and Svensson, 1998
; Berger et al., 1999
; Vegter and Hage, 2001
). Such photographic techniques have been shown to be entirely valid, providing the points chosen are readily identifiable (Farkas et al., 1980
) and the correct photographic technique is used. Vegter et al. (1997)
demonstrated the way in which changes in proportion indices effected by treatment can be used to record aesthetic improvement. It would seem that the potential exists for developing this principle further.
Accordingly, the aim of this study was to identify whether there might be a use for Farkas' proportion indices as an indicator of facial appearance:
- By identifying pre- and post-treatment changes in a battery of proportion indices to quantify the overall change produced by surgical-orthodontic treatment, in a selected group of patients.
- To relate the magnitude of change (identified by the pre- and post-treatment proportion indices scores) to clinical judgement.
- To identify a possible facial index for quantifying changes in attractiveness, based on the proportion indices data.
| Subjects and methods |
|---|
|
|
|---|
Facial photographs of orthognathic patients formerly treated in the maxillofacial unit at Queen Mary's Hospital, Roehampton, London, UK were identified. Standardized full face and profile photographic prints of 15 Caucasian patients were used (9 male, 6 female; age range 2044 years, median 22 years), the facial images having been carefully selected to present a range of facial imbalance. The photographs had all been taken by the same medical photographer, using a standardized approach (Bengel, 1985
Clinical assessment
Two albums were constructed. Album 1 consisted of the 15 patients' pre-treatment photographs, i.e. full face and profile (4 x 6 inch). The photographs were placed in a random sequence and a VAS of 100 mm was used for the assessment of the attractiveness of the individual. Ten experienced clinicians, comprising five orthodontists (3 male, 2 female) and five maxillofacial surgeons (4 male, 1 female) were asked to rate the series of photographs for facial attractiveness, using the VAS which extended from very unattractive (low) to very attractive (high). The clinicians were deliberately given ample time to study the images. After a two-week period, the same clinicians were asked to repeat their ratings. Album 2 was presented to the same panellists several months later. Each album consisted of the same 15 patients images but this time, post-treatment as well as pre-treatment images were incorporated and an alternative VAS extending from extreme worsening (low) to extreme improvement (high) was used to enable the clinicians to assess the degree of improvement in attractiveness, following orthognathic surgery. Again, the process was repeated two weeks later. All the pre- and post-treatment photographs in album 2 were deliberately presented in black and white format, to prevent bias through artificial alteration in colour.
In order to investigate the relationship between pre-treatment attractiveness as rated in album 1 and the subsequent degree of improvement as rated in album 2, the scores for each patient were ranked and the median rankings of all the patients in albums 1 and 2 correlated. Ranked data was investigated instead of actual VAS scores, since it has been shown that clinicians may vary in their use of the full VAS range (Phillips et al., 1992
). This allowed relative changes, rather than absolute values, to be investigated.
Proportion indices
Twenty-five of the proportion indices published by Farkas and Munro (1987)
were used. The basis for selection was that the landmarks should be reliably identified on photographs (Farkas et al., 1980
) and the resultant indices be potentially changed by the effects of orthognathic surgery (Table 1).
|
The images were imported as bitmap files to an on-screen digitising program (IPTool; Greenhill et al., 2000
The clinicians' assessment rankings were compared with the measurement of change in Farkas' proportion index scores. For each patient, the changes in value for each of the 25 proportion indices were calculated as shown in Table 2. These rankings were correlated with the ranked clinicians' assessments of treatment change in three ways.
- For each patient, the overall change of each of the 25 indices (relative to the mean) was totalled, then the 15 patients' total scores were ranked and compared with their corresponding ranked clinical assessments.
- For each patient, the change in index was first divided by the mean value of that index and then all 25 values were totalled and ranked and then compared with the clinical assessments. This allowed the relationship between the mean value and the expected improvement to be assessed, i.e. to ascertain whether the size of the index itself might relate to the size of the change.
- For each patient, the change in each index was divided by the standard deviation of that index and then all 25 values were totalled and ranked, prior to comparison with the clinicians' rankings. This provided a way of reducing the influence of those particular indices with a wide standard deviation.
|
Repeatability
Clinicians' ratings: To assess inter-examiner agreement for both albums, the clinicians individually rated each image on two separate occasions and the mean rating was then ranked with those of the other clinicians. For album 1, each examiner's median scores were ranked from 115; 1 being the most attractive individual and 15 the least attractive (Table 3a). The same procedure was adopted for assessing inter-examiner agreement for album 2, involving improvement in appearance (Table 3b).
|
|
Photographic technique: Six volunteers (2 male, 4 female, age range 2730 years) were photographed on six occasions, with a two-week interval between each set of photographs (full-face and profile). The photographs were analysed using IPTool, in order to compare five linear measurements and one proportion index (Table 4a).
|
Digitization: The photographs of six patients from the study were randomly selected and digitized on six occasions, with at least a two-week interval between each digitization. On each occasion, six linear measurements (Table 4b) and one proportion index (Table 4c) were calculated.
|
|
| Results |
|---|
|
|
|---|
Clinicians' assessments
Inter-examiner agreement: Tables 3a and 3b present the ranked data for all 15 patients from the 10 clinicians' assessments, from both albums. For album 1, it can be seen that within 2.5 rankings, 8 out of the 10 clinicians agreed, and within 3.5 rankings there was complete agreement. For album 2, the median rankings were 8 and 9, respectively.
The result of comparing the clinician's initial rating of attractiveness (album 1) with the degree of improvement through treatment (album 2) is shown in Figure 1. There was a strong inverse correlation, indicating that those clinically judged to be the least attractive showed the greatest improvement (r = 0.781, P = 0.001).
|
Proportion indices change versus clinical assessment of change (album 2)
Table 2 provides the data for one of the 15 patients as an example, showing changes in the 25 indices as a result of treatment and their relationship to the Farkas mean, matched for age and gender. Whilst Table 5 shows the three different ways in which the proportion indices changes were calculated, Figure 2 shows the correlation (Spearman's rank) according to the first of the methods (r = 0.698, P = 0.004), indicating a good level of agreement between the change of proportion index and clinically judged improvement in appearance. Correlation, as assessed by the second method of presenting the changes in indices (involving the mean), was r = 0.608, P = 0.016; taking the standard deviation of the indices into account, resulted in the correlation r = 0.603, P = 0.017.
|
|
Repeatability
Table 4a shows the raw data obtained in assessing photographic repeatability. As an example, the standard deviation for repeated measurements of the index AlAl/NSn was 0.9 (Farkas index = 5.0).
Digitizing repeatability is presented in Table 4b, and again the standard deviation of repeated measurements of the proportion index AlAl/ChCh was 0.59 (Farkas index = 65.3, SD = 5.0).
| Discussion |
|---|
|
|
|---|
As seen in Figure 1, according to clinical judgement, the least attractive faces subsequently showed the most improvement through treatment. These results, based purely on the clinicians' assessments (albums 1 and 2), suggest that individuals who are relatively more attractive to begin with clearly have less to gain from treatment. These findings are to be expected, but do not appear to have been identified or reported hitherto. As may be expected, where the patients were at the extreme ends of the range of attractiveness, relatively more clinicians agreed with each other regarding their attractiveness than those patients who were nearer to the average.
The degree of improvement as assessed by the clinicians (album 2) clearly correlated strongly with the changes in the selected anthropometric proportion indices. This was confirmed by all three methods of comparison. It is not possible to compare these findings with the work of others, since this type of research has not previously been undertaken. However, the principle of comparing patients' proportion indices with those representing the mean, has previously been used by anthropologists (Ward et al., 1998, 2000![]()
). However, in their work, the indices were essentially used for syndrome diagnosis and the measurements were obtained by manual anthropometry, rather than photogrammetry, as in the present study. Using a conventional photographic approach has the benefit of wide availability, but the quality of results is very dependent on the protocol for standardization. Three-dimensional photographic techniques, such as stereo-photogrammetry, have the benefit of ease of image capture and also allow the use of further proportion indices, involving surface measurements; this approach is currently under development. It is also relevant to point out that facial attractiveness is dependent on a range of other factors that have not been considered here, including dynamic proportions, skin texture and colour, and dental appearance. However, the topic considered, namely static facial morphology, is clearly a dominating factor. Caucasians living in the Toronto area provided the anthropometric data used as a basis for comparison, so limiting the ethnic sample of the album images. However, data involving other ethnic groups, for example black Americans (Ofodile et al., 1993
) is gradually becoming available.
| Conclusions |
|---|
|
|
|---|
- There was a clear relationship between the attractiveness of the patients in album 1 and the degree of subsequent improvement in album 2, i.e. according to clinical judgement, the less attractive patients showed the greatest improvement.
- There was a strong correlation between proportion index changes and clinical assessment of improvement through surgery (album 2), using all three methods of comparison.
- There was good photographic and digitizing repeatability.
- The results were sufficiently encouraging to suggest that there may be a use for proportion indices in the objective quantification of facial attractiveness.
| Acknowledgement |
|---|
We are very grateful to the clinicians who provided the assessments; the orthodontists, Karen Clarke, Claire Hepworth, Allan Jones, Farhad Naini and Steve Newell, and the maxillo-facial surgeons, Malcolm Bailey, Peter Blenkinsopp, Andrew Stewart, Caroline Mills and Paul Robinson.
| References |
|---|
|
|
|---|
-
Alley TR and Cunningham MR. (1991) Average faces are attractive, but very attractive faces are not average. Psychological Science 2 123125.
Becker M and Svensson H. (1998) Morphometry in digital photographs: a promising technique for assessing patients with cleft lip and palate. Scandinavian Journal of Plastic and Reconstructive Hand Surgery 32:295299.[CrossRef]
Bengel W. (1985) Standardisation in dental photography. International Dental Journal 35:210217.[Medline]
Berger JL, Pangrazio-Kulbersh V, Thomas BW, Kaczynski R. (1999) Photographic analysis of facial changes associated with maxillary expansion. American Journal of Orthodontics and Dentofacial Orthopedics 116:563571.[Medline]
Bernstein IH, Lin TD, McClellan P. (1982) Cross- vs. within-racial judgments of attractiveness. Perception and Psychophysics 32:495503.[ISI][Medline]
Claman L, Patton D, Rashid R. (1990) Standardised portrait photography for dental patients. American Journal of Orthodontics and Dentofacial Orthopaedics 98:197205.
Cochrane SM, Cunningham SJ, Hunt NP. (1999) A comparison of the perception of facial profile by the general public and three groups of clinicians. International Journal of Adult Orthodontics and Orthognathic Surgery 14:291295.
Cunningham MR, Barbee AP, Pike CL. (1990) What do women want? Facialmetric assessment of multiple motives in the perception of male facial attractiveness. Journal of Personality and Social Psychology 59:6172.[CrossRef][ISI][Medline]
Edler RJ. (2001) Background considerations to facial aesthetics. Journal of Orthodontics 28:159168.
Farkas LG. (1994) Anthropometry of the head and face. (Raven Press, New York).
Farkas LG and Munro IR. (1987) Anthropometric facial proportions in medicine. (Charles C Thomas, Illinois).
Farkas LG, Bryson W, Klotz J. (1980) Is photogrammetry of the face reliable? . Plastic and Reconstructive Surgery 66:346355.[Medline]
Grammer K and Thornhill R. (1994) Human facial attractiveness and sexual selection: the role of symmetry and averageness. Journal of Comparative Psychology 108:233242.[CrossRef][ISI][Medline]
Greenhill D, Edler R, Wertheim D, Belsham A, Jones GA. (2000) A system for assessing mandibular outlines. International conference on advances in medical signalling and medical information processing. (Institution of Electrical Engineers, London)192199.
Howells DJ and Shaw WC. (1985) The validity and reliability of ratings of dental and facial attractiveness for epidemiologic use. American Journal of Orthodontics 88:402408.[CrossRef][ISI][Medline]
Kolar JC and Salter EM. (1997) Craniofacial anthropometry. (Charles C Thomas, Illinois).
Langlois JH and Roggman LA. (1990) Attractive faces are only average. Psychological Science 1:115121.
Lines PA, Lines RR, Lines CA. (1978) Profilemetrics and facial aesthetics. American Journal of Orthodontics 67:4456.[CrossRef]
Maret SM and Harling GA. (1985) Cross cultural perceptions of physical attractiveness: ratings of photographs of whites by Cruzans and Americans. Perceptual and Motor Skills 60:163166.
Ofodile FA, Bokhari FJ, Ellis C. (1993) The black American nose. Annals of Plastic Surgery 31:209219.[ISI][Medline]
Okkerse JME, Beemer FA, Cordia-de Haan M, Heineman-de Boer JA, Mellenbergh GJ, Wolters WHG. (2001) Facial attractiveness and facial impairment ratings in children with craniofacial malformations. Cleft Palate-Craniofacial Journal 38:386392.[CrossRef]
Patzer GL. (1985) The physical attractiveness phenomena. (Plenum Press, New York).
Patzer GL. (1994) Practice made perfect. Measurement of physical attractiveness: truth-of-consensus. Journal of Esthetic Dentistry 6:185188.[Medline]
Perrett DI, May KM, Yoshikawa S. (1994) Facial shape and judgements of female attractiveness. Nature 368:239242.[CrossRef][Medline]
Phillips C, Tulloch C, Dann C. (1992) Rating of facial attractiveness. Community Dentistry and Oral Epidemiology 20:214220.[CrossRef][ISI][Medline]
Richardson SA. (1970) Age and sex differences in values toward physical handicaps. Journal of Health and Social Behaviour 11:207.[Medline]
Ritter K, Trotman CA, Phillips C. (2002) Validity of subjective evaluations for the assessment of lip scarring and impairment. Cleft Palate-Craniofacial Journal 39:587596.
Roberts-Harry DP, Hathorn IS, Stephens CD. (1992) The ranking of facial attractiveness. European Journal of Orthodontics 14:483488.
Russell KA, Aldman SD, Tompson B, Lee JM. (2001) Nasal morphology and shape parameters as predictors of nasal esthetics in individuals with complete unilateral cleft lip and palate. Cleft Palate-Craniofacial Journal 38:476485.[CrossRef]
Strzalko J and Kaszycka KA. (1991) Physical attractiveness: interpersonal and intrapersonal variability of assessments. Social Biology 39:170176.
Symons D. (1979) (The evolution of human sexuality Oxford University Press, New York).
Thakera JN and Iwawaki S. (1979) Cross-cultural comparisons in interpersonal attraction of females toward males. Journal of Social Psychology 108:121122.
Tobiasen JM. (1987) Social judgements of facial deformity. Cleft Palate Journal 24:323327.[Medline]
Tobiasen JM and Hiebert JM. (1988) Reliability of aesthetic ratings of cleft impairment. Cleft Palate Journal 25:313317.[Medline]
Vegter F and Hage JJ. (2001) Facial anthropometry in cleft patients: a historical appraisal. Cleft Palate-Craniofacial Journal 38:577581.[CrossRef]
Vegter F, Mulder JW, Hage JJ. (1997) Major residual deformities in cleft patients: a new anthropometric approach. Cleft Palate-Craniofacial Journal 34:106110.[CrossRef]
Ward RE, Jamison PL, Allanson JE. (2000) Quantitative approach to identifying abnormal variation in the human face exemplified by a study of 278 individuals with five craniofacial syndromes. American Journal of Medical Genetics 91:817.[Medline]
Ward RE, Jamison PL, Farkas LG. (1998) Craniofacial variability index: a simple measure of normal and abnormal variation in the head and face. American Journal of Medical Genetics 80:232240.[Medline]
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

