Share to: share facebook share twitter share wa share telegram print page

Median polish

The median polish is a simple and robust exploratory data analysis procedure proposed by the statistician John Tukey. The purpose of median polish is to find an additively-fit model for data in a two-way layout table (usually, results from a factorial experiment) of the form row effect + column effect + overall median.

Median polish utilizes the medians obtained from the rows and the columns of a two-way table to iteratively calculate the row effect and column effect on the data. The results are not meant to be sensitive to the outliers, as the iterative procedure uses the medians rather than the means.

Model for a two-way table

Suppose an experiment observes the variable Y under the influence of two variables. We can arrange the data in a two-way table in which one variable is constant along the rows and the other variable constant along the columns. Let i and j denote the position of rows and columns (e.g. yij denotes the value of y at the ith row and the jth column). Then we can obtain a simple linear regression equation:

where b0, b1, b2 are constants, and xi and zj are values associated with rows and columns, respectively.

The equation can be further simplified if no xi and zj values are present for the analysis:

where ci and dj denote row effects and column effects, respectively.

Procedure

To carry out median polish:

(1) find the row medians for each row, find the median of the row medians, record this as the overall effect.

(2) subtract each element in a row by its row median, do this for all rows.

(3) subtract the overall effect from each row median.

(4) do the same for each column, and add the overall effect from column operations to the overall effect generated from row operations.

(5) repeat (1)-(4) until negligible change occur with row or column medians


References

  • Frederick Mosteller and John Tukey (1977). "Data Analysis and Regression". Reading, MA: Addison-Wesley. ISBN 0-201-04854-X.
  • J.D. Emerson and D.C. Hoaglin (1983). "Analysis of two-way tables by medians". In "Understanding Robust and Exploratory Data Analysis", eds D. C. Hoaglin, F. Mosteller and J. W. Tukey. New York City: John Wiley & Sons. ISBN 0-471-38491-7. pp. 165–210.
  • William N. Venables and Brian D. Ripley (2002). Statistics Complements to Modern Applied Statistics with S, p.4–5. ISBN 0-387-95457-0.
  • Anwar Fitrianto, Hari Wijayanto, Sohel Rana, and Cheong Yee Voon (2014). "Median Polish for Final Grades of MTH3000- and MTH4000- Level Courses". Applied Mathematical Sciences, Vol. 8, no. 126, pp. 6295-6302


Index: pl ar de en es fr it arz nl ja pt ceb sv uk vi war zh ru af ast az bg zh-min-nan bn be ca cs cy da et el eo eu fa gl ko hi hr id he ka la lv lt hu mk ms min no nn ce uz kk ro simple sk sl sr sh fi ta tt th tg azb tr ur zh-yue hy my ace als am an hyw ban bjn map-bms ba be-tarask bcl bpy bar bs br cv nv eml hif fo fy ga gd gu hak ha hsb io ig ilo ia ie os is jv kn ht ku ckb ky mrj lb lij li lmo mai mg ml zh-classical mr xmf mzn cdo mn nap new ne frr oc mhr or as pa pnb ps pms nds crh qu sa sah sco sq scn si sd szl su sw tl shn te bug vec vo wa wuu yi yo diq bat-smg zu lad kbd ang smn ab roa-rup frp arc gn av ay bh bi bo bxr cbk-zam co za dag ary se pdc dv dsb myv ext fur gv gag inh ki glk gan guw xal haw rw kbp pam csb kw km kv koi kg gom ks gcr lo lbe ltg lez nia ln jbo lg mt mi tw mwl mdf mnw nqo fj nah na nds-nl nrm nov om pi pag pap pfl pcd krc kaa ksh rm rue sm sat sc trv stq nso sn cu so srn kab roa-tara tet tpi to chr tum tk tyv udm ug vep fiu-vro vls wo xh zea ty ak bm ch ny ee ff got iu ik kl mad cr pih ami pwn pnt dz rmy rn sg st tn ss ti din chy ts kcg ve 
Prefix: a b c d e f g h i j k l m n o p q r s t u v w x y z 0 1 2 3 4 5 6 7 8 9 
Kembali kehalaman sebelumnya