Introduction

The Lee-Campbell Research Group constructs, analyzes, and disseminates Big Social Science Data collections and an associated Scholarship of Discovery related largely to historical and contemporary China.  Group members  collaborate with James Z. Lee and Cameron D. Campbell on these projects and attend weekly group meetings to discuss their research.  One distinctive feature of our group is our interdisciplinarity, with a diverse mix of social scientific historians and historical social scientists. Another distinctive feature of our group is our solidarity with many faculty group members who began as Lee-Campbell students and postdocs and now besides their own independent research continue to collaborate on major group projects. In addition, we all also work closely with other professors, students, and research assistants outside our group on specific sub-projects.

Our main data projects include the China Multi-Generational Panel Datasets CMGPD, the China University Student Datasets CUSD, the China Government Employee Datasets CGED-Qing and CGED-ROC, the China Rural Revolution Datasets CRRD-LR and CRRD-SQ, and the China Professional Occupation Datasets CPOD.

These five data projects currently have individual level information for 1.8 million people who lived in China between the eighteenth century and the present as well as several hundred thousand related individuals, typically spouses, parents, or other relatives.  More than 800,000 are Qing dynasty populations, largely from the middle of the eighteenth century to the beginning of the twentieth century. Another million are from the Republic of China and People’s Republic of China, almost entirely from the twentieth century. Nearly 900,000 are rural populations from North and Northeast China, half of whom are longitudinally linked over their life course and across generations.  The remaining 900,000 are almost entirely university educated or the historical equivalent, urban populations of government officials, professionals, and university students and faculty drawn from all over China, whose records we are in the process of linking across careers and for a significant proportion across generations.

In 2010 and 2014, we publicly released 3 million longitudinally linked triennial / annual records for 368 thousand unique individuals from the China Multigenerational Panel Database-Liaoning (CMGPD-LN) and China Multigenerational Panel Database-Shuangcheng (CMGPD-SC), through the Inter-university Consortium for Political and Social Research.  In 2019, we publicly released 638 thousand quarterly linked records for 50 thousand Qing Dynasty officials (based on our linkage) who served between 1900-1912 from the China Government Employee Database – Qing (CGED-Q), and plan to release over 4 million observations of 400,000 Qing officials from the CGED-Q in the years to come.  For more details, including links to download the 2019 public release, please visit our CGED-Q Project Page.  We also plan to release the non-proprietary portions of the CGED-ROC, CUSD, CRRD, and CPOD datasets, once we have finished data construction.

Our best known publications, including five prize winning or CHOICE award books published by Harvard University Press, 北京三联出版社, MIT Press, and Stanford University Press, are based largely on a scholarship of discovery from our analyses of the above data. We have also published a number of articles using such data in many Chinese and English-language learned journals as well as such impactful (>2) international journals as American Sociological Review, Annual Review of SociologyChina Quarterly, Chinese Sociological ReviewDemography, Evolution and Human Behavior, and Social Science and Medicine, which have awarded us five additional best article or editors choice awards.

For a preliminary summary of many of our research findings, please see Understanding China, 1700-2000: A Data Analytic Approach available both as a shorter Coursera MOOC and as a longer advanced undergraduate / beginning postgraduate on-line course. We use these course videos in our teaching at HKUST and elsewhere to develop flipped classroom approaches to train students to work together in groups rather than individually, to improve their oral and written communication skills as well as their critical thinking, and to develop their EQ and use and enhance their IQ.

We have also started to share some of our experiences in historical dataset construction, data analysis, and data analytic teaching in a series of methodological articles published in such major Chinese language journals as 《历史研究》 (Historical Research), 《社会》 (Society), 《文史哲》 (Literature, History, Philosophy), and 《清华大学学报》 (Journal of Tsinghua University), The first of which won a 优秀青年成果奖 (Talented Young Scholar Achievement Award) from the Ministry of Education and the last of which won the Parkson Best Article Award from Tsinghua University, to encourage new efforts in construction, analysis, and teaching of Big Social Science Data from archival sources in China.

For further information about our activities, please consult

  • our group people pages for members’ profiles including institutional affiliations, honors, research projects, and email addresses
  • our group project pages for information on each project’s principal co-investigators / collaborators, background, history, funding, research output, user guides, and data access
  • our group publication pages for a list of our publications over the last ten years as well as of fourteen prize winning or equivalent articles, books, and projects
  • our group outreach page for a summary of public interest media, including television as well as newspaper, journal, and web articles that introduce our research to the general public, and an assesment of the public policy impact of our research on Chinese tertiary education, 1912-2012 on contemporary university admission policies in the Peoples Republic of China
  • our group blog for news and updates