Skip to Main Content

Applied Data Science

Jane Dodge

Citing and Citation Managers

It is important to cite the source where the dataset is obtained from. Regardless of the desired citation style, following are the components of a data citation.

 

Key Components of a Data Citation
Author or CreatorThe name(s) of each individual or organizational entity responsible for the creation of the dataset.

Title or Study NameThe title of the dataset, including the edition or version number, if applicable.

Publisher and/or DistributorThe organizational entity that makes the dataset available by archiving, producing, publishing, and/or distributing the dataset.  

Publication DateThe date when the data set was published or released.

Location or IdentifierWeb address or unique, persistent, global identifier used to locate the dataset, e.g. a DOI or a handle.  Append the date retrieved if the title and locator are not specific to the exact instance of the data you used.

 

Additional Elements

Version or Edition—The exact version or edition of the data set.

Access DateDate of access for analysis. Needed to reproduce analysis of continuously updated dynamic datasets.

Format / Material Designator—Database, CD-ROM.

Feature NameA description of the subset of the dataset used.  May be a formal title or a list of variables.

VerifierUsed to confirm that two datasets are identical.  Most commonly a UNF or MD5 checksum.

SeriesUsed if the dataset is part of series of releases (e.g. monthly, yearly).

Contributore.g. editor, compiler 

 

Information: borrowed from https://infoguides.gmu.edu/find-data/cite

Examples of Dataset Citations (Information: borrowed from https://infoguides.gmu.edu/find-data/cite)
APA (6th edition)
Smith, T.W., Marsden, P.V., & Hout, M. (2011).  General social survey, 1972-2010 cumulative file (ICPSR31521-v1) [data file and codebook]. Chicago, IL: National Opinion Research Center [producer].  Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor]. doi: 10.3886/ICPSR31521.v1

MLA (7th edition)
Smith, Tom W., Peter V. Marsden, and Michael Hout.  General Social Survey, 1972-2010 Cumulative File.  ICPSR31521-v1. Chicago, IL: National Opinion Research Center [producer].  Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 2011. Web. 23 Jan 2012. doi:10.3886/ICPSR31521.v1

Chicago (16th edition) (author-date)
Smith, Tom W., Peter V. Marsden, and Michael Hunt. 2011. General Social Survey, 1972-2010 Cumulative File. ICPSR31521-v1. Chicago, IL: National Opinion Research Center. Distributed by Ann Arbor, MI: Inter-university Consortium for Political and Social Research. doi:10.3886/ICPSR31521.v1

Examples of how to cite statistical tables published in a publication or on a website
Michigan State University Libraries has put together a comprehensive guide on how to cite statistical tables and charts.  The guide is filled with examples and is very helpful.

APA, MLA, and Chicago Turabian Citation Style Guides

Citation Managers

This link provides an overview of various citation managers, a comparison chart, and video tutorials of citation management systems.