List and define variables
Choose file naming conventions
Save raw data without processing
Document the following information:
Collecting variables in their rawest, most precise form provides the greatest flexibility when it comes time to analyze the data.
Determine whether there is a data standard for topic of study
Clearly define variables to avoid confusion
Avoid categorizing variables -- collect them as precisely as possible
Avoid collecting compound variables -- collect data in its rawest form
Avoid ambiguous field names
Document all variables in a table or spreadsheet, known as a data dictionary
Always include the same information in the same order for each file related to the same project
Avoid using special characters, e.g. $!@
Avoid using periods and spaces
File names should include letters, numbers, and underscores only
Reserve the three-letter file extensions for codes the system assigns to the file type, e.g. csv, tif
Use a directory hierarchy
Use YYYYMMDD format for date
Use hhmmssTZD format for time (TZD = time zone)
Use Last Name, First and Middle initial format for names
Example: Davis, J. P.
For analyzed or processed data, include descriptions and references to software or code used
Readme files: general project files describing the overall organization, responsible parties, instruments, etc. related to the research data
Registered Reports is a publishing format that emphasizes the importance of the research question and the quality of methodology by conducting peer review prior to data collection. High quality protocols are then provisionally accepted for publication if the authors follow through with the registered methodology.
By doing so, Registered Reports eliminates questionable research practices, such as low statistical power, selective reporting of results, and publication bias.