OMB # 0925-0740

Expiration Date: 07/31/2022

Public reporting burden for this collection of information is estimated to average 10 minutes per response, including the time for reviewing instructions, searching existing data sources, gathering and maintaining the data needed, and completing and reviewing the collection of information. An agency may not conduct or sponsor, and a person is not required to respond to, a collection of information unless it displays a currently valid OMB control number. Send comments regarding this burden estimate or any other aspect of this collection of information, including suggestions for reducing this burden, to: NIH, Project Clearance Branch, 6705 Rockledge Drive, MSC 7974, Bethesda, MD 20892-7974, ATTN: PRA (0925-0740). Do not return the completed form to this address.

NCBI is pleased to announce a single-cell-focused codeathon. To apply, please complete this form by December 30th, 2019.

When and where is the codeathon?

January 15-17, 2020 at the New York Genome Center.

Who can participate?

We encourage researchers and data scientists at any stage of their data science journey to apply.  Teams will greatly benefit from people who possess any of the following skills:

  • analyzing single-cell data types
  • working knowledge of scripting (e.g., Shell, Python, R)
  • familiarity with methods for manipulating and/or analyzing large datasets
  • developing bioinformatics code, pipelines or tools
  • data visualization  

There is no registration fee associated with attending this event.

*Note:* Participants will need to bring their own laptop to this event. No financial support for travel, lodging, or meals is available for this event.

What are some of the potential team projects?

  • Using single-cell SRA sequence to verify/improve submitted metadata available in SRA.
  • Assessing log-transformation and z-scores for scRNA-seq data analysis.
  • Identifying bulk RNA-seq-derived biomarkers of cancer risk within single-cell populations.
  • Defining Cell Fate Regulators in Multi-omic Single Cell Developmental Datasets.
  • Identifying rapid cleavage sites from scRNA-seq data.
  • Understanding cancer evolution through single-cell expression dynamics.
  • Using Tabula Muris Senis as a reference for a semi-automated scRNA-seq analysis workflow in the cloud.

How are teams formed?

Before the event, we will create five to eight teams, each composed of five to six individuals with various backgrounds and expertise. Each team will be led by an experienced leader.

What will a typical day be like?

We will meet from 9 am to 5 pm each day, with the potential to extend into the evening hours for continued work or optional social events.

Each day, we will gather as a group for a short presentation on a topic of interest to the data science and bioinformatics community (such as bioinformatics best practices or coding styles) and then break out to work on team projects: pipelines and tools for the analysis of large datasets within a cloud infrastructure. Each day, teams will also give short talks to introduce their project (day 1), discuss project progress (day 2), and present the results (day 3).

What will we build?

We will make all pipelines, other scripts, software, and programs generated in this codeathon available on a dedicated public GitHub repository (

Each team may submit manuscripts describing the design and use of the software tools they created to an appropriate journal such as the F1000Research hackathons channel, BMC Bioinformatics, GigaScience, Genome Research, or PLoS Computational Biology.

How to apply?

To apply, please complete this form.  Applications are due December 30th, 2019 by 3 p.m. EST. We will select participants based on their experience and their motivation to attend.

We encourage prior participants and prior applicants to apply. We will notify the first round of accepted applicants on January 3, 2020. Accepted applicants have until January 8 at noon ET to confirm their participation. International applicants or those with particular skillsets may be accepted early. If you confirm, please make sure that you can attend, as confirming and not attending prevents other scientists from attending this event. Please provide a monitored email address, in case there are follow-up questions.


Entrants retain ownership of all intellectual property rights (including moral rights) in the code submitted to, as well as developed in, the codeathon. Employees of the U.S. Government attending as part of their official duties retain no copyright in their work, and their work is in the public domain in the U.S.

The Government disclaims any rights in the code submitted or developed in the codeathon.

Participants agree to publish the code and any related data in GitHub.

Please feel free to contact Allissa Dillman if you have questions or need more information.

How comfortable are you with these programming languages?
How comfortable are you with:
Here are some approximate project titles (which are expected to evolve); please pick the top three you are interested in working on.