
Bioinformatics Analysis for Conservation Genomics
- Tutorials that emphasize hands-on training in computational tools used in conservation genomic analyses
- Design of conservation genomics projects
- Data collection via whole genome sequencing and reduced representation methods
- Genome assembly and annotation
- Read data processing and mapping
- Variant calling and genotyping from NGS data
- Analysis of genetic structure and admixture
- Estimation of genome-wide diversity, inbreeding, and historical demography
- Kinship estimation
- RNAseq methods and analysis
- Use of Unix command line to access Smithsonian high-performance computing resources and pre-compiled software
Participants should have completed basic genetics and evolutionary biology coursework or have previous experience with genetics, genomics, or evolutionary biology.
- Dates
March 20-29, 2023
- Available Formats
Graduate (CONS 697, 3 credits)
Professional Training (SMSC 530, 6.75 CEUs)- Cost
Graduate: See Mason’s graduate tuition rates
Professional Training: $2,119.00*
*Professionals from certain countries are eligible for a reduced rate of $1,619.00 which will automatically be reflected during the registration process.
- Deadlines
Apply by January 9, 2023
Payment Deadline: January 23, 2023
Meet the Faculty

Smithsonian Institution Data Science Lab, Office of the Chief Information Officer
Curriculum
This course provides an in-depth survey of the concepts, methods, and software used in conservation genomics research. Participants will learn how to: 1) design conservation genomics projects, 2) handle complex genomic data, 3) use common population genomics software packages and tools, 4) interpret, visualize, and publish results, and 5) practice collaborative research. For this course, participants will be working with a novel dataset from a threatened species for which a chromosome-length genome assembly and whole genome sequencing data of additional individuals will already have been generated. Participants will collaborate with each other and the instructors to analyze these data and then prepare a genome report manuscript for submission to a peer-reviewed journal (e.g., G3, Genome Biology and Evolution, Journal of Heredity, or Molecular Ecology Resources). THIS is an example of the type of publication participants will work toward. Participants that actively work on the analysis and writing process will be included as coauthors on the resulting manuscript. Please note that analyses and manuscript preparation may extend beyond the 10-day course. A Slack channel for the course will be set up to foster communication among participants and instructors where messages, papers, and programs can be shared. This experience will provide participants the opportunity to conduct sophisticated genomic analyses as applied to conservation and learn best practices for collaborations.
Target Audience
This course is designed for graduate students and early-career researchers interested in designing conservation genomics projects or already generating genome data to acquire the skills necessary to: assemble and annotate a genome or transcriptome, estimate genetic diversity and structure, calculate inbreeding levels, infer demographic history, estimate kinship, and compare data and results to existing genomic resources. The hands-on tutorials in this workshop will be performed on a High-Performance Computing Cluster via the Unix command line. Although some review of Unix will be included in the course, all participants should be familiar with basic Unix commands. We recommend this short tutorial: (https://swcarpentry.github.io/shell-novice/
What’s Included
The total cost for professional training includes our Housing ($528.00) and Dining ($341.00) Packages and covers:
- Instruction, course manual, and other course materials
- Airport pick-up and drop-off shuttle service at Dulles International Airport (IAD) at pre-arranged times. We do not provide ground transportation shuttle service to or from any other airports in the Washington, DC, region
- Transportation for course activities
- Daily full-service buffet at the SMSC Dining Commons.
- Housing at the SMSC Residential Facility, including a shared room with bathroom (single rooms available at twice the cost of shared housing)
Acceptance does not guarantee you a seat in the course. Seats are allocated as registration payments are received, and early registration is strongly encouraged to ensure you get a spot.
Interested in Commuting?
Participants with a documented, permanent address local to campus may elect to commute with permission from SMSC. Commuters are required to purchase a reduced meal plan ($100) which includes lunch and snack breaks for each day of instruction. Additional dining options are available and may be selected during registration. Email SCBItraining@si.edu for additional information.

“A really comprehensive crash course, by the end of the course I felt comfortable navigating Unix and implementing high throughput data analytics for genome assembly. Overall a fantastic course with a great staff!”

“This is an intense hands-on and full-immersion course in bioinformatics and genomics. It is an excellent introduction to genome assembly, annotation and analyses in conservation genomics. I recommend it for anyone who wants to get started on research that takes a genomic approach in conservation – it will help you develop tools in the field and introduce you to top genomicists and bioinformaticians at the Smithsonian.”