Biopython
Overview
Biopython is a Python package that simplifies common data-processing tasks in genomics. This tutorial provides an easy introduction.
Biopython is good for:
1. Quick analysis of nucleotide and protein sequences. You can easily extract a segment from a longer nucleotide sequence, get its reverse complement, translate nucleotides to protein, and so on.
2. Parsing of all kinds of files including simple FASTA files, BLAST output, MUSCLE output, PDB files, and so on.
3. Submitting requests to online databases and fetching their responses. For example, you can run BLAST at NCBI programmatically instead of filling out the web form by hand.
4. Statistical and bioinformatics analysis - clustering, motifs, phylogeny, etc.
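As a rough illustration of items 1 and 2 above, here is a minimal sketch that assumes Biopython is installed (for example with `pip install biopython`). The DNA string, the FASTA text, and the record names `seq1` and `seq2` are made-up sample data, not course material.

```python
from io import StringIO

from Bio import SeqIO
from Bio.Seq import Seq

# Item 1: extract a segment, reverse-complement it, and translate it.
dna = Seq("ATGGCCATTGTAATGGGCCGCTGAAAGGGTGCCCGATAG")  # made-up example sequence
segment = dna[0:24]                       # first 24 nucleotides (8 codons)
rev_comp = segment.reverse_complement()   # reverse complement of the segment
protein = segment.translate()             # standard genetic code; "*" marks a stop codon

print(segment)
print(rev_comp)
print(protein)

# Item 2: parse FASTA records. An in-memory handle stands in for a file here;
# with a real file you would pass its path to SeqIO.parse instead.
fasta_text = ">seq1 example\nATGGCC\n>seq2 example\nTTGACA\n"
records = list(SeqIO.parse(StringIO(fasta_text), "fasta"))
ids = [record.id for record in records]

for record in records:
    print(record.id, len(record.seq))
```

Items 3 and 4 are not sketched here because they depend on network access or larger data sets.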
Skills You Acquire
- You will learn to use the Biopython package to analyze genomic data.
- Solutions are often presented without the reasoning that led to them. These videos walk through the thought process behind each solution.
Class Style
These modules are video-assisted. The recorded videos explain the technical material step by step. In addition, the modules include text with all of the code, data, and explanations.
Prerequisites
Python basics.
Lessons
- Lesson 1: Welcome (Status: Complete)
- Lesson 2: Seq (Status: Complete)
- Lesson 3: SeqRecord (Status: Complete)
- Lesson 4: Parsing FASTA Files (Status: Complete)
- Lesson 5: Analyzing Alignment Files (Status: Complete)
- Lesson 6: Parsing GFF Files (Status: Incomplete)
- Lesson 7: Converting Phylogeny Tree (Status: Incomplete)
- Lesson 8: Reading NCBI Data (Status: Incomplete)
- Lesson 9: Running Programs over the Internet (Status: Incomplete)
- Lesson 10: Processing of PDB Files (Status: Incomplete)