
























Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Community
Ask the community for help and clear up your study doubts
Discover the best universities in your country according to Docsity users
Free resources
Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors
BINF511 EXAM STUDY GUIDE 2025-2026
Typology: Exams
1 / 32
This page cannot be seen from the preview
Don't miss anything!
What is a binary file? - Answer A file that is machine readable. It requires an interpreter to be human readable. What is the purpose of a README file? - Answer They are plain text files that give basic information about the directory or software. These are often used to ensure the user knows the purpose of the directory. They are especially useful in multiuser systems. Based on the given tree, what is the absolute path to phone numbers.txt - Answer /user 1/desktop/phone numbers.txt/ Based on the given tree, give the relative path from info.pdf to user 2 - Answer ../../../user 2/ What are the parts of this UNIX prompt: hugin:~stromvik% - Answer hugin: = computer name ~ = home directory stromvik = login username % = prompt (can be various systems %$>) what does the unix comand pwd do? - Answer prints the working directory what does the unix command ls do? - Answer prints a list of the contents of the working directory what does the unix command ls -l do? - Answer long list - it prints a list of the contents of the working directory with more details than the standard ls If a file had the permission 764 what would it mean? - Answer The owner can read, write and execute. The group can read and write. The world can read. what are the numerical values associated with the read, write and execute permissions?
what unix command will display the username? - Answer whoami what does the the unix command cp do? - Answer copies the specified file to the specified location what does the unix command mv do? - Answer 1. it can make a copy of the file if you choose a file name as the location
adds the output to the end of an existing file
how do you compress your file in unix? - Answer gzip. it will ad the extension .gz what does the unix command gunzip do? - Answer uncompresses the specified file. what is safe file transfer protocol (sftp)? - Answer it is an electronic way to transfer files between a local and remote computer. how to you establish a connection between two computers for a safe file transfer protocol? - Answer after the local computers prompt, type sftp and the name of the computer you are connecting to (usually your username on that computer@IPadress) What are the rules for file names using unix? - Answer you can use. and _. You can not use spaces or other special characters. what do the unix commands put and mput do when using safe file transfer protocol? - Answer put = moves 1 file from local machine to remote machine mput = moves multiple files from local machine to remote machine what do the unix commands get and mget do when using safe file transfer protocol? - Answer get = moves 1 file from the remote machine to the local machine mget = moves multiple files from the remote machine to the local machine What is the purpose of a loop in unix - Answer it allows you to automate a task that you want done to multiple files so you do not need to type the same commands repeatedly
what does the unix command echo do? - Answer it prints whatever is on the rest of the line is to the standard out.
how do you stop a loop in unix? - Answer type end as the last line
What does HTML stand for? - Answer Hypertext Markup Language
what is HTML used for? - Answer programming language to structure and format information for webpages
what is XML? - Answer Extensible Markup Language
what is XML used for? - Answer to write files that can be parsed easily. The goal is to
make 1 file type that multiple programs and databases can use standard tags and share information
what is ASN.1? - Answer Abstract syntax notation 1
what is ASN.1 used for? - Answer formal language to structure information to share between applications (similar to XML). Genbank is based on ASN.
what is the central dogma of molecular biology? - Answer DNA -> RNA -> Protein
What is the genome? - Answer A collection of all of the genes (DNA) within an organism.
what is the ORFome? - Answer A collection of all of the open reading frames (ORFs) in an organism.
What is the transcriptome? - Answer A collection of all of the transcripts (RNA) in an organism
What is the proteome? - Answer A collection of all of the gene products (proteins) in an organism
What is the metabolome? - Answer A collection of all of the metabolites in an organism
what does comparative genomics assess? - Answer Typically compares the genomes of 2 different organisms
what does functional genomics assess? - Answer Study's the function of genes. This will often include transcriptomics
what does structural genomics study? - Answer focuses on the physical structure and
adaptors and added to a flow cell. Clusters of DNA are formed via bridge amplification. At each base, a fluorescently labelled terminator is added and read. It is then removed so the next base can be called.
Describe the basics of Oxford Nanopore sequencing - Answer DNA molecule is pulled through a pore in the membrane. As the molecule moves through, the charge across the membrane changes allowing for base calling.
Describe the basics of Single molecule real time (SMRT) sequencing - Answer uses a zeromode wave guide: a compartment with only 1 polymerase and 1 DNA template. Fluorescent molecules are cleaved off if they are incorporated. This is detected and allows for base calling.
Describe the basics of 10x Genomics sequencing - Answer Microfluidics based methods of single cell sequencing. Uses barcoding to determine which molecule you are "on". I don't think they are doing this anymore.
What sequencing methods are considered next gen (2nd gen) - Answer Illumina, 454 pyrosequencing, Ion torrent
What sequencing methods are considered long read (3rd gen) - Answer Oxford Nanopore, Single molecule real time (SMRT), and 10x genomics
What is a genomic DNA library? - Answer a collection of all of the DNA in a organism.
What are 2 methods of whole genome sequencing - Answer Shotgun sequencing, Clone-contig approach
What are the steps of shotgun sequencing - Answer 1. fragment DNA into large pieces (2000kb)
What are the steps of clone-contig approach sequencing - Answer 1. clones are screened and ordered
What is Expressed sequencing tags (EST) sequencing - Answer 1. Early version of transcriptomics (not used much anymore)
What is serial analysis of gene experssion (SAGE) sequencing? - Answer 1. SAGE library is created from a pool of mRNA
What are some problems associated with Serial analysis of gene expression (SAGE)? - Answer 1. tags may match to multiple genes
What is RNA-seq - Answer Transcriptome profiling using deep sequencing techniques. Typically uses illumina sequencing
How are genes represented on Affymatrix Gene Chips? - Answer by 25 mers in 2 versions, 1 perfect match and 1 mismatch. Perfect match - mismatch = intensity value of the probe pair
What are the steps for an Affymatrix Gene Chip experiment? - Answer 1. primer hybridization 2.reverse transcription (first strand cDNA synthesis)
What are the drawbacks of gene chips? - Answer 1. are not available for all genes/genomes
What are the 2 RNAseq analysis methods? - Answer 1. Align reads to genome
What are 2 methods used to DETERMINE protein structure? - Answer 1. Xray crystallography
What are 4 methods to PREDICT protein structure? - Answer 1. protein modeling
What are some problems that occur with proteomics techniques? - Answer 1. protein-protein interactions
What are 3 large scale protein study techniques? - Answer 1. Immobilized- gradient 2D PAGE
What dimensions are proteins typically separated by in immobilized gradent 2D PAGE - Answer 1. isoelectric focus (pH)
what does PAGE stand for (2D PAGE)? - Answer Polyacrylamide gel electrophoresis
How does N-terminal sequencing work? - Answer Amino acid residues are cleaved one at a time, and are analyzed by high performance liquid chromatography
What are 3 mass spec techniques? - Answer 1. Electrospray ionization mass spectrometry (ESI-MS)
What is phasing in genome assembly? - Answer Assemble DNA into pseudochromosomes and identify alleles in each of the chromosomes
what is HiC? - Answer Chromosome confirmation capture sequencing
What does TargetScan do? - Answer searches 3' UTR for miRNA reverse complement sequences
What are the 3 methods of DNA methylation? - Answer 1. during replication (at the replication fork)
What are some reasons DNA may be methylated - Answer 1. genome maintenance and control of transposon activity
What are the steps involved in Bisulphite genomic sequencing? - Answer 1. fragment DNA
What is commonly used as a control in bisulphite sequencing? - Answer Chloroplast DNA because it is entirely unmethylated
What is ChIP sequencing and what is it used for? - Answer Chromatin ImmunoPrecipitation Sequencing. Used to identify DNA sequences that proteins bind to.
What are the steps involved in ChIP sequencing? - Answer 1. cross-link proteins to DNA
What does NCBI stand for? - Answer National Center for Biotechnology Information
What databases are apart of the International Nucleotide Sequence Database Collaboration (INSDC) - Answer GenBank, EMBL, DDBJ
What can sequence data in NCBI be linked to? - Answer proteins/DNA, taxonomy, genome, mapping, protein structures and domains, and PubMed Journal Literature
What are the 3 sections of a GenBank flat file? - Answer 1. Header - information that applies to the entire file
What information is in the header of a genbank flat file? - Answer Locus name, length of
What are missense mutations? - Answer changes the AA produced
What are non-sense mutations? - Answer changes the AA to a stop codon
What are silent mutations? - Answer change in nucleotide sequence that does not change the AA. This usually occurs in the 3rd codon position due to the tRNA wobble
What are frameshift mutations? - Answer Either an insertion or a deletion that affects all of the following AA.
What is sequence identity? - Answer The % of positions where the nucleotide or amino acids are an exact match
what is sequence similarity? - Answer The % of positions where the sequence has a similar match. This is based on groups and biochemical properties. Only applies to amino acid sequences.
What is sequence homology? - Answer A qualitive measure of the relatedness between 2 sequences. This is an absolute value, so sequences are either homologous or not.
What are the 3 choices that can occur for each base pair position when aligning 2 sequences? - Answer 1. The sequences should be aligned with each other
What does BLAST stand for? - Answer Basic Local Alignment Search Tool
What are the 4 types of BLAST? - Answer 1. BLASTN - Nucleotide query to nucleotide database
What are the step of a BLAST search? - Answer 1. index words
How are nucleotide sequence alignments scored? - Answer matches are given a + value, and mismatches are given a - value. Point mutations and segmental mutations are taken into account
How are amino acid sequence alignments scored? - Answer Scored based on how some AA substitutions are better than others. Typically based on BLOSUM or PAM matrices.
What does BLOSUM stand for? - Answer BLOcks SUbstitution Matrix
What does PAM stand for? - Answer Percent Accept Mutations
What does the number in a BLOSUM matrix mean (ex. BLOSUM62)? - Answer It is the % identity between species. So a lower number is for more distantly related sequences and a higher value is for more related sequences.
What is a BLAST E-value? - Answer The expect value. It is the likelyhood that the alignment is significant. The number of alignment that are expected to find with a score greater than or equal to the score you have in a search against a random database. The lower the value the better.
What unix command allows you to create a BLAST database? - Answer makeblastdb
What file extensions will be created when making a command line nucleotide BLAST database? - Answer .ndb, .not, .ntf, .nsq, .nhr, .nin
How does ClustalW work? - Answer it uses a guide tree (created by neighbour joining) for multiple sequence alignments. It generates a pairwise distance matrix for all of the sequences. It does pairwise alignments of the outer branches, creating sequence profiles. The sequence profiles are aligned to make a full sequence alignment.
What is MAP for EST contig assembly? - Answer Multiple alignment program. Terminal and long internal gaps are not penelized. The user supplies the scoring parameters. Uses plain sequence data. Works with either nucleic acid or protein data. Runs in Unix.
What is CAP for contig assembly? - Answer Contig assembly program. Uses base quality values for overlaps and consensus sequence. Uses genomic or EST data. Runs under UNIX.
What are the steps of CAP? - Answer 1. remove poor quality end reads
what is Phrap? - Answer Fragment assembly program. Uses base quality values for overlaps and consensus sequence. works with EST or genomic data. takes trimmed phred files as input
What are the steps of Phrap contig assembly? - Answer 1. read in files
What are the categories in the Nucleic Acids Research database issue? - Answer 1. nucleic acid sequence and structure, transcriptional regulation
What is GMOD? - Answer generic model organism database. An opensource project to develop a complete set of software for creating and administering a model organism database.
What tools does GMOD include? - Answer genome visualization and editing tools, literature curation tools, database schema, biological ontology tools, a set of standard operating tools
What are some examples of Databases that are built on GMOD? - Answer WormBase, FlyBase, Mouse Genome Informatics, Gramene, Rat Genome Database, TAIR, EcoCyc, Saccharomyces Genome Database
What do function centric databases focus on? - Answer metabolic pathways, protein function, classifications
What are some examples of a metabolic reconstruction database? - Answer KEGG, WIT2, MetaCyc
What is the difference between a file system and a relational database - Answer File systems store data in files, relational databases store information in tables.