NAR Top Articles - Methods Online
Demonstration of CRISPR/Cas9/sgRNA-mediated targeted gene modification in Arabidopsis, tobacco, sorghum and rice
Jiang, WZ; Zhou, HB; Bi, HH; Fromm, M; Yang, B; Weeks, DP
Nucleic Acids Res. 2013, 41, e188-e188
Free Full Text
The type II CRISPR/Cas system from Streptococcus pyogenes and its simplified derivative, the Cas9/single guide RNA (sgRNA) system, have emerged as potent new tools for targeted gene knockout in bacteria, yeast, fruit fly, zebrafish and human cells. Here, we describe adaptations of these systems leading to successful expression of the Cas9/sgRNA system in two dicot plant species, Arabidopsis and tobacco, and two monocot crop species, rice and sorghum. Agrobacterium tumefaciens was used for delivery of genes encoding Cas9, sgRNA and a non-functional, mutant green fluorescent protein (GFP) to Arabidopsis and tobacco. The mutant GFP gene contained target sites in its 5' coding regions that were successfully cleaved by a Cas9/sgRNA complex that, along with error-prone DNA repair, resulted in creation of functional GFP genes. DNA sequencing confirmed Cas9/sgRNA-mediated mutagenesis at the target site. Rice protoplast cells transformed with Cas9/sgRNA constructs targeting the promoter region of the bacterial blight susceptibility genes, OsSWEET14 and OsSWEET11, were confirmed by DNA sequencing to contain mutated DNA sequences at the target sites...
Multiplex CRISPR/Cas9-based genome engineering from a single lentiviral vector
Kabadi, AM; Ousterout, DG; Hilton, IB; Gersbach, CA
Nucleic Acids Res. 2014, 42, e147-e147
Free Full Text
Engineered DNA-binding proteins that manipulate the human genome and transcriptome have enabled rapid advances in biomedical research. In particular, the RNA-guided CRISPR/Cas9 system has recently been engineered to create site-specific double-strand breaks for genome editing or to direct targeted transcriptional regulation. A unique capability of the CRISPR/Cas9 system is multiplex genome engineering by delivering a single Cas9 enzyme and two or more single guide RNAs (sgRNAs) targeted to distinct genomic sites. This approach can be used to simultaneously create multiple DNA breaks or to target multiple transcriptional activators to a single promoter for synergistic enhancement of gene induction. To address the need for uniform and sustained delivery of multiplex CRISPR/Cas9-based genome engineering tools, we developed a single lentiviral system to express a Cas9 variant, a reporter gene and up to four sgRNAs from independent RNA polymerase III promoters that are incorporated into the vector by a convenient Golden Gate cloning method. Each sgRNA is efficiently expressed and can mediate multiplex gene editing and sustained transcriptional activation in immortalized and primary human cells...
Evaluation of general 16S ribosomal RNA gene PCR primers for classical and next-generation sequencing-based diversity studies
Klindworth, A; Pruesse, E; Schweer, T; Peplies, J; Quast, C; Horn, M; Glockner, FO
Nucleic Acids Res. 2013, 41, e1-e1
Free Full Text
16S ribosomal RNA gene (rDNA) amplicon analysis remains the standard approach for the cultivation-independent investigation of microbial diversity. The accuracy of these analyses depends strongly on the choice of primers. The overall coverage and phylum spectrum of 175 primers and 512 primer pairs were evaluated in silico with respect to the SILVA 16S/18S rDNA non-redundant reference dataset (SSURef 108 NR). Based on this evaluation a selection of 'best available' primer pairs for Bacteria and Archaea for three amplicon size classes (100-400, 400-1000, ≥1000 bp) is provided. The most promising bacterial primer pair (S-D-Bact-0341-b-S-17/S-D-Bact-0785-a-A-21), with an amplicon size of 464 bp, was experimentally evaluated by comparing the taxonomic distribution of the 16S rDNA amplicons with 16S rDNA fragments from directly sequenced metagenomes. The results of this study may be used as a guideline for selecting primer pairs with the best overall coverage and phylum spectrum for specific applications, therefore reducing the bias in PCR-based microbial diversity studies.
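As a rough illustration of what an in silico coverage evaluation of a primer pair involves, the sketch below computes the fraction of reference sequences matched by both primers of a pair. It is not the procedure used in the study: the function names are hypothetical, matching is exact (no mismatches allowed), degenerate bases are expanded with the standard IUPAC codes, and the sequences in the usage note are toy data rather than real primers or SILVA entries.

```python
import re

# Standard IUPAC nucleotide codes expanded to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}
# Complement table that also covers the degenerate codes.
COMPLEMENT = str.maketrans("ACGTRYSWKMBDHVN", "TGCAYRSWMKVHDBN")


def normalise(seq):
    """Upper-case a sequence and write RNA 'U' as DNA 'T'."""
    return seq.upper().replace("U", "T")


def primer_regex(primer):
    """Compile a primer (possibly degenerate) into a regular expression."""
    return re.compile("".join(IUPAC[base] for base in normalise(primer)))


def reverse_complement(seq):
    """Reverse complement of a (possibly degenerate) sequence."""
    return normalise(seq).translate(COMPLEMENT)[::-1]


def pair_coverage(forward, reverse, sequences):
    """Fraction of sequences in which the forward primer matches and the
    reverse primer's binding site occurs downstream of that match.

    Assumption: exact matching only; only the first forward match is tried.
    """
    fwd = primer_regex(forward)
    rev = primer_regex(reverse_complement(reverse))
    hits = 0
    for seq in sequences:
        target = normalise(seq)
        match = fwd.search(target)
        if match and rev.search(target, match.end()):
            hits += 1
    return hits / len(sequences) if sequences else 0.0
```

With the toy call `pair_coverage("ACGN", "TTRC", ["ACGTGGGGCAA", "ACGAGGGGTAA", "ACGTGGGGAAA", "TTTTGCAA"])`, two of the four sequences carry both binding sites. A realistic evaluation would also tolerate a limited number of mismatches and report coverage per phylum.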
Easy quantitative assessment of genome editing by sequence trace decomposition
Brinkman, EK; Chen, T; Amendola, M; van Steensel, B
Nucleic Acids Res. 2014, 42, e168-e168
Free Full Text
The efficacy and the mutation spectrum of genome editing methods can vary substantially depending on the targeted sequence. A simple, quick assay to accurately characterize and quantify the induced mutations is therefore needed. Here we present TIDE, a method for this purpose that requires only a pair of PCR reactions and two standard capillary sequencing runs. The sequence traces are then analyzed by a specially developed decomposition algorithm that identifies the major induced mutations in the projected editing site and accurately determines their frequency in a cell population. This method is cost-effective and quick, and it provides much more detailed information than current enzyme-based assays. An interactive web tool for automated decomposition of the sequence traces is available. TIDE greatly facilitates the testing and rational design of genome editing strategies.
Computational analysis of bacterial RNA-Seq data
McClure, R; Balasubramanian, D; Sun, Y; Bobrovskyy, M; Sumby, P; Genco, CA; Vanderpool, CK; Tjaden, B
Nucleic Acids Res. 2013, 41, e140-e140
Free Full Text
Recent advances in high-throughput RNA sequencing (RNA-seq) have enabled tremendous leaps forward in our understanding of bacterial transcriptomes. However, computational methods for analysis of bacterial transcriptome data have not kept pace with the large and growing data sets generated by RNA-seq technology. Here, we present new algorithms, specific to bacterial gene structures and transcriptomes, for analysis of RNA-seq data. The algorithms are implemented in an open source software system called Rockhopper that supports various stages of bacterial RNA-seq data analysis, including aligning sequencing reads to a genome, constructing transcriptome maps, quantifying transcript abundance, testing for differential gene expression, determining operon structures and visualizing results. We demonstrate the performance of Rockhopper using 2.1 billion sequenced reads from 75 RNA-seq experiments conducted with Escherichia coli, Neisseria gonorrhoeae, Salmonella enterica, Streptococcus pyogenes and Xenorhabdus nematophila. We find that the transcriptome maps generated by our algorithms are highly accurate when compared with focused experimental data from E. coli and N. gonorrhoeae...
svaseq: removing batch effects and other unwanted noise from sequencing data
Nucleic Acids Res. 2014, 42, 00-a-
Free Full Text
It is now known that unwanted noise and unmodeled artifacts such as batch effects can dramatically reduce the accuracy of statistical inference in genomic experiments. These sources of noise must be modeled and removed to accurately measure biological variability and to obtain correct statistical inference when performing high-throughput genomic analysis. We introduced surrogate variable analysis (sva) for estimating these artifacts by (i) identifying the part of the genomic data only affected by artifacts and (ii) estimating the artifacts with principal components or singular vectors of the subset of the data matrix. The resulting estimates of artifacts can be used in subsequent analyses as adjustment factors to correct analyses. Here I describe a version of the sva approach specifically created for count data or FPKMs from sequencing experiments based on appropriate data transformation. I also describe the addition of supervised sva (ssva) for using control probes to identify the part of the genomic data only affected by artifacts. I present a comparison between these versions of sva and other methods for batch effect estimation on simulated data, real count-based data and FPKM-based data...
An optimized streptavidin-binding RNA aptamer for purification of ribonucleoprotein complexes identifies novel ARE-binding proteins
Leppek, K; Stoecklin, G
Nucleic Acids Res. 2014, 42, e13-e13
Free Full Text
Determining the composition of messenger ribonucleoprotein (mRNP) particles is essential for a comprehensive understanding of the complex mechanisms underlying mRNA regulation, but is technically challenging. Here we present an RNA-based method to identify RNP components using a modified streptavidin (SA)-binding RNA aptamer termed S1m. By optimizing the RNA aptamer S1 in structure and repeat conformation, we improved its affinity for SA and found a 4-fold repeat of S1m (4×S1m) to be more efficient than the established MS2 and PP7 systems from bacteriophages. We then attached the AU-rich element (ARE) of tumor necrosis factor alpha (TNFα), a well-known RNA motif that induces mRNA degradation, via 4×S1m to a SA matrix, and used the resulting RNA affinity column to purify ARE-binding proteins (BPs) from cellular extracts. By quantitative mass spectrometry using differential dimethyl labeling, we identified the majority of established ARE-BPs and detected several RNA-BPs that had previously not been associated with AREs. For two of these proteins, Rbms1 and Roxan, we confirmed specific binding to the TNFα ARE...
Long non-coding RNAs function annotation: a global prediction method based on bi-colored networks
Guo, XL; Gao, L; Liao, Q; Xiao, H; Ma, XK; Yang, XF; Luo, HT; Zhao, GG; Bu, DC; Jiao, F; Shao, QX; Chen, RS; Zhao, Y
Nucleic Acids Res. 2013, 41, e35-e35
Free Full Text
Increasing evidence demonstrates that long non-coding RNAs (lncRNAs) play many key roles in diverse biological processes. There is a critical need to annotate the functions of the increasing number of available lncRNAs. In this article, we apply a global network-based strategy to tackle this issue for the first time. We develop a bi-colored network-based global function predictor, long non-coding RNA global function predictor ('lnc-GFP'), to predict probable functions for lncRNAs at large scale by integrating gene expression data and protein interaction data. The performance of lnc-GFP is evaluated on protein-coding and lncRNA genes. Cross-validation tests on protein-coding genes with known function annotations indicate that our method can achieve a precision of up to 95% with a suitable parameter setting. Among the 1713 lncRNAs in the bi-colored network, the 1625 (94.9%) lncRNAs in the maximum connected component are all functionally characterized. For the lncRNAs expressed in mouse embryonic stem cells and neuronal cells, the putative functions inferred by our method closely match those in the known literature.
Impact of sequencing depth in ChIP-seq experiments
Jung, YL; Luquette, LJ; Ho, JWK; Ferrari, F; Tolstorukov, M; Minoda, A; Issner, R; Epstein, CB; Karpen, GH; Kuroda, MI; Park, PJ
Nucleic Acids Res. 2014, 42, e74-e74
Free Full Text
In a chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) experiment, an important consideration in experimental design is the minimum number of sequenced reads required to obtain statistically significant results. We present an extensive evaluation of the impact of sequencing depth on identification of enriched regions for key histone modifications (H3K4me3, H3K36me3, H3K27me3 and H3K9me2/me3) using deep-sequenced datasets in human and fly. We propose to define sufficient sequencing depth as the number of reads at which detected enrichment regions increase < 1% for an additional million reads. Although the required depth depends on the nature of the mark and the state of the cell in each experiment, we observe that sufficient depth is often reached at < 20 million reads for fly. For human, there are no clear saturation points for the examined datasets, but our analysis suggests 40-50 million reads as a practical minimum for most marks. We also devise a mathematical model to estimate the sufficient depth and total genomic coverage of a mark...
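The proposed definition of sufficient depth (detected enriched regions grow by less than 1% per additional million reads) can be applied to a subsampling series along these lines. This is a minimal sketch with a hypothetical function name and made-up numbers; it assumes the region counts come from peak calling on subsamples of increasing depth, and it approximates the per-million growth linearly between consecutive subsampling points. It does not implement the mathematical model mentioned in the abstract.

```python
def sufficient_depth(depths_millions, enriched_regions, threshold=0.01):
    """Return the first depth (in millions of reads) at which the relative
    gain in enriched regions per additional million reads drops below
    `threshold`, or None if no such saturation point is observed.

    depths_millions:  strictly increasing read depths, in millions
    enriched_regions: number of enriched regions detected at each depth
    """
    points = list(zip(depths_millions, enriched_regions))
    for (d0, r0), (d1, r1) in zip(points, points[1:]):
        if d1 <= d0:
            raise ValueError("depths must be strictly increasing")
        if r0 <= 0:
            continue  # relative gain is undefined without a baseline
        # Relative increase, averaged over the extra millions of reads.
        gain_per_million = (r1 - r0) / r0 / (d1 - d0)
        if gain_per_million < threshold:
            return d0
    return None
```

For example, with depths of 5, 10, 15 and 20 million reads and 1000, 1400, 1500 and 1520 detected regions, the gain first falls below 1% per million between 15 and 20 million reads, so the function returns 15. A series that keeps growing, as reported here for the human datasets, returns `None`.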
High-efficiency genome editing via 2A-coupled co-expression of fluorescent proteins and zinc finger nucleases or CRISPR/Cas9 nickase pairs
Duda, K; Lonowski, LA; Kofoed-Nielsen, M; Ibarra, A; Delay, CM; Kang, QH; Yang, Z; Pruett-Miller, SM; Bennett, EP; Wandall, HH; Davis, GD; Hansen, SH; Frodin, M
Nucleic Acids Res. 2014, 42, e84-e84
Free Full Text
Targeted endonucleases including zinc finger nucleases (ZFNs) and clustered regularly interspaced short palindromic repeats (CRISPRs)/Cas9 are increasingly being used for genome editing in higher species. We therefore devised a broadly applicable and versatile method for increasing editing efficiencies by these tools. Briefly, 2A peptide-coupled co-expression of fluorescent protein and nuclease was combined with fluorescence-activated cell sorting (FACS) to allow for efficient isolation of cell populations with increasingly higher nuclease expression levels, which translated into increasingly higher genome editing rates. For ZFNs, this approach, combined with delivery of donors as single-stranded oligodeoxynucleotides and nucleases as messenger ribonucleic acid, enabled high knockin efficiencies in demanding applications, including biallelic codon conversion frequencies reaching 30-70% at high transfection efficiencies and ~2% at low transfection efficiencies...
Impact factor: 9.112
5-Yr impact factor: 8.867