Individuals with trisomy 21 display complex phenotypes with differing degrees of severity. Numerous reliable methods have been established to diagnose the initial trisomy in these patients, but the identification and characterization of the genetic basis of the phenotypic variation in individuals with trisomy remains challenging. To date, methods that can accurately determine genotypes in trisomic DNA samples are expensive, require specialized equipment and complicated analyses. Here we report proof-of-concept results for an Invader® assay-based genotyping procedure that can determine SNP genotypes in trisomic genomic DNA samples in a simple and cost-effective manner. The procedure requires only two experimental steps: a real-time measurement of the fluorescent Invader® signal and analysis with a specifically designed clustering algorithm. The approach was tested using genomic DNA samples from 23 individuals with trisomy 21, and results were compared to genotypes previously determined with pyrosequencing. Additional assays for 15 SNPs were tested in a set of 21 DNA samples to assess assay performance. Our method successfully identified the correct SNP genotypes for the trisomic genomic DNA samples tested, and thus provides an alternative to determine SNP genotypes in trisomic DNA samples for subsequent association studies in patients with Down syndrome and other trisomies.
We describe a method to identify and recover minor human immunodeficiency virus type 1 (HIV-1) sequence variants from a complex population. The original heteroduplex tracking assay (HTA) was modified by incorporating a biotin tag into the probe to allow for direct sequence determination of the query strand. We used this approach to recover sequences from minor HIV-1 variants in the V3 region of the env gene, and to identify minor drug-resistant variants in pro. The biotin-HTA targeting of the V3 region of env allowed us to detect minor V3 variants, of which 45% were classified as CXCR4-using viruses. In addition, the biotin-protease HTA was able to detect mixtures of wild-type sequence and drug-resistance mutations in four subjects that were not detected by bulk sequence analysis. The biotin-HTA is a robust assay that first separates genetic variants then allows direct sequence analysis of major and minor variants.
Plasmids are ubiquitous mobile elements that serve as a pool of many host beneficial traits such as antibiotic resistance in bacterial communities. To understand the importance of plasmids in horizontal gene transfer, we need to gain insight into the ‘evolutionary history’ of these plasmids, i.e. the range of hosts in which they have evolved. Since extensive data support the proposal that foreign DNA acquires the host's nucleotide composition during long-term residence, comparison of nucleotide composition of plasmids and chromosomes could shed light on a plasmid's evolutionary history. The average absolute dinucleotide relative abundance difference, termed -distance, has been commonly used to measure differences in dinucleotide composition, or ‘genomic signature’, between bacterial chromosomes and plasmids. Here, we introduce the Mahalanobis distance, which takes into account the variance–covariance structure of the chromosome signatures. We demonstrate that the Mahalanobis distance is better than the -distance at measuring genomic signature differences between plasmids and chromosomes of potential hosts. We illustrate the usefulness of this metric for proposing candidate long-term hosts for plasmids, focusing on the virulence plasmids pXO1 from Bacillus anthracis, and pO157 from Escherichia coli O157:H7, as well as the broad host range multi-drug resistance plasmid pB10 from an unknown host.
Genomic parasites have evolved distinctive lifestyles to optimize replication in the context of the genomes they inhabit. Here, we introduced new DNA into eukaryotic cells using bacteriophage Mu DNA transposition complexes, termed ‘transpososomes’. Following electroporation of transpososomes and selection for marker gene expression, efficient integration was verified in yeast, mouse and human genomes. Although Mu has evolved in prokaryotes, strong biases were seen in the target site distributions in eukaryotic genomes, and these biases differed between yeast and mammals. In Saccharomyces cerevisiae transposons accumulated outside of genes, consistent with selection against gene disruption. In mouse and human cells, transposons accumulated within genes, which previous work suggests is a favorable location for efficient expression of selectable markers. Naturally occurring transposons and viruses in yeast and mammals show related, but more extreme, targeting biases, suggesting that they are responding to the same pressures. These data help clarify the constraints exerted by genome structure on genomic parasites, and illustrate the wide utility of the Mu transpososome technology for gene transfer in eukaryotic cells.
Long hairpin RNA (lhRNA) construct-induced gene silencing facilitates the study of gene function in plants and animals, but constructing multiple lhRNA vectors using traditional approaches is both time-consuming and costly. Also, most of the existing approaches are based on sequence-specific cloning of individual sequences, and are therefore not suitable for preparing hpRNA libraries from a pool of mixed target sequences. Here we describe a rolling-circle amplification (RCA)-mediated hpRNA (RMHR) construction system suitable for generating libraries of lhRNA constructs from any gene of interest or pool of genes. Using RMHR we successfully generated a lhRNA library from a Arabidopsis cDNA population containing known and unknown genes, with an average size of 500–800 bp for the inverted-repeat inserts. To validate the RMHR system, lhRNA constructs targeting the β-glucuronidase (GUS) gene were tested using Agrobacterium infiltration and shown to be effective at inducing GUS silencing in tobacco leaves. Our results indicate that the RMHR technique permits rapid, efficient and low-cost preparation of genome-wide lhRNA expression libraries.
Bisulfite treatment can be used to ascertain the methylation states of individual cytosines in DNA. Ideally, bisulfite treatment deaminates unmethylated cytosines to uracils, and leaves 5-methylcytosines unchanged. Two types of bisulfite-conversion error occur: inappropriate conversion of 5-methylcytosine to thymine, and failure to convert unmethylated cytosine to uracil. Conventional bisulfite treatment requires hours of exposure to low-molarity, low-temperature bisulfite (‘LowMT’) and, sometimes, thermal denaturation. An alternate, high-molarity, high-temperature (‘HighMT’) protocol has been reported to accelerate conversion and to reduce inappropriate conversion. We used molecular encoding to obtain validated, individual-molecule data on failed- and inappropriate-conversion frequencies for LowMT and HighMT treatments of both single-stranded and hairpin-linked oligonucleotides. After accounting for bisulfite-independent error, we found that: (i) inappropriate-conversion events accrue predominantly on molecules exposed to bisulfite after they have attained complete or near-complete conversion; (ii) the HighMT treatment is preferable because it yields greater homogeneity among sites and among molecules in conversion rates, and thus yields more reliable data; (iii) different durations of bisulfite treatment will yield data appropriate to address different experimental questions; and (iv) conversion errors can be used to assess the validity of methylation data collected without the benefit of molecular encoding.
A novel method to efficiently generate unmarked in-frame gene deletions in Rhodococcus equi was developed, exploiting the cytotoxic effect of 5-fluorocytosine (5-FC) by the action of cytosine deaminase (CD) and uracil phosphoribosyltransferase (UPRT) enzymes. The opportunistic, intracellular pathogen R. equi is resistant to high concentrations of 5-FC. Introduction of Escherichia coli genes encoding CD and UPRT conferred conditional lethality to R. equi cells incubated with 5-FC. To exemplify the use of the codA::upp cassette as counter-selectable marker, an unmarked in-frame gene deletion mutant of R. equi was constructed. The supA and supB genes, part of a putative cholesterol catabolic gene cluster, were efficiently deleted from the R. equi wild-type genome. Phenotypic analysis of the generated supAB mutant confirmed that supAB are essential for growth of R. equi on cholesterol. Macrophage survival assays revealed that the supAB mutant is able to survive and proliferate in macrophages comparable to wild type. Thus, cholesterol metabolism does not appear to be essential for macrophage survival of R. equi. The CD-UPRT based 5-FC counter-selection may become a useful asset in the generation of unmarked in-frame gene deletions in other actinobacteria as well, as actinobacteria generally appear to be 5-FC resistant and 5-FU sensitive.
We have adapted an electrophoretic mobility shift assay (EMSA) to isolate genomic DNA fragments that bind the archaeal transcription initiation factors TATA-binding protein (TBP) and transcription factor B (TFB) to perform a genome-wide search for promoters. Mobility-shifted fragments were cloned, tested for their ability to compete with known promoter-containing fragments for a limited concentration of transcription factors, and sequenced. We applied the method to search for promoters in the genome of Methanocaldococcus jannaschii. Selection was most efficient for promoters of tRNA genes and genes for several presumed small non-coding RNAs (ncRNA). Protein-coding gene promoters were dramatically underrepresented relative to their frequency in the genome. The repeated isolation of these genomic regions was partially rectified by including a hybridization-based screening. Sequence alignment of the affinity-selected promoters revealed previously identified TATA box, BRE, and the putative initiator element. In addition, the conserved bases immediately upstream and downstream of the BRE and TATA box suggest that the composition and structure of archaeal natural promoters are more complicated.
Poly(ADP-ribose) (pADPr) is a polymer assembled from the enzymatic polymerization of the ADP-ribosyl moiety of NAD by poly(ADP-ribose) polymerases (PARPs). The dynamic turnover of pADPr within the cell is essential for a number of cellular processes including progression through the cell cycle, DNA repair and the maintenance of genomic integrity, and apoptosis. In spite of the considerable advances in the knowledge of the physiological conditions modulated by poly(ADP-ribosyl)ation reactions, and notwithstanding the fact that pADPr can play a role of mediator in a wide spectrum of biological processes, few pADPr binding proteins have been identified so far. In this study, refined in silico prediction of pADPr binding proteins and large-scale mass spectrometry-based proteome analysis of pADPr binding proteins were used to establish a comprehensive repertoire of pADPr-associated proteins. Visualization and modeling of these pADPr-associated proteins in networks not only reflect the widespread involvement of poly(ADP-ribosyl)ation in several pathways but also identify protein targets that could shed new light on the regulatory functions of pADPr in normal physiological conditions as well as after exposure to genotoxic stimuli.
The Arabidopsis RNA-binding protein AtGRP8 undergoes negative autoregulation at the post-transcriptional level. An elevated AtGRP8 protein level promotes the use of a cryptic 5' splice site to generate an alternatively spliced transcript, as_AtGRP8, retaining the 5' half of the intron with a premature termination codon. In mutants defective in nonsense-mediated decay (NMD) abundance of as_AtGRP8 but not its pre-mRNA is elevated, indicating that as_AtGRP8 is a direct NMD target, thus limiting the production of functional AtGRP8 protein. In addition to its own pre-mRNA, AtGRP8 negatively regulates the AtGRP7 transcript through promoting the formation of the equivalent alternatively spliced as_AtGRP7 transcript, leading to a decrease in AtGRP7 abundance. Recombinant AtGRP8 binds to its own and the AtGRP7 pre-mRNA, suggesting that this interaction is relevant for the splicing decision in vivo. AtGRP7 itself is part of a negative autoregulatory circuit that influences circadian oscillations of its own and the AtGRP8 transcript through alternative splicing linked to NMD. Thus, we identify an interlocked feedback loop through which two RNA-binding proteins autoregulate and reciprocally crossregulate by coupling unproductive splicing to NMD. A high degree of evolutionary sequence conservation in the introns retained in as_AtGRP8 or as_AtGRP7 points to an important function of these sequences.
The presence of a homing endonuclease gene (HEG) within a microbial intron or intein empowers the entire element with the ability to invade genomic targets. The persistence of a homing endonuclease lineage depends in part on conservation of its DNA target site. One such rDNA sequence has been invaded both in archaea and in eukarya, by LAGLIDADG and His–Cys box homing endonucleases, respectively. The bases encoded by this target include a universally conserved ribosomal structure, termed helix 69 (H69) in the large ribosomal subunit. This region forms the ‘B2a’ intersubunit bridge to the small ribosomal subunit, contacts bound tRNA in the A- and P-sites, and acts as a trigger for ribosome disassembly through its interactions with ribosome recycling factor. We have determined the DNA-bound structure and specificity profile of an archaeal LAGLIDADG homing endonuclease (I-Vdi141I) that recognizes this target site, and compared its specificity with the analogous eukaryal His–Cys box endonuclease I-PpoI. These homodimeric endonuclease scaffolds have arrived at similar specificity profiles across their common biological target and analogous solutions to the problem of accommodating conserved asymmetries within the DNA sequence, but with differences at individual base pairs that are fine-tuned to the sequence conservation of archaeal versus eukaryal ribosomes.
A straightforward enzymatic protocol for converting regular DNA into pseudo-complementary DNA could improve the performance of oligonucleotide microarrays by generating readily hybridizable structure-free targets. Here we screened several highly destabilizing analogs of G and C for one that could be used with 2-aminoadenine (nA) and 2-thiothymine (sT) to generate structure-free DNA that is fully accessible to complementary probes. The analogs, which included bioactive bases such as 6-thioguanine (sG), 5-nitrocytosine (NitroC), 2-pyrimidinone (P; the free base of zebularine) and 6-methylfuranopyrimidinone (MefP), were prepared as dNTPs and evaluated as substrates for T7 and Phi29 DNA polymerases that lacked editor function. Pairing properties of the analogs were characterized by solution hybridization assays using modified oligonucleotides or primer extension products. P and MeP did not support robust primer extension whereas sG and NitroC did. In hybridization assays, however, sG lacked discrimination and NitroC paired too strongly to C. The dNTPs of two other base analogs, 7-nitro-7-deazahypoxanthine (NitrocH) and 2-thiocytosine (sC), exhibited the greatest promise. Either analog could be used with nA and sT to generate DNA that was nearly structure-free. Hybridization of probes to these modified DNAs will require the development of base analogs that pair strongly to NitrocH or sC.
To investigate nucleic acid base pairing and stacking via atom-specific mutagenesis and crystallography, we have synthesized for the first time the 6-Se-deoxyguanosine phosphoramidite and incorporated it into DNAs via solid-phase synthesis with a coupling yield over 97%. We found that the UV absorption of the Se-DNAs red-shifts over 100 nm to 360 nm ( = 2.3 x 104 M–1 cm–1), the Se-DNAs are yellow colored, and this Se modification is relatively stable in water and at elevated temperature. Moreover, we successfully crystallized a ternary complex of the Se-G-DNA, RNA and RNase H. The crystal structure determination and analysis reveal that the overall structures of the native and Se-modified nucleic acid duplexes are very similar, the selenium atom participates in a Se-mediated hydrogen bond (Se ... H–N), and the SeG and C form a base pair similar to the natural G–C pair though the Se-modification causes the base-pair to shift (approximately 0.3 Å). Our biophysical and structural studies provide new insights into the nucleic acid flexibility, duplex recognition and stability. Furthermore, this novel selenium modification of nucleic acids can be used to investigate chemogenetics and structure of nucleic acids and their protein complexes.
Insulators are defined as a class of regulatory elements that delimit independent transcriptional domains within eukaryotic genomes. According to previous data, an interaction (pairing) between some Drosophila insulators can support distant activation of a promoter by an enhancer. Here, we have demonstrated that pairs of well-studied insulators such as scs–scs, scs’–scs’, 1A2–1A2 and Wari–Wari support distant activation of the white promoter by the yeast GAL4 activator in an orientation-dependent manner. The same is true for the efficiency of the enhancer that stimulates white expression in the eyes. In all insulator pairs tested, stimulation of the white gene was stronger when insulators were inserted between the eye enhancer or GAL4 and the white promoter in opposite orientations relative to each other. As shown previously, Zw5, Su(Hw) and dCTCF proteins are required for the functioning of different insulators that do not interact with each other. Here, strong functional interactions have been revealed between DNA fragments containing binding sites for either Zw5 or Su(Hw) or dCTCF protein but not between heterologous binding sites [Zw5–Su(Hw), dCTCF–Su(Hw), or dCTCF–Zw5]. These results suggest that insulator proteins can support selective interactions between distant regulatory elements.
RecG and RuvAB are proposed to act at stalled DNA replication forks to facilitate replication restart. To define the roles of these proteins in fork regression, we used a combination of assays to determine whether RecG, RuvAB or both are capable of acting at a stalled fork. The results show that RecG binds to the C-terminus of single-stranded DNA binding protein (SSB) forming a stoichiometric complex of 2 RecG monomers per SSB tetramer. This binding occurs in solution and to SSB protein bound to single stranded DNA (ssDNA). The result of this binding is stabilization of the interaction of RecG with ssDNA. In contrast, RuvAB does not bind to SSB. Side-by-side analysis of the catalytic efficiency of the ATPase activity of each enzyme revealed that (–)scDNA and ssDNA are potent stimulators of the ATPase activity of RecG but not for RuvAB, whereas relaxed circular DNA is a poor cofactor for RecG but an excellent one for RuvAB. Collectively, these data suggest that the timing of repair protein access to the DNA at stalled forks is determined by the nature of the DNA available at the fork. We propose that RecG acts first, with RuvAB acting either after RecG or in a separate pathway following protein-independent fork regression.
HIV-1 integrase (IN) oligomerization and DNA recognition are crucial steps for the subsequent events of the integration reaction. Recent advances described the involvement of stable intermediary complexes including dimers and tetramers in the in vitro integration processes, but the initial attachment events and IN positioning on viral ends are not clearly understood. In order to determine the role of the different IN oligomeric complexes in these early steps, we performed in vitro functional analysis comparing IN preparations having different oligomerization properties. We demonstrate that in vitro IN concerted integration activity on a long DNA substrate containing both specific viral and nonspecific DNA sequences is highly dependent on binding of preformed dimers to viral ends. In addition, we show that IN monomers bound to nonspecific DNA can also fold into functionally different oligomeric complexes displaying nonspecific double-strand DNA break activity in contrast to the well known single strand cut catalyzed by associated IN. Our results imply that the efficient formation of the active integration complex highly requires the early correct positioning of monomeric integrase or the direct binding of preformed dimers on the viral ends. Taken together the data indicates that IN oligomerization controls both the enzyme specificity and activity.