Gene Ndas_3381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3381 
Symbol 
ID9247246 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4039224 
End bp4042247 
Gene Length3024 bp 
Protein Length1007 aa 
Translation table11 
GC content74% 
IMG OID 
Productribonuclease, Rne/Rng family 
Protein accessionYP_003681292 
Protein GI297562318 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGACA ACGAGCCCAA CAACGGTGCC GACGGCACCG CGGGTACAAC TGAAACAACC 
AACAACACGG CGGCGGCGCC CTCCACGGGG GGCGGTGAGG TGCGCGTGAC CACGCCCGCG
CCCGCCAAGG GCGTCCGCCG CGCCGCCGGA CCACCCCCCG AACCCGACGA CGTCTTCACC
CCCCCCTCCG CGCCCGTCAC CTCCCTCGCC GGTTCCGGCG AGGGGGGCAC GGTCAAGACC
TCGGTGGCCA CCACCTCGCG CAGGCGCACC GTGCGCGCCG CGGGCCGTCC CGACGAGGAC
GCCGCCCCGC GCCGTTCGCG CGCCTCCGCT GACGACGAGG GCACCGCTCC GCGCCGGGCC
CATGCCTCCG CTGACGAGGA GGCCGCTCCG CGCCGTTCGC GCGCCTCCTC CGCCTCGGAG
GACACCGCCC CCTCCACGGC CAGGAGCCGC GGCCGTTCGC GCGCCCGCTC CGCCCCGGCC
GCGGAGACCG CCGGGCCCGC GCCCTCCGAG GCGGCCGACG AGCCCGCCAA GGACGAGGAG
ACCGTGGAGG CGGGCGGCGG CAACGTCTTC CAGGCCCCCT CCCTGCTCTT CCAGCCCCCG
GTGGCCAGCG CCGCCCCCGT GCCCGCACGC AGCACCGCCC CGGCGGAGGC CGAGGAGGAG
TCCGAGGAGG CCGAGGAGGA GACCGCCCAG GCCGCCGTCG CGGAGGAGAC CGGCGACGGG
GACGACGACC GCCCCAGCCG CCGTCGCCGC CGTCGCGGAG GCCGGGGACG CGGCCGCTCC
CGCGCGGACG AGAACGAGTC CGCCGAGGCC CCCGAGTCCG CCGAGGACAA GGCGGAGCAG
CCCGGCGACA GGGACCGGGG CGAGGACGAC GGCGCGGCCG ACGAGTCCGC CGAGTCCGAC
GACCGCAGCA GCCGCCGTCG GCGCCGCCGT CGCCGCCGCT CCGGCGGGGG AGAGGGCCCC
GAGGCCACCC CGGACGACCC GCCCAACACC GTGGTCCGGG TGCGCGAACC CCGCCAGGAG
AAGCCGATCG AGGACGAGGT CCAGGCGGTC CGCGGCTCCA CCCGGCTGGA GGCCAAGAAG
CAGCGCCGCC GCGAGGGCCG TGAGCAGGGC CGCCGCCGCG CCCCCATCGT CACCGAGTCG
GAGTTCCTGG CGCGGCGGGA GTCGGTCAAG CGGGACCTGG TGGTGCGCCG CGTCGAGGAC
CGCACCCAGA TCGCCGTCCT CGAGGACGAC ATCCTCGTCG AGCACTACGT CGACCGGGCC
ACGCACCGCT CCTACGTGGG CAACGTGTAC CTGGGCCGGG TGCAGAACGT GCTGCCCTCG
ATGGAGGCCG CGTTCGTCGA CATCGGCAAG GGCCGCAACG CCGTCCTGTA CGCGGGCGAG
GTCAACTGGG ACTCCTTCGG CCTGGACGGC CAGCCCAAGC GCATCGAGTC GGTGCTCAAG
TCCGGCCAGT CGGTCCTGGT GCAGGTCACC AAGGACCCCG TCGGCCACAA GGGCGCCCGC
CTGACCAGCC AGATCAGCCT GCCCGGCCGC TACCTCGTGT ACGTGCCCGG CGGTTCGATG
ACCGGCATCA GCCGCAAGCT CCCCGACAAG GAGCGCGCGC GCCTCAAGCA GATCCTCAAG
AAGGTCATGC CCTCCGGCGC GGGCGTCATC GTGCGCACGG CCGCCGAGGG GGCCAGCGAG
GAGGAGCTGG AGCGCGACAT CACCCGTCTC GCCAAGCAGT GGGACTCCAT CAAGCGCAAG
TCCAAGTCGG CCAACGCCCC CTCGCTGCTC AACAGCGAGC CCGACCTGAC GGTGCGGGTC
GTGCGCGACG TGTTCAACGA GGACTTCTCC AGCCTCGTCG TGTCCGGTGA GGAGGCCTGG
ACCACCGTCC GCGAGTACGT GGACTACGTG GCCCCCAACC TCTCCGAGCG CCTCTCGCAC
TGGAACGAGG ACCGCGACGT CTTCGCCGCC TACCGCGTCG ACGAGCAGAT CAACAAGGCC
CTGGAGCGCA AGGTCTGGCT GCCCAGCGGC GGATCGCTGG TCATCGACCG CACCGAGGCC
ATGACCGTGG TCGACGTCAA CACCGGCAAG TTCACCGGTC AGGGCGGCAA CCTGGAGGAG
ACGGTCACCA AGAACAACCT GGAGGCGGCC GAGGAGATCG TCCGCCAGCT CCGGTTGCGC
GACATCGGCG GCATCATCGT CATCGACTTC ATCGACATGG TCCTGGAGTC CAACCGCGAC
CTGGTGCTGC GGCGCATGCT GGAGTGCCTC TCGCGCGACC GCACCAAGCA CCAGGTGGCC
GAGGTGACCT CGCTGGGGCT GGTCCAGATG ACCCGCAAGC GGGTCGGCCA GGGCCTGCTG
GAGGCCTTCT CGCACAGCTG CGAGCACTGC AACGGCCGCG GCCTGGTGGT CGCCAGCGAC
CCGGTCGAGA GCAAGGGCGG CGGCAGCGGC AACGGCCGCA AGAAGAAGAA GGGCAAGGGC
GAGCCCGACC AGGCGGCCGA GAAGCCGGAG AAGGCCCAGA AGCCGTCCTC CGAGAAGGCC
GACGGGGCCG ACGGGCCGGA GAAGGCCGAG AAGGCGGGCG CTGACTCCGA CGAGTCCGCC
GAGGCGGACG CGGCGCGGAC CGAGGAGGCA CCCGAGGCCC CGCAGACGGC CGAGGCCCCG
GCCTCCGAGC CCGCCAAGAA GACCCGTAAG CGGGCCTCCC GCTCCCGCAA GGCCGAGGCC
GCCGCGGAGG AGCAGGAGGA GATCACCGCC ACGGAGGAGG CTCCTGAGGC CGCGCAGCCC
GAGCAGGCCG CGGAGGCCCC CGAGGAGGCC CCGGCCAAGA AGACCCGCGC CCGGCGCACC
ACCCGTCGTA CCGCCCGCGG GGCGGCCGAC ACCGGTGAGG CCGAGGCCGC GGGCGGTGAG
AGCGCCCCCG CCGAGGCGGA CGCGACCGCC GCCGAGACGC CCGCCGAGGC CGGTGACGAC
GAGGCCGCCG AGCGGCCCAA GCGTCGCCGC ACCCGGCGCA CCAGGGCGGC GGCCACCCCG
CCGACCGCCG TGGACGCGGG CTGA
 
Protein sequence
MLDNEPNNGA DGTAGTTETT NNTAAAPSTG GGEVRVTTPA PAKGVRRAAG PPPEPDDVFT 
PPSAPVTSLA GSGEGGTVKT SVATTSRRRT VRAAGRPDED AAPRRSRASA DDEGTAPRRA
HASADEEAAP RRSRASSASE DTAPSTARSR GRSRARSAPA AETAGPAPSE AADEPAKDEE
TVEAGGGNVF QAPSLLFQPP VASAAPVPAR STAPAEAEEE SEEAEEETAQ AAVAEETGDG
DDDRPSRRRR RRGGRGRGRS RADENESAEA PESAEDKAEQ PGDRDRGEDD GAADESAESD
DRSSRRRRRR RRRSGGGEGP EATPDDPPNT VVRVREPRQE KPIEDEVQAV RGSTRLEAKK
QRRREGREQG RRRAPIVTES EFLARRESVK RDLVVRRVED RTQIAVLEDD ILVEHYVDRA
THRSYVGNVY LGRVQNVLPS MEAAFVDIGK GRNAVLYAGE VNWDSFGLDG QPKRIESVLK
SGQSVLVQVT KDPVGHKGAR LTSQISLPGR YLVYVPGGSM TGISRKLPDK ERARLKQILK
KVMPSGAGVI VRTAAEGASE EELERDITRL AKQWDSIKRK SKSANAPSLL NSEPDLTVRV
VRDVFNEDFS SLVVSGEEAW TTVREYVDYV APNLSERLSH WNEDRDVFAA YRVDEQINKA
LERKVWLPSG GSLVIDRTEA MTVVDVNTGK FTGQGGNLEE TVTKNNLEAA EEIVRQLRLR
DIGGIIVIDF IDMVLESNRD LVLRRMLECL SRDRTKHQVA EVTSLGLVQM TRKRVGQGLL
EAFSHSCEHC NGRGLVVASD PVESKGGGSG NGRKKKKGKG EPDQAAEKPE KAQKPSSEKA
DGADGPEKAE KAGADSDESA EADAARTEEA PEAPQTAEAP ASEPAKKTRK RASRSRKAEA
AAEEQEEITA TEEAPEAAQP EQAAEAPEEA PAKKTRARRT TRRTARGAAD TGEAEAAGGE
SAPAEADATA AETPAEAGDD EAAERPKRRR TRRTRAAATP PTAVDAG