Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3381 |
Symbol | |
ID | 9247246 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4039224 |
End bp | 4042247 |
Gene Length | 3024 bp |
Protein Length | 1007 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | ribonuclease, Rne/Rng family |
Protein accession | YP_003681292 |
Protein GI | 297562318 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGACA ACGAGCCCAA CAACGGTGCC GACGGCACCG CGGGTACAAC TGAAACAACC AACAACACGG CGGCGGCGCC CTCCACGGGG GGCGGTGAGG TGCGCGTGAC CACGCCCGCG CCCGCCAAGG GCGTCCGCCG CGCCGCCGGA CCACCCCCCG AACCCGACGA CGTCTTCACC CCCCCCTCCG CGCCCGTCAC CTCCCTCGCC GGTTCCGGCG AGGGGGGCAC GGTCAAGACC TCGGTGGCCA CCACCTCGCG CAGGCGCACC GTGCGCGCCG CGGGCCGTCC CGACGAGGAC GCCGCCCCGC GCCGTTCGCG CGCCTCCGCT GACGACGAGG GCACCGCTCC GCGCCGGGCC CATGCCTCCG CTGACGAGGA GGCCGCTCCG CGCCGTTCGC GCGCCTCCTC CGCCTCGGAG GACACCGCCC CCTCCACGGC CAGGAGCCGC GGCCGTTCGC GCGCCCGCTC CGCCCCGGCC GCGGAGACCG CCGGGCCCGC GCCCTCCGAG GCGGCCGACG AGCCCGCCAA GGACGAGGAG ACCGTGGAGG CGGGCGGCGG CAACGTCTTC CAGGCCCCCT CCCTGCTCTT CCAGCCCCCG GTGGCCAGCG CCGCCCCCGT GCCCGCACGC AGCACCGCCC CGGCGGAGGC CGAGGAGGAG TCCGAGGAGG CCGAGGAGGA GACCGCCCAG GCCGCCGTCG CGGAGGAGAC CGGCGACGGG GACGACGACC GCCCCAGCCG CCGTCGCCGC CGTCGCGGAG GCCGGGGACG CGGCCGCTCC CGCGCGGACG AGAACGAGTC CGCCGAGGCC CCCGAGTCCG CCGAGGACAA GGCGGAGCAG CCCGGCGACA GGGACCGGGG CGAGGACGAC GGCGCGGCCG ACGAGTCCGC CGAGTCCGAC GACCGCAGCA GCCGCCGTCG GCGCCGCCGT CGCCGCCGCT CCGGCGGGGG AGAGGGCCCC GAGGCCACCC CGGACGACCC GCCCAACACC GTGGTCCGGG TGCGCGAACC CCGCCAGGAG AAGCCGATCG AGGACGAGGT CCAGGCGGTC CGCGGCTCCA CCCGGCTGGA GGCCAAGAAG CAGCGCCGCC GCGAGGGCCG TGAGCAGGGC CGCCGCCGCG CCCCCATCGT CACCGAGTCG GAGTTCCTGG CGCGGCGGGA GTCGGTCAAG CGGGACCTGG TGGTGCGCCG CGTCGAGGAC CGCACCCAGA TCGCCGTCCT CGAGGACGAC ATCCTCGTCG AGCACTACGT CGACCGGGCC ACGCACCGCT CCTACGTGGG CAACGTGTAC CTGGGCCGGG TGCAGAACGT GCTGCCCTCG ATGGAGGCCG CGTTCGTCGA CATCGGCAAG GGCCGCAACG CCGTCCTGTA CGCGGGCGAG GTCAACTGGG ACTCCTTCGG CCTGGACGGC CAGCCCAAGC GCATCGAGTC GGTGCTCAAG TCCGGCCAGT CGGTCCTGGT GCAGGTCACC AAGGACCCCG TCGGCCACAA GGGCGCCCGC CTGACCAGCC AGATCAGCCT GCCCGGCCGC TACCTCGTGT ACGTGCCCGG CGGTTCGATG ACCGGCATCA GCCGCAAGCT CCCCGACAAG GAGCGCGCGC GCCTCAAGCA GATCCTCAAG AAGGTCATGC CCTCCGGCGC GGGCGTCATC GTGCGCACGG CCGCCGAGGG GGCCAGCGAG GAGGAGCTGG AGCGCGACAT CACCCGTCTC GCCAAGCAGT GGGACTCCAT CAAGCGCAAG TCCAAGTCGG CCAACGCCCC CTCGCTGCTC AACAGCGAGC CCGACCTGAC GGTGCGGGTC GTGCGCGACG TGTTCAACGA GGACTTCTCC AGCCTCGTCG TGTCCGGTGA GGAGGCCTGG ACCACCGTCC GCGAGTACGT GGACTACGTG GCCCCCAACC TCTCCGAGCG CCTCTCGCAC TGGAACGAGG ACCGCGACGT CTTCGCCGCC TACCGCGTCG ACGAGCAGAT CAACAAGGCC CTGGAGCGCA AGGTCTGGCT GCCCAGCGGC GGATCGCTGG TCATCGACCG CACCGAGGCC ATGACCGTGG TCGACGTCAA CACCGGCAAG TTCACCGGTC AGGGCGGCAA CCTGGAGGAG ACGGTCACCA AGAACAACCT GGAGGCGGCC GAGGAGATCG TCCGCCAGCT CCGGTTGCGC GACATCGGCG GCATCATCGT CATCGACTTC ATCGACATGG TCCTGGAGTC CAACCGCGAC CTGGTGCTGC GGCGCATGCT GGAGTGCCTC TCGCGCGACC GCACCAAGCA CCAGGTGGCC GAGGTGACCT CGCTGGGGCT GGTCCAGATG ACCCGCAAGC GGGTCGGCCA GGGCCTGCTG GAGGCCTTCT CGCACAGCTG CGAGCACTGC AACGGCCGCG GCCTGGTGGT CGCCAGCGAC CCGGTCGAGA GCAAGGGCGG CGGCAGCGGC AACGGCCGCA AGAAGAAGAA GGGCAAGGGC GAGCCCGACC AGGCGGCCGA GAAGCCGGAG AAGGCCCAGA AGCCGTCCTC CGAGAAGGCC GACGGGGCCG ACGGGCCGGA GAAGGCCGAG AAGGCGGGCG CTGACTCCGA CGAGTCCGCC GAGGCGGACG CGGCGCGGAC CGAGGAGGCA CCCGAGGCCC CGCAGACGGC CGAGGCCCCG GCCTCCGAGC CCGCCAAGAA GACCCGTAAG CGGGCCTCCC GCTCCCGCAA GGCCGAGGCC GCCGCGGAGG AGCAGGAGGA GATCACCGCC ACGGAGGAGG CTCCTGAGGC CGCGCAGCCC GAGCAGGCCG CGGAGGCCCC CGAGGAGGCC CCGGCCAAGA AGACCCGCGC CCGGCGCACC ACCCGTCGTA CCGCCCGCGG GGCGGCCGAC ACCGGTGAGG CCGAGGCCGC GGGCGGTGAG AGCGCCCCCG CCGAGGCGGA CGCGACCGCC GCCGAGACGC CCGCCGAGGC CGGTGACGAC GAGGCCGCCG AGCGGCCCAA GCGTCGCCGC ACCCGGCGCA CCAGGGCGGC GGCCACCCCG CCGACCGCCG TGGACGCGGG CTGA
|
Protein sequence | MLDNEPNNGA DGTAGTTETT NNTAAAPSTG GGEVRVTTPA PAKGVRRAAG PPPEPDDVFT PPSAPVTSLA GSGEGGTVKT SVATTSRRRT VRAAGRPDED AAPRRSRASA DDEGTAPRRA HASADEEAAP RRSRASSASE DTAPSTARSR GRSRARSAPA AETAGPAPSE AADEPAKDEE TVEAGGGNVF QAPSLLFQPP VASAAPVPAR STAPAEAEEE SEEAEEETAQ AAVAEETGDG DDDRPSRRRR RRGGRGRGRS RADENESAEA PESAEDKAEQ PGDRDRGEDD GAADESAESD DRSSRRRRRR RRRSGGGEGP EATPDDPPNT VVRVREPRQE KPIEDEVQAV RGSTRLEAKK QRRREGREQG RRRAPIVTES EFLARRESVK RDLVVRRVED RTQIAVLEDD ILVEHYVDRA THRSYVGNVY LGRVQNVLPS MEAAFVDIGK GRNAVLYAGE VNWDSFGLDG QPKRIESVLK SGQSVLVQVT KDPVGHKGAR LTSQISLPGR YLVYVPGGSM TGISRKLPDK ERARLKQILK KVMPSGAGVI VRTAAEGASE EELERDITRL AKQWDSIKRK SKSANAPSLL NSEPDLTVRV VRDVFNEDFS SLVVSGEEAW TTVREYVDYV APNLSERLSH WNEDRDVFAA YRVDEQINKA LERKVWLPSG GSLVIDRTEA MTVVDVNTGK FTGQGGNLEE TVTKNNLEAA EEIVRQLRLR DIGGIIVIDF IDMVLESNRD LVLRRMLECL SRDRTKHQVA EVTSLGLVQM TRKRVGQGLL EAFSHSCEHC NGRGLVVASD PVESKGGGSG NGRKKKKGKG EPDQAAEKPE KAQKPSSEKA DGADGPEKAE KAGADSDESA EADAARTEEA PEAPQTAEAP ASEPAKKTRK RASRSRKAEA AAEEQEEITA TEEAPEAAQP EQAAEAPEEA PAKKTRARRT TRRTARGAAD TGEAEAAGGE SAPAEADATA AETPAEAGDD EAAERPKRRR TRRTRAAATP PTAVDAG
|
| |