Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3378 |
Symbol | |
ID | 7873869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3694096 |
End bp | 3697263 |
Gene Length | 3168 bp |
Protein Length | 1055 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643700315 |
Product | ribonuclease R |
Protein accession | YP_002890349 |
Protein GI | 237654035 |
COG category | [K] Transcription |
COG ID | [COG0557] Exoribonuclease R |
TIGRFAM ID | [TIGR00358] VacB and RNase II family 3'-5' exoribonucleases [TIGR02063] ribonuclease R |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCGCA AAACGAAAAA AACCGATCAG CAGCGCTCCG GCGCCCCCTC CCGCAGAACG CCGGCCGCGC CGCAGCAGGG CGAGCTTGCG CCGCGCGCCA ACCCGCCCGG GCGCAAGGCC GGGCGCGGCA CACTTGCCGT GCACGCCGCG CCCGAACCCG CCGCCCAAGG CAGCACTCGG CACACCGGCT CCCCGGCGCG CGGCCCTGGC CGCAGTCGGG CCGCGCGCAA GGCGGCAGCC GTGCAGGCCA GCGCCGTCCG TCTCGCCGAC CCCTTCTACG AGCGTGAGAC GCAGCAGTAC GACAACCCGC TCCCCAGCCG CGAATACGTG CTGCAGATCC TCGCCGAGCG CGGCGTGCCG ATGCCCCTCG CCGAGCTCGC CGGCGCGCTC GACATCGCCC CGCACGAGCT CGACTTCTTC GACCGTCGCC TGCGCGCGAT GGAGCGCGAC GGCCAGGTCG TGCGCAACCG CCGCGACGCC TACCTGCTGC CGGCCAAGGC CGACCTCATC AAGGGGCGCG TCGAGGGCCA CCCCGACGGC TTCGGCTTCC TGCGCCGCGA CGACGGCGAG CCCGACGTCT TTCTCGGTCC CAAGGAGATG CGCGAGGTGC TGCACGGCGA CCGCGTCATG GTGCGCATCT CCGGCACGGA CCGCCGCGGC CGTCCCGAGG GCAAGCTGGT CGAGGTGCTC GAGCGCGCCA ACTCCCGCGT CGTCGGTCGT GTCATCAACG AGCACGGCGT GATGATCGTC GTGCCCGAGA ACCGCCGCCT GGCGCAGGAC ATCCTGGTCG CCCCGGGCGG GCGCAAGAAG CCCGAGCCAG GCCAGATCGT CACCGTCGAG CTGGTCGAGC AGCCCACCAA GTTCGCCCAG CCGATCGGGC GCATCGTCGA GGTGCTGGGC AACTACGCCG ACCCCGGCAT GGAGATCGAG ATCGCGCTGC GCAAGCACGA CCTGCCCTTC GAGTTCTCCT CCGAGGCCAA GGCGCAGACG CGCAAGCTGC CCGACGTGGT GCGCAAGAAG GACTGGGCCG GCCGCGAGGA CCTCACCAAG TTGCCGTTGG TCACCATCGA CGGCGAGACC GCCAAGGACT TCGACGACGC GGTGTATTGC GAGCGCCAGG GCAAGGGTTA CCGCCTCATC GTCGCCATCG CCGACGTCTC GCACTACGTC GATGCCGCCA GCGCGCTCGA CAAGGATGCC TTCGATCGCG GCAACTCGGT GTACTTCCCG CGCCGCGTCA TCCCGATGCT GCCCGAGAAG CTCTCCAACG GCCTGTGCTC GCTCAACCCG CAGGTCGAGC GCCTGGCCAT GGTCGCCGAC ATGAACATCG CGGCCACCGG CGAGATCAGG AACTACCGCT TCTACCCCGC GGTGATCTGG TCGCACGCGC GCCTGACCTA CACGAAGGTT GCGGCGGCCC TGTACGACAA GGACCCGGCC GTGCGCGCCG AACTCGCCGC GCTGCTCCCC CACCTCGAGC AGCTCGACCA GCTCTTCCGC GTGCTGCTCA AGGCGCGCGC GAAGCGCGGC GCGATTGACT TCGAGACCAC CGAGACGCGC ATGATCTTCG ACGATAACGG CAAGATCGCG CAGATCGTCC CCGAGGTGCG CAACGATGCC CACCGCCTGA TCGAGGAGTG CATGCTCGCC GCCAACGTGT GCGCCTCCGA CTTCCTCGCC ACGCGCGAGC ACCCGGCGTT GTACCGCGTC CACGACTCGC CCTCCGAGGA CAAGCTCGCC AAGCTGCGCG AGTTCCTCAA GGAGTTCGGC CTCGGCCTCG GCGGCGGCGA CGAGCCGCGC GCCGCCGACT TCGCCAGGCT GCTCGAGCAG GTCAAGGACC GCCCCGACGC CCAATTGTTG CAGACCGTCA TGCTGCGTTC GCTCAAGCAG GCGATGTACA GCCCGGACAA CGTCGGCCAC TTCGGCCTCG CCTACGAGTC CTATACCCAC TTCACCTCAC CGATCCGGCG CTATCCCGAC CTGCTGATCC ACCGCGGCAT CAAGGCCGCG CTCGCCGGCG AGCAGTACCG CCCCGGCGAC TGGGAGCAGA TCGGCCTGCA CTGCTCGATG ACCGAGCGCC GCGCCGACGA CGCCACCCGC GACGTGGTCG CCTTCCTCAA GTGCTACTTC ATGCAGGACC GCGTCGGCGA GGAGTTCGTT GGCAGCGTGT CGGCGGTGGT GCCTTTCGGT CTCTTCGTGG CGCTCGACGA CATCTTCATC GAGGGGCTGC TGCACATCTC CGACCTCGGC AGCGATTACT TCCACTACGA CGAGACCCGC CATGCGCTCA TGGGTGAGCG CACCGGCAAG CAGTTCCGCC TGTCCGACCG GGTCAAGGTG CAGCTCGTGC GGGTGGATAT GGCCACCAAC AAGATCGACT TCCGCCTCAT CGAGGGGCCG CTCCCGGTCG AGGCCAAAGC CCCGCCGAAG GTCGCCGAAG TGGTGGCCGC GGAGGTCGTG CCCGTGGAAG GGAAGAGGGC GCGCAAGCCG CGCAGCAAGA AGGCTGAGGT GGAGGAGAGG GTCGAGGCCA CACCTGCGCC GGTGGTCGAG GCTGTGCCTG CCGCAGCGGA GCCCGCCGCG CCCAAGGTGC GCGGCAAGCG GACGAAGAAG GCCGCACCCG AAGCGCTTGC GGAGGTGCCC GTCGCGCCGG CGGTCGAGTC CACGGCTGCC GCAGCGGAGC CCGCCGCGCC CAAGGCGCGC GGCAAGCGGG CGAAGAAGGC TGCGCCCGAA GTGGTCGCGG AGGTGCTCGT CGCACCGGTG GGCGAGTCTG TGACTGCTGC CGAGGCGTCG GCGACGCCCA AGGCGCGCGG CAAGCGCGCG AAGAAGGTGG TCGCCGGGCA GGTGGCCGAG GTCGTGGCTG CGCCGGCGAT CGAGCCTGTG CCCGCCGCCG CGGAACCTGT CGCGCCCAAG GCGCGCGGCA AGCGTGCGAA GAAGGCTGCG ATCGCAACGG TCGCCGAGGT CGTCGTCGCA CCGGTGCTCG AGGTTACGCC GGATGCGGCT CCGCCCGCCG CGCCCAAGAC ACGCGGCAAG CGCGCGAAGA AGGCCGTCGC CGGCACCGCG ACCGAAGTCC CCGCCGCGTC CGCCGCCGAG GCGCTAGAAT CCGCGCCCCC GGTACCGGCC CGACGTGCCG GCCGCAAGAC CACCACAAAG GGCTCCGACC GTGGCTAA
|
Protein sequence | MTRKTKKTDQ QRSGAPSRRT PAAPQQGELA PRANPPGRKA GRGTLAVHAA PEPAAQGSTR HTGSPARGPG RSRAARKAAA VQASAVRLAD PFYERETQQY DNPLPSREYV LQILAERGVP MPLAELAGAL DIAPHELDFF DRRLRAMERD GQVVRNRRDA YLLPAKADLI KGRVEGHPDG FGFLRRDDGE PDVFLGPKEM REVLHGDRVM VRISGTDRRG RPEGKLVEVL ERANSRVVGR VINEHGVMIV VPENRRLAQD ILVAPGGRKK PEPGQIVTVE LVEQPTKFAQ PIGRIVEVLG NYADPGMEIE IALRKHDLPF EFSSEAKAQT RKLPDVVRKK DWAGREDLTK LPLVTIDGET AKDFDDAVYC ERQGKGYRLI VAIADVSHYV DAASALDKDA FDRGNSVYFP RRVIPMLPEK LSNGLCSLNP QVERLAMVAD MNIAATGEIR NYRFYPAVIW SHARLTYTKV AAALYDKDPA VRAELAALLP HLEQLDQLFR VLLKARAKRG AIDFETTETR MIFDDNGKIA QIVPEVRNDA HRLIEECMLA ANVCASDFLA TREHPALYRV HDSPSEDKLA KLREFLKEFG LGLGGGDEPR AADFARLLEQ VKDRPDAQLL QTVMLRSLKQ AMYSPDNVGH FGLAYESYTH FTSPIRRYPD LLIHRGIKAA LAGEQYRPGD WEQIGLHCSM TERRADDATR DVVAFLKCYF MQDRVGEEFV GSVSAVVPFG LFVALDDIFI EGLLHISDLG SDYFHYDETR HALMGERTGK QFRLSDRVKV QLVRVDMATN KIDFRLIEGP LPVEAKAPPK VAEVVAAEVV PVEGKRARKP RSKKAEVEER VEATPAPVVE AVPAAAEPAA PKVRGKRTKK AAPEALAEVP VAPAVESTAA AAEPAAPKAR GKRAKKAAPE VVAEVLVAPV GESVTAAEAS ATPKARGKRA KKVVAGQVAE VVAAPAIEPV PAAAEPVAPK ARGKRAKKAA IATVAEVVVA PVLEVTPDAA PPAAPKTRGK RAKKAVAGTA TEVPAASAAE ALESAPPVPA RRAGRKTTTK GSDRG
|
| |