Gene Information |
Locus tag | Tmz1t_1218 |
Symbol | |
ID | 7083878 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1349288 |
End bp | 1353313 |
Gene Length | 4026 bp |
Protein Length | 1341 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643698234 |
Product | hypothetical protein |
Protein accession | YP_002354873 |
Protein GI | 217969639 |
COG category | [S] Function unknown |
COG ID | [COG3164] Predicted membrane protein |
TIGRFAM ID | [TIGR02099] conserved hypothetical protein TIGR02099 |
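The coordinate and length fields above are related by simple arithmetic. Assuming Start bp and End bp are inclusive 1-based coordinates, the gene length is End minus Start plus one, and for an unspliced CDS the protein length is the number of codons minus one for the stop codon. A minimal sketch of that consistency check follows; the function names are hypothetical and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1349288, 1353313)
print(length)                  # 4026
print(protein_length(length))  # 1341
```

Both results agree with the Gene Length and Protein Length fields of the record.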
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0577669 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCGGC AGCGCTGCAA CCGCGCTCCC ACACAACGTC CTGCGAGCAT CCGACTGAGC CCCTCCGCCC CGCACGCACC TGCCGAATCC GTACCCCCGT CCGTCGACGG CCTGCCGGCC GCGCGCGGCG GCTGGCGGCG CGCGGCCGGC AGGCTGTCCG ATGCGCTGCT GCTGGTCTGG TGCGTGCTCG CGGTCGGGGT GCTGGTGCTG CGCCATCTCG TGCTGCCGGC AGTGGGCGAA CTGCGCGTGC CGATCGCCGA GTTGCTCGGC GAGCGCCTTG GCGTGGCGGT GGCGATCGAG CGCATCGAGG GCGGCTGGGC GGGCTGGCGA CCGCGCCTGC GCCTGGCCGG GGTGAGCCTG TCCGACGCGC AGGGCCGCCC CGCGCTGACC CTGCCCGAGA TCGACGCCAC CCTGGCGTGG TCCTCGCTGG CGATGCTCGA GCCGCATTTC CACCGCCTCG AGATCCGCGC GCCGGACCTG TTGGTACGCC GCGATCCGGA GGGAGCGCTG CACGTCGCAG GCATCGCGCT TGCACACGGG GACGAGGGCG GCGACGGCGG CCTCGGCTGG CTGCTCGCGC AGCGCCAGAT CCTCATCCAC GGCGCGCGTC TTGCCTGGGC GGACGAAGCG CGCGGTGCGC CGGTGCTCGA GCTCGAAGAG GTGGAGTTCC GCCTCGATCG ACGCACGCGT CCGGCGCGCT TCGCGCTCGA GGCACGGCTG CCGGCCGCCC TCGGCGAGCG TGCGAGCCTG CGCGGCGAGC TGCGCCGCAT CGACCCGCTG CGCCCGCAGC GTTTCGCCGG TCGGTTGTAC GTCGAACTCG GGCGCACGGC GCTGGGCAAC TGGCGGCCCT GGGTCGACCT GCCTTTCGGC CTGGAAGGCG AGGGTGCGCT GCAAGCCTGG CTCGACGCGG ACGGCCGTGG CGGTCATGCG CTGACCGCCG ACCTCGCGCT GGACGGGGTG CGGGCCGTGC TCGGCGAGGA TCTGCCGGTT CTCGAGCTCG CCCGTCTCGA CGGTCGTGTC GAGCTGGCCC GCGGCGAGGA TGGATTCCGC TTCGCCGGCC GCGGGCTCGG GTTCGAAACC GCCCAGGGCC TCGTGCTGCC CCCCACCGAT GTCGAGCTGG GGCTGGATCC GGCGCTCGAT GGCAAGGCGC GCGATCGTGG CACCGGCGAG GAGCAGGCCG GGCGCGGGCC CGCGGCGGCC GGGGGGCGCC TCGTCGCAGA CCGGCTCGAA CTGGCCACGC TCGTCGCGCT CGCCGCCCAC CTGCCGCTCG ATGCCGGGCT GCGCTCGCGC CTGGCGGCGC TTGCGCCGCG CGGTCGGGTC GAGGGCCTGC GCCTCGCCTG GAAGGGGGAG CCCGCCGCGC CCGAGCACTG GGCGCTGACG GCAAGCTTCG CCGGGCTCGG GCTGGCGGCG CAAGACGGTC TGCCCGGCGT CACCGGGCTC TCCGGCAGTG TGGACGGCGA CCGGCAGCGC GGGCGCTTCC GGCTTGCCGC GGCGGACATG CGGATCGACC TGCCGGCGGT CTTCGATCCC GGTCCGCTGG ACTTCTCGCG CCTGCAGGCG CAGGGCGGCT GGCAGCGCGC GGAGGGCCGG CTGGCGATCG TGCTCGACGA GGCCGCGTTC GAGAACGCCG ACGCGGCCGG CACCGCCGCG GGCCGTTACT GGCCGGCGAC GACGGGGGCG GGCGAGATCG ATCTGCAGGC GCGCCTGACC CGCGCCGAGT CGACCGCGGT GTGGCGCTAC GTGCCCAGGG CGGTCAACGA CGACACGCGC GACTGGCTGC GCCGCGGGAT CCGCTCCGCG 
AAGGTGCCCG AGGCGCGGCT GGAATTGCGG GGCGCGCTCG ACGACTTCCC CTTCCGTGCC GGCCAGGGCC GCTTCCTGGT GAGCGTTCAG GTTGCCGACG CCGCGCTCGA CTACGCCCCC GGCTGGCCGG GCATCGAGGG CATCGACGGC GAGGTGCGCT TCGAGGGCCC GGGTCTGCGC ATCCTCGTGC CGCGCGGGCG CATCTTCGGG GTGGATCTGG TGGACGTGGT GGCGGAGGTG CCCGACCTCG ACCAACTTCC GAGCGAGATC ATGACCATCA CCGGTCGTGC GCGCGGTCCG ACCGCCGAGT TCCTGCGCTT CGTCTCGGCG AGCCCGGTGT CGCGCCGCAT CGACGGCTTC ACCGACGGCA TGGTGGCCGA GGGGCGGGGC GAACTGGAAC TCGAGCTGGT GATGCCGCTG CGCGCGACGA TCGACTCCCG CGTGCAGGGC GAATACCGCT TCGCCGACAA CCGCATCGCG CTGCTGGAGG CCTTGCCGCC GCTCGAGGCG GCGCGCGGGC GGCTGCGCTT CACCGCGGAC ACCCTGAGCA TCCCGGAGGC GCAGGCGCGC CTGTTCGGCA ATCCATTGCG CCTCTCCGCG CAGACCGGCG AGGACGGCAC GGTGCGCTTC CGCGCCGCCG GGCGGGCGGA CGCCGCGGCG CTGGCGGCAG CCTGGGAGGT CCCGCTGCTC GACGAACTCG CCGGCACCGC GGAGTGGTCC GCGGACATCC GCGTCGGCCG GCGCGGCACG CGGGTCGAGA TCGCCTCCGA CCTTGCCGGT TTCGCCTGCA GCCTGCCGCC GCCGTTCGGC AAATCGGCCG CCGAGGGCTG GCCGGCGAGG CTGGCTTTCG AGCGTCCGGC CGGCAGCCAG GATGCGCGCA TCACCCTCGC GATCGACGAC CGCCTGCGCG CGGAGCTGCT GCGCTCCGCC GGCGGGGTGG TGCGCGGCGG CGTTGCGCTC GGGCGTGCGA GCGCAGACGC GCCGCCGGGT TCCGAGGCGG GCGTGCGTGT CGCCGCGGTG CTCGATCGCC TCGACCTCGA CGCCTGGCGC GCCGTGCTGG AGCGCGCCGC GCCCCTTCCC GCCGCGGGCG CGGAGGAGGC TGCCGTGGCC GGCGATGGAG CGCCCGCCGC GGTGGCGGAG TTCGCGCTCG CGGCCGACGA GGTGGGTGTG TTCGGCCAGA CCTTCAAGGC GGTGGACCTG CGCGCGCGCG CCGATCCGGG CGGCTGGAAG GCGCGCCTGG AGAGCGATCG CGCGGACGGC GAGTTCGACT GGCGGGCGGC GGGCAAGGGC ACGCTCGCGG CGCGCTTCCG CCACCTGCGT CTGGTGCGCG AGGCCGGCGG CGCGCAGGCT GGCGCAGACG GCGACTCCGC GGGCGACGAG GCGGTGGACG ATCCGCCGCA GCGCCTGCCC GCGCTCGACG TGGTGGCCGA GCGCTTCGAC ATCAACGAGC TCGAACTCGG CCGCCTCGAG GTCCTCGCAC GCAACCGCGG CAACCTCTGG CAGCTCGACC GCTTCGCACT GGTGAGCGCC GACGGACGCC TCGCCGGCAA GGGGCAGTGG CGCGCGGGCG CGCGCCAGCG CACCGAGCTC GACTTCGCGC TCGAGACCGC GTCGATGGGT GGGCTGATCC GGCGCCTGGG CCACCCCGAC GTGGTGCGCG GCGGCTCTGC CAGCCTGGCC GGCCAGCTCG CATGGCGGGG TGCGCCGACC CGGATCGACT ATCCCAGCCT GTCGGGCAGG CTGCGGCTGG AGACCGGCGC GGGCCAGTTC AACCGGCTCG AACCCGGCGT CGGCCGTCTG CTCGGCATCC 
TCAGCCTGCA GTCGCTGCCG CGCCGGATCA CGCTCGACTT CCGCGACGTG TTCTCCGAAG GATTCGCCTT CGATCGCATT TCCGGTAGCA TCGAGCTTGC CCGCGGTGTG CTGCGCAGCG AGGACCTGGA GATCCGCGGA CCCGCGGCGC GCGTCGCGAT GAGCGGCAGC GCCGACGTGG TCTCCGAAAC GCAGGACCTG CGCGTTCTGG TGCAGCCGAC GCTGTCCGAG TCGGTGGCGA TCGGCGCGGC GGCGGGTTTG GTGAATCCGG TCGCCGGCGT GGTGACCTAT CTCGCCCAGA AGGTGCTCAG CGACCCGATC GAGAAGCTCT TCGCCTTCGA GTACACGGTG ACCGGCGCGT GGTCCGACCC CCAGGTCGCC AAGCGCGGCA GGGCCGCTGC GCCGGAGTCG CGCTGA
Protein sequence | MRRQRCNRAP TQRPASIRLS PSAPHAPAES VPPSVDGLPA ARGGWRRAAG RLSDALLLVW CVLAVGVLVL RHLVLPAVGE LRVPIAELLG ERLGVAVAIE RIEGGWAGWR PRLRLAGVSL SDAQGRPALT LPEIDATLAW SSLAMLEPHF HRLEIRAPDL LVRRDPEGAL HVAGIALAHG DEGGDGGLGW LLAQRQILIH GARLAWADEA RGAPVLELEE VEFRLDRRTR PARFALEARL PAALGERASL RGELRRIDPL RPQRFAGRLY VELGRTALGN WRPWVDLPFG LEGEGALQAW LDADGRGGHA LTADLALDGV RAVLGEDLPV LELARLDGRV ELARGEDGFR FAGRGLGFET AQGLVLPPTD VELGLDPALD GKARDRGTGE EQAGRGPAAA GGRLVADRLE LATLVALAAH LPLDAGLRSR LAALAPRGRV EGLRLAWKGE PAAPEHWALT ASFAGLGLAA QDGLPGVTGL SGSVDGDRQR GRFRLAAADM RIDLPAVFDP GPLDFSRLQA QGGWQRAEGR LAIVLDEAAF ENADAAGTAA GRYWPATTGA GEIDLQARLT RAESTAVWRY VPRAVNDDTR DWLRRGIRSA KVPEARLELR GALDDFPFRA GQGRFLVSVQ VADAALDYAP GWPGIEGIDG EVRFEGPGLR ILVPRGRIFG VDLVDVVAEV PDLDQLPSEI MTITGRARGP TAEFLRFVSA SPVSRRIDGF TDGMVAEGRG ELELELVMPL RATIDSRVQG EYRFADNRIA LLEALPPLEA ARGRLRFTAD TLSIPEAQAR LFGNPLRLSA QTGEDGTVRF RAAGRADAAA LAAAWEVPLL DELAGTAEWS ADIRVGRRGT RVEIASDLAG FACSLPPPFG KSAAEGWPAR LAFERPAGSQ DARITLAIDD RLRAELLRSA GGVVRGGVAL GRASADAPPG SEAGVRVAAV LDRLDLDAWR AVLERAAPLP AAGAEEAAVA GDGAPAAVAE FALAADEVGV FGQTFKAVDL RARADPGGWK ARLESDRADG EFDWRAAGKG TLAARFRHLR LVREAGGAQA GADGDSAGDE AVDDPPQRLP ALDVVAERFD INELELGRLE VLARNRGNLW QLDRFALVSA DGRLAGKGQW RAGARQRTEL DFALETASMG GLIRRLGHPD VVRGGSASLA GQLAWRGAPT RIDYPSLSGR LRLETGAGQF NRLEPGVGRL LGILSLQSLP RRITLDFRDV FSEGFAFDRI SGSIELARGV LRSEDLEIRG PAARVAMSGS ADVVSETQDL RVLVQPTLSE SVAIGAAAGL VNPVAGVVTY LAQKVLSDPI EKLFAFEYTV TGAWSDPQVA KRGRAAAPES R
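The Gene sequence, Protein sequence, and GC content fields can be cross-checked with a short script. The sketch below is illustrative only: it strips the whitespace between the 10-base groups, computes GC percentage, and translates codons until the first stop. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), so alternative start codons such as GTG or TTG are not handled here. The helper names are hypothetical, and ambiguous bases such as N would raise a KeyError.

```python
from itertools import product

# Codon table in NCBI order (first, second, third base each cycling through TCAG).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases and last 18 bases of the Gene sequence field.
print(translate("ATGCGCCGGC AGCGCTGCAA C"))  # MRRQRCN
print(translate("GCGCCGGAGTCGCGCTGA"))       # APESR
```

The two fragments reproduce the first seven and last five residues of the Protein sequence field. Passing the full Gene sequence value to `translate` and `gc_percent` allows the same comparison against the complete protein and the reported GC content.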