Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2886 |
Symbol | |
ID | 7873788 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3125843 |
End bp | 3129484 |
Gene Length | 3642 bp |
Protein Length | 1213 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643699807 |
Product | glycosyl transferase family 2 |
Protein accession | YP_002889862 |
Protein GI | 237653548 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.242978 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCACCG TCAGGCACAT TCCGGGGCTA TACGCCAAAC GGCTCCACCC TTATTACATC GTCTCCCCCA GCTATGACCG CAAATCGTCC GGCATCCGCG TCATGCATCA GCTCTGCGCA AGCCTCAACC AGCTGGGAGC CGAGGCCTAT ATCCGGACGC CTCTCCGCAG CCATGCGCTA CACACCCCGC AGCTAACGCC AGACATCCAG AACGAACACC TCGAGGCCGG CCTCAACCCC ATCGCCATTT ACCCGGAAAT CGTCGGCTCC AATCCCTTGG ACTGCAACGT GGTGGTCCGC TACATCCTGA ACCGACCTGG GTTGCTGGGT GAAAGGCCGA ACTACAGCCC GGACGACCTT TACTGGCTCC ATAACGCAGA GGTAGCCGAA GCCGTTCCCC ATCATGAGGG CGTACTGCAC ATGCCGGCAG TGGACACCTC GATTTTCAAC AACGTGGACA ATCCCCACGA CAACAACCGG CAAGGCGCGT GCATCTACTT TGGCCGCTAC GTCGAAGGAG CCAAAGCTTT TCCCGAACTG ACGGACCGGT GCACCGTAAT CACCAAGGAT TTTCCAACCA GTCACGAGGA ACTCGCTGCA CTGTTTCGAC GCAGCACCCA CGTCTACTGC TTCGAGAACA CGTCCATCTC GATGGAGGCT CGTCTTTGCG GATGCCCAGT CGTCCAACTG CCCAGTCCCT ATGCCGATCC GGACAAACCC TTCGCCGACA GCCTCGGACT CTCGGACGCA CTGGTCACCT CCGACGATGC GGAAACACTC GCCGAAGCCC GTCGCCTATT GCCCTCGCTC ACCCAGAAGT ACAGACGTCT CGAGGCAGCG TACTGGGAAG AACTGGAACG CCTGCTAGAC AAGACCCAAG CCGTCGCCGA AAGCCGCCCC AAGACCCACA AGAGCCCGAC CAAGGACAAT ACCCTCCAGC TACATTATTC GAATTGGCGC GCCCGCACGG CATTCGCCGA AATCGATGCG GAGATACTCG CCGAACGCAT GATGCTCAAA TGGACGAAAC GTCCAGGCAT TCACCTGCTC ATGACCCTCC ATGGCAGCGA GGAGGCTCTG CTCGCCGACA CACTCGACAG TCTGGCCACG CAGCTATACC CGGACTGGCT GCTTACGGTC GTGACCGACC TGCCCCAACC CGAAGGCCTG GATGAGTCAG CGAACCTGCA GTGGCTTTCT CTGCAGGACG CCGTTCATAT CAACTACGTG ATCGACGAAG TTGCAGCGGC CTCGCCCGGC TCATGGCTTG CCCGGATCGA TCCGGGCCTG AGCCTGGAGC CCCAGGCTCT GCAAGTATTC GCCGATTACA TCAATTCTCG GCCGATGTGG CATCTGATTT ATTGTGACGA GGACACACGA GAGGCGGATG GGAGCTTGAG CCAACCGCTG TTCAAGCCGG ACTTCGATCT CGACCTTTTC AGGGCACAAA ATTATATCGG CGCATTCGCC CTGATCAACA AGGAGGCATT CCTGGTTGCC GGGCGCTATG GGGAACACCG TGGCGCCGAG ATATACGACC TGAGTCTTCG CATATCAGAC CACGTTGGAC CACGGGCTAT CGGTCATATA AGCCAGATGC TCGTTCATCT CCCCCGCGAG TCGACTCGGG CAATCGCCCC CGAGGCAGAG AAGGCGGCAG TAATCGATCA CCTGGAACGC CAAGGAGTAC CGGCGAAGGT GAGCGACGGG ATCGTTTACG GCACCCGCCG CATCGACTAT CTCTGGCCCG AAAATCCGTT GGTCAGCATC GTCATCCAGA CCAGGGATCG CGAAGAATAT TTGCGCCCCC TGCTCGAAAG CATGCATGAG CTCACGCAAT ACACCAACTA CGAGCTCGTG ATCGTCGACA ACGAAACCAG CGACCCCGAT GCGCTTGAAT GGTTGAAGGC TCTGCCGAGC GATCCGCGCT GGCATGGCCG CATGACGGTA ATTGAAAGAA AGGGCAAGTT CAACTGGTCC GCGGGAGCAA ACATCGGCGC GGGAGCAGCC AGTGGAAAAT ATCTGCTGTT CCTGGACAAC GATATTCACG TTGTTCAGAA GGAGTGGCTG GGCAGGATGC TAGGCATCGC GCAGCGCCCG GAGGTCGGTA TTGTCGGACC ACGCCTTGCT TTCGCCGAGA CCGCGAAGAT TCAGGATGGC GGCTGGATCC TCGGGCTGAA TGGCCTCGCG GCGACCCCTT GGAACTGCGA AATGGAATTG ACTGAGCCCG GGTACATGGG TCGTGCAGTC TGTGATCAGC ACGTGGGGGC GGTGAGCGGC TCCGCATTGC TCGTTCGTGC ATCCCTTTTC GACGACTTAT GCGGCTTCAA CGAAACCGAT TTTCCGCAAT TCAACGGAGC GCTTGACCTC TGCCTTCGTG TTTCGGAAAG AGGCCTCCAG ATCGTGTGGA CACCCTACTC GATGCTTGTA CACTACGGAG GCGTATCGAC CCACGAGCGT CGGAAAGACA TAGCCAGTGC GCTAGAAGAC GTCATTGCAG GCAAGGTCGA GCGTGAAGGC ATTCTGCAAA GATGGCTCCC CGTGCTCGCC AGCGACCCCG GGTACAATAA GAATCTCAGC CTCGTCGAGA CATTCAAGCC GGATCACATC GCGCCGATCG TGTGGGACAC CAATTTCCAC GATCGCTCGC GGATCCTGGG CATCCCATTG AGCGGCGGGG CTGGCGAGTA CCGTCTTCGC GCGCCCTTGC GCATCATCGC GCGTGCCGGG TTGGCGCAGA CCGCGATCTG CGAGCCTCCC GCTCATCTGA GCGTCCGAGT CCTCTCACCT ATCGAGATCG CCCGGACCGC ACCGGATTCG ATCATTTTCC ATCAACCGGT CGACGACCAT CAGACGAACA CACTACAGAG TCTGGCCAAT CTCCTCCCTC GGGTACGTCG GATCATCACA ATCGATGACT TGTTCACCGC GGTCCCCAAG AAGAACTCGT TCTACAAGTT TCAATACAAG GATGCCCGCC CCCGCCTGCG GAGAACGCTG GGCTTGGCCG ACAAACTGGT CGTCAGCACC CAGCCGCTTG CAGACTTCTG CTCGAGCATG ATCGAAGACA TCCGTATCAT GCCCAACTGC CTCGAGAGAG CGATCTGGGA CGGAGCCCGG CCGCCAGCGC TCCCACGACG CAAACCCCGG GTGGGATGGG CGGGTGCCCA GCAGCACCTT GGCGATCTCG AGATCATCTA TCCGGTCGTC GAGGCCCTTT CGGAAGAAGT GGATTGGATC TTCATGGGGA TGTGCCCCGA CCCTCTCCGT CCTTTCGTGC GAGAGTTTCA TGACTTCGTT CGAGACTTCG AAGCCTATCC CGCGGCACTC GCCAAGCTCG ACCTGGATCT CGCGATCGCG CCACTCGAGC TGAATGCCTT CAACGAGGCG AAGAGCAACC TGCGACTGCT CGAGTATGGC TTCATGGGTT GGCCGGTGAT CTGCACCGAC ATCTTCCCTT ATCAGGATGC GCCGGTGACC CGCCTGCCGA ACGAGCCCCA GAGGTGGATT TCCGCGATCC GCGAGCACTT GGCCGAACCC GAAGCCATGC GAGCTGCCGG CAAGGCACTA CAGGAATGGG TCCTCGCCAA CTTCATCCTT GAGGATCACG CGACCGACTG GCTCGAGGCC TACGGTCCCT GA
|
Protein sequence | MPTVRHIPGL YAKRLHPYYI VSPSYDRKSS GIRVMHQLCA SLNQLGAEAY IRTPLRSHAL HTPQLTPDIQ NEHLEAGLNP IAIYPEIVGS NPLDCNVVVR YILNRPGLLG ERPNYSPDDL YWLHNAEVAE AVPHHEGVLH MPAVDTSIFN NVDNPHDNNR QGACIYFGRY VEGAKAFPEL TDRCTVITKD FPTSHEELAA LFRRSTHVYC FENTSISMEA RLCGCPVVQL PSPYADPDKP FADSLGLSDA LVTSDDAETL AEARRLLPSL TQKYRRLEAA YWEELERLLD KTQAVAESRP KTHKSPTKDN TLQLHYSNWR ARTAFAEIDA EILAERMMLK WTKRPGIHLL MTLHGSEEAL LADTLDSLAT QLYPDWLLTV VTDLPQPEGL DESANLQWLS LQDAVHINYV IDEVAAASPG SWLARIDPGL SLEPQALQVF ADYINSRPMW HLIYCDEDTR EADGSLSQPL FKPDFDLDLF RAQNYIGAFA LINKEAFLVA GRYGEHRGAE IYDLSLRISD HVGPRAIGHI SQMLVHLPRE STRAIAPEAE KAAVIDHLER QGVPAKVSDG IVYGTRRIDY LWPENPLVSI VIQTRDREEY LRPLLESMHE LTQYTNYELV IVDNETSDPD ALEWLKALPS DPRWHGRMTV IERKGKFNWS AGANIGAGAA SGKYLLFLDN DIHVVQKEWL GRMLGIAQRP EVGIVGPRLA FAETAKIQDG GWILGLNGLA ATPWNCEMEL TEPGYMGRAV CDQHVGAVSG SALLVRASLF DDLCGFNETD FPQFNGALDL CLRVSERGLQ IVWTPYSMLV HYGGVSTHER RKDIASALED VIAGKVEREG ILQRWLPVLA SDPGYNKNLS LVETFKPDHI APIVWDTNFH DRSRILGIPL SGGAGEYRLR APLRIIARAG LAQTAICEPP AHLSVRVLSP IEIARTAPDS IIFHQPVDDH QTNTLQSLAN LLPRVRRIIT IDDLFTAVPK KNSFYKFQYK DARPRLRRTL GLADKLVVST QPLADFCSSM IEDIRIMPNC LERAIWDGAR PPALPRRKPR VGWAGAQQHL GDLEIIYPVV EALSEEVDWI FMGMCPDPLR PFVREFHDFV RDFEAYPAAL AKLDLDLAIA PLELNAFNEA KSNLRLLEYG FMGWPVICTD IFPYQDAPVT RLPNEPQRWI SAIREHLAEP EAMRAAGKAL QEWVLANFIL EDHATDWLEA YGP
|
| |