Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1651 |
Symbol | |
ID | 7084070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1852091 |
End bp | 1853059 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643698671 |
Product | Trans-hexaprenyltranstransferase |
Protein accession | YP_002355302 |
Protein GI | 217970068 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0142] Geranylgeranyl pyrophosphate synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00299697 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTCCGTCC AGCAGCTCTT CGCGCCGATC GCCGCCGACA TGCAAGCGGT GGATACGGTC ATCCGCAACC GACTGCACTC GGACGTGGCC CTGATCCGCC GCATCGCGGA ATACATCGTC GGCAGCGGCG GCAAGCGGCT GCGCCCCGCG CTGGTGCTCT ACGCGGCGGG CGCGCTCGAC TATCGCGGCG TGCATCACCA CGAGCTCGCC GCGGTCGTCG AGTTCATCCA CACCGCCAGC CTGCTGCACG ACGACGTCGT CGACGAGTCC GACCTGCGCC GCGGCAAGCG TACCGCCAAC GCCGCCTTCG GCAACGCCTC TGCCGTGCTG GTGGGCGACT TCCTCTATTC GCGTGCCTTC CAGATGATGG TCGGGGTCGA CGAGATGCGC GTGATGCGCG TGCTCGCCGA CGCCACCAAC ATCATCGCCG AGGGCGAGGT GCTGCAGCTG CTCAACTGCC ACAACGCCGA CGTCGTCATC GAAGACTACC TGCGCGTGAT CCGCTACAAG ACCGCCAAGC TCTTCGAGGC CGCGGCGCGC CTGGGCGGCA TCGTGGCCGG TGCGGACGAC GCGCTCGAGC AGCGCCTGGC CGCCTTTGGC ATGCACCTTG GCACCGCCTT CCAACTCATC GACGACGTGC TCGACTACTC CGCCGACGAG GCCGACACCG GCAAGCACCT TGGGGACGAC CTCGCCGAAG GCAAGCCGAC GCTGCCGCTG ATCCACGTGA TGCAGCATGG CACGGCGGAG CAGGGCGCGC TGGTGCGTCA CGCGATCGAA GGCGGCGGCC GCGGCGATTT CGCGGCCGTG CTCGCGGCGA TCCAGTCGAC CGGCGCGCTC GAAGAGACGC GCCGCTACGC GGAAGCGGAG GCGAAGCTCG CAATCGACGC GATTTCGGTG CTTCCCCCTT CCATTTTCAA GGAAGCGCTG CTACAATTAT CGGACTTTGC AGTTCGGCGA AAACACTGA
|
Protein sequence | MSVQQLFAPI AADMQAVDTV IRNRLHSDVA LIRRIAEYIV GSGGKRLRPA LVLYAAGALD YRGVHHHELA AVVEFIHTAS LLHDDVVDES DLRRGKRTAN AAFGNASAVL VGDFLYSRAF QMMVGVDEMR VMRVLADATN IIAEGEVLQL LNCHNADVVI EDYLRVIRYK TAKLFEAAAR LGGIVAGADD ALEQRLAAFG MHLGTAFQLI DDVLDYSADE ADTGKHLGDD LAEGKPTLPL IHVMQHGTAE QGALVRHAIE GGGRGDFAAV LAAIQSTGAL EETRRYAEAE AKLAIDAISV LPPSIFKEAL LQLSDFAVRR KH
|
| |