Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1597 |
Symbol | |
ID | 7084806 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1787293 |
End bp | 1788678 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643698617 |
Product | sugar transferase, PEP-CTERM system associated |
Protein accession | YP_002355249 |
Protein GI | 217970015 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03013] sugar transferase, PEP-CTERM system associated [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAAGG TGTTCAGCCA CTACCTTCCC ACGCACACCC TGCAGCAGGT GCTGTTCGAC GCCTTGACGC TTTTTGCGGT GGTGCTTGCA GCGGTTGCGG TACTCGTGAC GCCGACAAGC GCGGTCGACT GGGCAATCTG CATTCCATCG GCGCTTACCT TCGCTGCAAG CATGATCGCC CTCAATGCGG CGATGGGTTT GTACCGACCG GTGTATCAGC GTTCCCATCG GCAGACCTTC GCGCGCGTCG CCCTGTCGCT GACCGTGAGC GTACCAGTGG CCTACCTCGT GTTCGGGCTG TTGCCGTGGT CCGAAATTGC CCCGCACTCC CTGCAGCTCA GCGTATTGCT GTTGCTCTGT TCGCTGTTGC TGGTCCGCGG GCTGGTCAAT CAGCGCCAGG CCTCTGCCCT TTTCGTGCCG CGCGTGCTGA TCGTGGGTAC CGGCCGTGAC GCGCGCATGG TGCAACAGGA CCTGGTCCGT CCGCTGCAGC GCAGCGTGGA AGTGGTCGGC TTTCTTCCGG TGGGGGGAAA CGAACGGGTC GAGGTGGAGC AGCACGCCGT GTTGCCACCG GGCAGCTTGC TGGAGGTGGT GCGCAACCTG CGTGTGGACG AGGTGATCGT CGCGGTGCGC GAACGTCGGG GGGGAGTGCT CTCCTTGCGC GAACTGCTGG ACTGCAAGTT GCTCGGAACG CGCATCCTCG ATCTGTCGAC CTTCTTCGAG CGCGTGCAGG GGCAAGTGAG GCTGGATTCG CTGCGGGCGA GCTGGCTGAT CTATGGTGAC GGCTTCCGCC AAGGCTGGAC GCGAACCTTC GTCAAGCGTT GTTTCGACCT CGTCGTCGGG ACCGCGCTCC TGCTCGTGGC CTTGCCGATC ATGGTGCTCA CTGCGCTGTT GATCCTGCTG GAGGATGGCG CGCCGATCTT TTACTCGCAG GAGCGTGTGG GACGCGGTGG GAAGCCGTTC CGTGTCATCA AGTTCCGCAG CATGCGGCGT GACGCCGAGA AGGACGGCAA GCCGCGGTGG GCGACTTCGA ACGACGATCG AGTGACACGC GTCGGGCGTT TCATCCGCAA GCTGCGTATC GATGAGCTTC CGCAGCTGTT CAACGTGCTC AAGGGCGACA TGAGTCTGGT CGGCCCGCGC CCGGAGCGCC CCTATTTTGT GGACCAGCTT ACCCAGCAGA TTCCGTTCTA TGCGGTGCGG CACTGCGTCA AGCCCGGCGT CACCGGGTGG GCGCAGGTTC GTTACCAGTA CGGGGCTTCG GTCGACGATG CCGCGGAGAA GCTCCAGTAC GATCTGTATT ACGTGAAGAA CCACTCGCTG ATCCTGGATA CCCTTGTCCT CTTCGAGACG GTGAGGGTCG TCCTGACCGG TGAAGGCGCG CACTGA
|
Protein sequence | MLKVFSHYLP THTLQQVLFD ALTLFAVVLA AVAVLVTPTS AVDWAICIPS ALTFAASMIA LNAAMGLYRP VYQRSHRQTF ARVALSLTVS VPVAYLVFGL LPWSEIAPHS LQLSVLLLLC SLLLVRGLVN QRQASALFVP RVLIVGTGRD ARMVQQDLVR PLQRSVEVVG FLPVGGNERV EVEQHAVLPP GSLLEVVRNL RVDEVIVAVR ERRGGVLSLR ELLDCKLLGT RILDLSTFFE RVQGQVRLDS LRASWLIYGD GFRQGWTRTF VKRCFDLVVG TALLLVALPI MVLTALLILL EDGAPIFYSQ ERVGRGGKPF RVIKFRSMRR DAEKDGKPRW ATSNDDRVTR VGRFIRKLRI DELPQLFNVL KGDMSLVGPR PERPYFVDQL TQQIPFYAVR HCVKPGVTGW AQVRYQYGAS VDDAAEKLQY DLYYVKNHSL ILDTLVLFET VRVVLTGEGA H
|
| |