Gene Information |
Locus tag | Tmz1t_3789 |
Symbol | |
ID | 7874031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4175458 |
End bp | 4176681 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643700731 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_002890755 |
Protein GI | 237654441 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
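The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends with a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4175458
end_bp = 4176681

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS with one terminal stop codon that is not
# counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1224 407
```

Both results agree with the Gene Length (1224 bp) and Protein Length (407 aa) fields.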
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGAAA GCACGATGCA GAAGGCGGCG ATCGAGAGCC CGCAGGGCCA AATGGTGGTG GTGGAGGACG ACGACGAGAT CTCTCTGCTC GATCTCGCGA TCGTGCTCGC CAAGTTCAAG AAGCTGATCC TCGGCCTGCC GGTGCTGGTG GGGGCCTTGA CGGTGGGCGC GACGCTTCTG ATGACGCCGA TCTTCACCGC GACCACGGCC ATCCTGCCGC CGCAGCAATC GCAGTCGACC GCCTCGGCGC TGCTCGGCCA GCTCGGCGGG CTCGCCGGCA TTGCCGGCGC AGCGGCCGGG ATCAAGAACC CGAGCGACCT CTACGTCGGC ATGCTGAAGA GCCGCACCGT GGCCGATGCG ATGATCGCGC GCTTCGACCT GGTGAACTAT TACGAGGATG AGTTTGCGGA GGACGCGCGC AAGTCGCTGG AGAATGTATC CAGCTTCACC GCCGGCAAGG ACGGCATCAT CACCATCTCG GTCGATGACA AGGACCCCGA GCTCGCCGCG AAGATGGCCA ACGCCTACGT GGAAGAGCTG AACAGGCTCA CCGAGGTGCT GGCGGTGACC GAGGCCTCGC AGAAGCGCCT CTTCTTCGAA CGCCAGATGG TCGACGCGCG TGACCGCCTC GTGGCGGCCG AGATCGAGGC GCGCTCGGCG ATGGAGCGGG GTGGCCTGGC GAGCATCGAC GCCCAGGGCC AGGCGATGAT CGAGGTGACG GCGCGGCTGC GCGGGCAGAT CTCGGTGAAG GAGGTCGAGA TCGGCGCCAT GCGCGCCTTC GCCGCCGAGG AAAACCCCCG CCTCAAGGCT GCGCAGCAGG AGCTGCTTGC GCTGCAGACC GAGCTCGCAC GCATCGAGGG CGCGAGCGCG CTGCGCGACA CCCAGGTCGG TGGCGAATCG AGCGCCGCCG CGACCAACCT GCAGTTGCTG CGCAACGTGA AGTACTACGA GACGCTGTAC CAGATGCTGG CGCAGCAGTT CGAGCTCGCC AAGATCGAGG AGGCCAAGGA CAGCGCGCTG ATCCAGGTGC TGGACACCGC CATCCCGCCC GAGCGCAAGT CCAAGCCCAA GCGCGCCCTG ATCGTGATCC TCGCCGTGCT CGCCGCCGGC TTCGTCGCCG TGCTGATCGC CTTCATGAAG GAAGCCGCCC AGCGTGCCGC CGAAGACCCC GAAAGCGCGG AGCGCATGCA GTTGTTCAAG AAATATATGT CCTGGCGGGC GTGA
|
Protein sequence | MNESTMQKAA IESPQGQMVV VEDDDEISLL DLAIVLAKFK KLILGLPVLV GALTVGATLL MTPIFTATTA ILPPQQSQST ASALLGQLGG LAGIAGAAAG IKNPSDLYVG MLKSRTVADA MIARFDLVNY YEDEFAEDAR KSLENVSSFT AGKDGIITIS VDDKDPELAA KMANAYVEEL NRLTEVLAVT EASQKRLFFE RQMVDARDRL VAAEIEARSA MERGGLASID AQGQAMIEVT ARLRGQISVK EVEIGAMRAF AAEENPRLKA AQQELLALQT ELARIEGASA LRDTQVGGES SAAATNLQLL RNVKYYETLY QMLAQQFELA KIEEAKDSAL IQVLDTAIPP ERKSKPKRAL IVILAVLAAG FVAVLIAFMK EAAQRAAEDP ESAERMQLFK KYMSWRA
|
| |
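The GC content and protein sequence fields can be recomputed from the gene sequence. The sketch below is one way to do that in plain Python, under stated assumptions: the helper names (clean, gc_fraction, translate) are hypothetical, the sequence is taken as already written in coding orientation, and the codon table uses the standard amino acid assignments, which translation table 11 shares (table 11 differs in its set of permitted start codons, which does not matter here because the gene starts with ATG).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Translation table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGAACGAAA GCACGATGCA GAAGGCGGCG"
print(translate(prefix))  # MNESTMQKAA
```

Passing the full Gene sequence field to translate and gc_fraction allows a comparison against the Protein sequence and GC content fields.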