Gene Tmz1t_3789 details

Gene Information

Locus tag: Tmz1t_3789
Symbol:
ID: 7874031
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 4175458
End bp: 4176681
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 66%
IMG OID: 643700731
Product: lipopolysaccharide biosynthesis protein
Protein accession: YP_002890755
Protein GI: 237654441
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID:
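
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (an assumption that matches the stated 1224 bp) and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4175458, 4176681)
print(length)                     # 1224
print(protein_length_aa(length))  # 407
```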


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGAAA GCACGATGCA GAAGGCGGCG ATCGAGAGCC CGCAGGGCCA AATGGTGGTG 
GTGGAGGACG ACGACGAGAT CTCTCTGCTC GATCTCGCGA TCGTGCTCGC CAAGTTCAAG
AAGCTGATCC TCGGCCTGCC GGTGCTGGTG GGGGCCTTGA CGGTGGGCGC GACGCTTCTG
ATGACGCCGA TCTTCACCGC GACCACGGCC ATCCTGCCGC CGCAGCAATC GCAGTCGACC
GCCTCGGCGC TGCTCGGCCA GCTCGGCGGG CTCGCCGGCA TTGCCGGCGC AGCGGCCGGG
ATCAAGAACC CGAGCGACCT CTACGTCGGC ATGCTGAAGA GCCGCACCGT GGCCGATGCG
ATGATCGCGC GCTTCGACCT GGTGAACTAT TACGAGGATG AGTTTGCGGA GGACGCGCGC
AAGTCGCTGG AGAATGTATC CAGCTTCACC GCCGGCAAGG ACGGCATCAT CACCATCTCG
GTCGATGACA AGGACCCCGA GCTCGCCGCG AAGATGGCCA ACGCCTACGT GGAAGAGCTG
AACAGGCTCA CCGAGGTGCT GGCGGTGACC GAGGCCTCGC AGAAGCGCCT CTTCTTCGAA
CGCCAGATGG TCGACGCGCG TGACCGCCTC GTGGCGGCCG AGATCGAGGC GCGCTCGGCG
ATGGAGCGGG GTGGCCTGGC GAGCATCGAC GCCCAGGGCC AGGCGATGAT CGAGGTGACG
GCGCGGCTGC GCGGGCAGAT CTCGGTGAAG GAGGTCGAGA TCGGCGCCAT GCGCGCCTTC
GCCGCCGAGG AAAACCCCCG CCTCAAGGCT GCGCAGCAGG AGCTGCTTGC GCTGCAGACC
GAGCTCGCAC GCATCGAGGG CGCGAGCGCG CTGCGCGACA CCCAGGTCGG TGGCGAATCG
AGCGCCGCCG CGACCAACCT GCAGTTGCTG CGCAACGTGA AGTACTACGA GACGCTGTAC
CAGATGCTGG CGCAGCAGTT CGAGCTCGCC AAGATCGAGG AGGCCAAGGA CAGCGCGCTG
ATCCAGGTGC TGGACACCGC CATCCCGCCC GAGCGCAAGT CCAAGCCCAA GCGCGCCCTG
ATCGTGATCC TCGCCGTGCT CGCCGCCGGC TTCGTCGCCG TGCTGATCGC CTTCATGAAG
GAAGCCGCCC AGCGTGCCGC CGAAGACCCC GAAAGCGCGG AGCGCATGCA GTTGTTCAAG
AAATATATGT CCTGGCGGGC GTGA
 
Protein sequence
MNESTMQKAA IESPQGQMVV VEDDDEISLL DLAIVLAKFK KLILGLPVLV GALTVGATLL 
MTPIFTATTA ILPPQQSQST ASALLGQLGG LAGIAGAAAG IKNPSDLYVG MLKSRTVADA
MIARFDLVNY YEDEFAEDAR KSLENVSSFT AGKDGIITIS VDDKDPELAA KMANAYVEEL
NRLTEVLAVT EASQKRLFFE RQMVDARDRL VAAEIEARSA MERGGLASID AQGQAMIEVT
ARLRGQISVK EVEIGAMRAF AAEENPRLKA AQQELLALQT ELARIEGASA LRDTQVGGES
SAAATNLQLL RNVKYYETLY QMLAQQFELA KIEEAKDSAL IQVLDTAIPP ERKSKPKRAL
IVILAVLAAG FVAVLIAFMK EAAQRAAEDP ESAERMQLFK KYMSWRA
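
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the record's fields relate to one another, the code below strips that layout, computes a GC fraction, and translates a coding sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ only in the permitted start codons). The helper names are hypothetical, and the example input is just the first three blocks of the gene sequence, not the whole gene.

```python
import itertools

# Codon table in NCBI order (bases T, C, A, G). The codon-to-residue
# assignments are identical for translation tables 1 and 11.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the whitespace used for 10-character display blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


fragment = clean_sequence("ATGAACGAAA GCACGATGCA GAAGGCGGCG")
print(translate(fragment))  # MNESTMQKAA
```

Applying `translate` to the full cleaned gene sequence should reproduce the protein sequence listed above, and `gc_fraction` on the same input gives the value that the GC content field reports as a rounded percentage.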