Gene Tmz1t_3246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3246 
Symbol 
ID7874467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3550640 
End bp3551716 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content68% 
IMG OID643700180 
Productlipopolysaccharide biosynthesis protein-like protein 
Protein accessionYP_002890218 
Protein GI237653904 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3754] Lipopolysaccharide biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAACG CCCGCGCGAT CGCCTTCCAC CTGCCCCAAT ACCACCCCAC CCCCGAGAAC 
GACGAGTGGT GGGGCAAGGG CTTCACCGAG TGGACCAACA CCACCCGGGC GAAGCCCCTC
TACCCGGGCC ACTATCAGCC GCACCTGCCG GCCGACCTGG GTTACTACGA CCTGCGCCTG
CCCGAGGCCC GCCACGCGCA GGCCGAGCTC GCCCGCGCAT ACGGCATCGA GGGCTTCTGC
TACTACCACT ACTGGTTCGG CGGACGGCGC ATCCTCGAGC GCCCGGTCAA CGAGATCCTC
GCCAGCGGCG AGCCCGACCT GCCGTTCTGC CTGTGCTGGG CCAACCACAG CTGGAGCAAC
ATCTGGCAGG GCGTGGCCGA TCGCATGCTG ATCGAGCAGA CCTACCCGGG CATGGACGAC
CACCGCCGCC ACTTCGAATG GCTGCTCGCC GCATTCCACG ATCCGCGCTA CATCCGCGTC
GACGGCAAGC CGATGTTCCT GATCTACAAC CCGCCCGACA TTCCCGACGT GGCGCGGGTG
ATGGACTACT GGCGCGAGCT CGCTGCGCAG GCGGGGCTGC CGGGGCTGCA CCTGGTCGCG
GTGAATTACC TCGGCGCTGC GGTCGATCCG GCCGACTTCG GCATGGACGC CGCCACCTGG
CAGCCCCTGC CGCCCAAGAG CGGCCACATC CCGTGGCGCT ACCCGGCCTT GAAGGCGCGC
ATGCGCATGG CGAAGGGAAA ATACAAGCTG ACCGTGCTGG ACTACGCCCG CATCATGAGC
GGTCTGACCC GCGCCAGCCC GCCGCAATTC ACCGAGTACC CCACGGTGCT CCCGAACTGG
GACAACACCC CGCGCTCGGG TCTCAACGGA CTGGTGCTGC ACGGCTCGAC CCCGGAGCTC
TTCAAGACCG TGCTGCGCCG CGGCGTCGAC CTGGTGCAGG GCTACCCGGC CGAGCAGCGC
ATCGTCTTCA TCAAGGCCTG GAACGAGTGG GCCGAGGGCA ACTACCTCGA ACCCGACCAG
CGCTTCGGCC ACGGCTACCT GCGCGCGGTG CGCGAGGTCC TGCAGGAGAC GCGCTGA
 
Protein sequence
MSNARAIAFH LPQYHPTPEN DEWWGKGFTE WTNTTRAKPL YPGHYQPHLP ADLGYYDLRL 
PEARHAQAEL ARAYGIEGFC YYHYWFGGRR ILERPVNEIL ASGEPDLPFC LCWANHSWSN
IWQGVADRML IEQTYPGMDD HRRHFEWLLA AFHDPRYIRV DGKPMFLIYN PPDIPDVARV
MDYWRELAAQ AGLPGLHLVA VNYLGAAVDP ADFGMDAATW QPLPPKSGHI PWRYPALKAR
MRMAKGKYKL TVLDYARIMS GLTRASPPQF TEYPTVLPNW DNTPRSGLNG LVLHGSTPEL
FKTVLRRGVD LVQGYPAEQR IVFIKAWNEW AEGNYLEPDQ RFGHGYLRAV REVLQETR