Gene ECH74115_2591 details

Gene Information

Locus tag: ECH74115_2591
Symbol: msbB
ID: 6968239
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2447347
End bp: 2448318
Gene Length: 972 bp
Protein Length: 323 aa
Translation table: 11
GC content: 51%
IMG OID: 643386456
Product: lipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA acyltransferase
Protein accession: YP_002270938
Protein GI: 209397852
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1560] Lauroyl/myristoyl acyltransferase
TIGRFAM ID: [TIGR02208] lipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA acyltransferase
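
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, not part of the database record: the function and variable names are illustrative, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
START_BP = 2447347
END_BP = 2448318


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the sketch gives 972 bp and 323 aa, which agrees with the Gene Length and Protein Length fields.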


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00419583
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value: 0.625718
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAACGA AAAAAAATAA TAGCGAATAC ATTCCTGAGT TTGATAAATC CTTTCGCCAC 
CCGCGCTACT GGGGAGCATG GCTGGGCGTA GCAGCGATGG CGGGTATCGC TTTAACGCCG
CCAAAGTTCC GTGATCCCAT TCTGGCACGG CTGGGACGTA TTGCCGGACG ACTGGGAAAA
AGCTCACGCC GTCGTGCGTT AATCAATCTG TCGCTCTGCT TTCCAGAACG TAGTGAAGCT
GAACGCGAAG CGATTGTTGA TGAGATGTTT GCCACCGCGC CGCAAGCGAT GGCAATGATG
GCTGAGTTGG CAATACGCGG GCCGGAGAAA ATTCAGCCGC GCGTTGACTG GCAAGGGCTG
GAGATCATCG AAGAGATGCG GCGTAATAAC GAGAAAGTTA TCTTTCTAGT GCCGCACGGT
TGGGCCGTCG ATATTCCTGC CATGCTGATG GCCTCGCAAG GGCAGAAAAT GGCAGCGATG
TTCCATAATC AGGGCAACCC GGTTTTTGAT TATGTCTGGA ACACGGTGCG TCGTCGCTTT
GGCGGTCGTC TGCATGCGAG AAATGACGGT ATTAAACCAT TCATCCAGTC GGTACGTCAG
GGGTACTGGG GATATTATTT ACCCGATCAG GATCATGGCC CAGAGCACAG CGAATTTGTG
GATTTCTTTG CCACCTATAA AGCGACGTTG CCCGCGATTG GTCGTTTGAT GAAAGTGTGC
CGTGCGCGCG TTGTACCGCT GTTTCCGATT TATGATGGCA AGACGCATCG TCTGACGATT
CAGGTGCGCC CACCGATGGA TGATCTGTTA GAGGCGGATG ATCATACGAT TGCGCGGCGG
ATGAATGAAG AAGTCGAGAT TTTTGTTGGT CCGCGACCAG AACAATACAC CTGGATACTA
AAATTGCTGA AAACTCGCAA ACCGGGCGAA ATCCAGCCGT ATAAGCGCAA AGATCTTTAT
CCCATCAAAT AA
 
Protein sequence
METKKNNSEY IPEFDKSFRH PRYWGAWLGV AAMAGIALTP PKFRDPILAR LGRIAGRLGK 
SSRRRALINL SLCFPERSEA EREAIVDEMF ATAPQAMAMM AELAIRGPEK IQPRVDWQGL
EIIEEMRRNN EKVIFLVPHG WAVDIPAMLM ASQGQKMAAM FHNQGNPVFD YVWNTVRRRF
GGRLHARNDG IKPFIQSVRQ GYWGYYLPDQ DHGPEHSEFV DFFATYKATL PAIGRLMKVC
RARVVPLFPI YDGKTHRLTI QVRPPMDDLL EADDHTIARR MNEEVEIFVG PRPEQYTWIL
KLLKTRKPGE IQPYKRKDLY PIK
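
The protein sequence corresponds to the gene sequence translated with translation table 11, whose codon-to-amino-acid assignments match the standard genetic code (the tables differ in which start codons are permitted). The following Python sketch illustrates this on the first and last blocks of the gene sequence only. The names `CODON_TABLE` and `translate` are illustrative, and the sketch does not handle alternative start codons, which is not an issue here because the gene begins with ATG.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used above) is ignored.
    """
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence, copied from the record above.
first_blocks = "ATGGAAACGA AAAAAAATAA TAGCGAATAC"
print(translate(first_blocks))
```

This prints METKKNNSEY, the first ten residues of the listed protein sequence. Applying the same function to the final line of the gene sequence (CCCATCAAAT AA) gives PIK, the last three residues, with the terminal TAA read as the stop codon.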