Gene EcSMS35_1331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1331 
SymbolmsbB 
ID6146686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1319911 
End bp1320882 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content51% 
IMG OID641616209 
Productlipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA acyltransferase 
Protein accessionYP_001743389 
Protein GI170680587 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1560] Lauroyl/myristoyl acyltransferase 
TIGRFAM ID[TIGR02208] lipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA acyltransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0569564 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAACGA AAAAAAATAA TAGCGAATAC ATTCCTGAGT TTGATAAATC CTTTCGCCAC 
CCGCGCTACT GGGGAGCATG GCTGGGCGTA GCAGCGATGG CGGGTATTGC TTTAACGCCG
CCAAAGCTCC GTGATCCCAT TCTGGCACGG CTGGGACGTT TTGCCGGACG ACTGGGAAAA
AGCTCACGCC GTCGTGCGTT AATCAATCTG TCGCTCTGCT TTCCAGAACG TAGTGAAGCT
GAACGCGAAG CGATTGTTGA TGAGATGTTT GCCACTGCGC CGCAAGCGAT GGCAATGATG
GCTGAGTTGG CAATACGCGG GCCGGAGAAA ATTCAGCCGC GCGTTGAATG GCAAGGGCTG
GAGATCATCG AAGAGATGCG GCGTAATAAC GAGAAAGTGA TTTTTCTGGT GCCGCACGGT
TGGGCCGTCG ATATTCCTGC CATGCTGATG GCCTCGCAAG GGCAGAAAAT GGCAGCGATG
TTCCATAATC AGGGCAACCC GGTTTTTGAT TATGTCTGGA ACACGGTGCG TCGTCGCTTT
GGTGGTCGTC TGCATGCGAG AAATGATGGT ATTAAACCAT TCATCCAGTC GGTACGTCAG
GGGTACTGGG GATATTATTT ACCCGATCAG GATCATGGCC CAGAGCACAG CGAATTTGTT
GATTTCTTTG CCACCTATAA AGCGACGTTG CCCGCGATTG GTCGTTTGAT GAAAGTGTGC
CGTGCGCGCG TTGTACCGCT GTTTCCGATT TATGATGGCA AGACGCATCG CCTGACGATT
CAGGTGCGCC CACCGATGGA TGATCTGTTA GAGGCGGATG ACCATACGAT TGCGCGGCGG
ATGAATGAAG AAGTCGAGAT TTTTGTTGGT CCGCGACCAG AACAATACAC CTGGATATTA
AAATTGCTGA AAACTCGCAA ACCGGGCGAA ATCCAACCGT ATAAGCGCAA AGATCTTTAT
CCCATCAAAT AA
 
Protein sequence
METKKNNSEY IPEFDKSFRH PRYWGAWLGV AAMAGIALTP PKLRDPILAR LGRFAGRLGK 
SSRRRALINL SLCFPERSEA EREAIVDEMF ATAPQAMAMM AELAIRGPEK IQPRVEWQGL
EIIEEMRRNN EKVIFLVPHG WAVDIPAMLM ASQGQKMAAM FHNQGNPVFD YVWNTVRRRF
GGRLHARNDG IKPFIQSVRQ GYWGYYLPDQ DHGPEHSEFV DFFATYKATL PAIGRLMKVC
RARVVPLFPI YDGKTHRLTI QVRPPMDDLL EADDHTIARR MNEEVEIFVG PRPEQYTWIL
KLLKTRKPGE IQPYKRKDLY PIK