Gene EcHS_A1948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1948 
SymbolmsbB 
ID5592955 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1958199 
End bp1959170 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content52% 
IMG OID640921093 
Productlipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA acyltransferase 
Protein accessionYP_001458642 
Protein GI157161324 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1560] Lauroyl/myristoyl acyltransferase 
TIGRFAM ID[TIGR02208] lipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA acyltransferase 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value0.301276 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAACGA AAAAAAATAA TAGCGAATAC ATTCCTGAGT TTGATAAATC CTTTCGCCAC 
CCGCGCTACT GGGGAGCATG GCTGGGCGTA GCAGCGATGG CGGGTATCGC TTTAACGCCG
CCAAAGTTCC GTGATCCCAT TCTGGCACGG CTGGGACGTT TTGCCGGACG ACTGGGAAAA
AGCTCACGCC GTCGTGCGTT AATCAATCTG TCGCTCTGCT TTCCAGAACG TAGTGAAGCT
GAACGCGAAG CGATTGTTGA TGAGATGTTT GCCACCGCGC CGCAAGCGAT GGCAATGATG
GCTGAGTTGG CAATACGCGG GCCGGAGAAA ATTCAGCCGC GCGTTGACTG GCAAGGGCTG
GAGATCATCG AAGAGATGCG GCGTAATAAC GAGAAAGTTA TCTTTCTGGT GCCGCACGGT
TGGGCCGTCG ATATTCCTGC CATGCTGATG GCCTCGCAAG GGCAGAAAAT GGCAGCGATG
TTCCATAATC AGGGCAACCC GGTTTTTGAT TATGTCTGGA ACACGGTGCG TCGTCGCTTT
GGCGGTCGTC TGCATGCGAG AAATGACGGT ATTAAACCAT TCATCCAGTC GGTACGTCAG
GGGTACTGGG GATATTATTT ACCCGATCAG GATCATGGCC CAGAGCACAG CGAATTTGTG
GATTTCTTTG CCACCTATAA AGCGACGTTG CCCGCGATTG GTCGTTTGAT GAAAGTGTGC
CGTGCGCGCG TTGTACCGCT GTTTCCGATT TATGATGGCA AGACGCATCG TCTGACGATT
CAGGTGCGCC CACCGATGGA TGATCTGTTA GAGGCGGATG ATCATACGAT TGCGCGGCGG
ATGAATGAAG AAGTCGAGAT TTTTGTTGGT CCGCGACCAG AACAATACAC CTGGATACTA
AAATTGCTGA AAACTCGCAA ACCGGGCGAA ATCCAGCCGT ATAAGCGCAA AGATCTTTAT
CCCATCAAAT AA
 
Protein sequence
METKKNNSEY IPEFDKSFRH PRYWGAWLGV AAMAGIALTP PKFRDPILAR LGRFAGRLGK 
SSRRRALINL SLCFPERSEA EREAIVDEMF ATAPQAMAMM AELAIRGPEK IQPRVDWQGL
EIIEEMRRNN EKVIFLVPHG WAVDIPAMLM ASQGQKMAAM FHNQGNPVFD YVWNTVRRRF
GGRLHARNDG IKPFIQSVRQ GYWGYYLPDQ DHGPEHSEFV DFFATYKATL PAIGRLMKVC
RARVVPLFPI YDGKTHRLTI QVRPPMDDLL EADDHTIARR MNEEVEIFVG PRPEQYTWIL
KLLKTRKPGE IQPYKRKDLY PIK