Gene Information |
Locus tag | EcSMS35_1331 |
Symbol | msbB |
ID | 6146686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1319911 |
End bp | 1320882 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641616209 |
Product | lipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA acyltransferase |
Protein accession | YP_001743389 |
Protein GI | 170680587 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1560] Lauroyl/myristoyl acyltransferase |
TIGRFAM ID | [TIGR02208] lipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA acyltransferase |
|
|
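The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and that the CDS is complete and ends with one stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 1319911
END_BP = 1320882
GENE_LENGTH_BP = 972
PROTEIN_LENGTH_AA = 323


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus the terminal stop codon (assumes a complete CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 972
print(expected_protein_length(GENE_LENGTH_BP))  # 323
```

Under these assumptions both values agree with the Gene Length and Protein Length fields reported in the record.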
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0569564 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAACGA AAAAAAATAA TAGCGAATAC ATTCCTGAGT TTGATAAATC CTTTCGCCAC CCGCGCTACT GGGGAGCATG GCTGGGCGTA GCAGCGATGG CGGGTATTGC TTTAACGCCG CCAAAGCTCC GTGATCCCAT TCTGGCACGG CTGGGACGTT TTGCCGGACG ACTGGGAAAA AGCTCACGCC GTCGTGCGTT AATCAATCTG TCGCTCTGCT TTCCAGAACG TAGTGAAGCT GAACGCGAAG CGATTGTTGA TGAGATGTTT GCCACTGCGC CGCAAGCGAT GGCAATGATG GCTGAGTTGG CAATACGCGG GCCGGAGAAA ATTCAGCCGC GCGTTGAATG GCAAGGGCTG GAGATCATCG AAGAGATGCG GCGTAATAAC GAGAAAGTGA TTTTTCTGGT GCCGCACGGT TGGGCCGTCG ATATTCCTGC CATGCTGATG GCCTCGCAAG GGCAGAAAAT GGCAGCGATG TTCCATAATC AGGGCAACCC GGTTTTTGAT TATGTCTGGA ACACGGTGCG TCGTCGCTTT GGTGGTCGTC TGCATGCGAG AAATGATGGT ATTAAACCAT TCATCCAGTC GGTACGTCAG GGGTACTGGG GATATTATTT ACCCGATCAG GATCATGGCC CAGAGCACAG CGAATTTGTT GATTTCTTTG CCACCTATAA AGCGACGTTG CCCGCGATTG GTCGTTTGAT GAAAGTGTGC CGTGCGCGCG TTGTACCGCT GTTTCCGATT TATGATGGCA AGACGCATCG CCTGACGATT CAGGTGCGCC CACCGATGGA TGATCTGTTA GAGGCGGATG ACCATACGAT TGCGCGGCGG ATGAATGAAG AAGTCGAGAT TTTTGTTGGT CCGCGACCAG AACAATACAC CTGGATATTA AAATTGCTGA AAACTCGCAA ACCGGGCGAA ATCCAACCGT ATAAGCGCAA AGATCTTTAT CCCATCAAAT AA
|
Protein sequence | METKKNNSEY IPEFDKSFRH PRYWGAWLGV AAMAGIALTP PKLRDPILAR LGRFAGRLGK SSRRRALINL SLCFPERSEA EREAIVDEMF ATAPQAMAMM AELAIRGPEK IQPRVEWQGL EIIEEMRRNN EKVIFLVPHG WAVDIPAMLM ASQGQKMAAM FHNQGNPVFD YVWNTVRRRF GGRLHARNDG IKPFIQSVRQ GYWGYYLPDQ DHGPEHSEFV DFFATYKATL PAIGRLMKVC RARVVPLFPI YDGKTHRLTI QVRPPMDDLL EADDHTIARR MNEEVEIFVG PRPEQYTWIL KLLKTRKPGE IQPYKRKDLY PIK
|
| |
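The relationship between the gene sequence and the protein sequence can be sketched with a small translator. This is a minimal illustration, not the pipeline that produced the record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are permitted; the sketch does not handle alternative start codons, which is harmless here because the gene begins with ATG. The helper names (clean, translate, gc_percent) are hypothetical.

```python
# Codon table in the usual NCBI order (first, second, third base each over T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record.
prefix = "ATGGAAACGA AAAAAAATAA TAGCGAATAC"
print(translate(prefix))  # METKKNNSEY
```

The output matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to translate and gc_percent gives values that can be compared with the Protein sequence and GC content fields.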