Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3220 |
Symbol | kpsE |
ID | 6144874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3291344 |
End bp | 3292492 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641618053 |
Product | polysialic acid capsule export inner-membrane protein KpsE |
Protein accession | YP_001745203 |
Protein GI | 170680931 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3524] Capsule polysaccharide export protein |
TIGRFAM ID | [TIGR01010] polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGATAA AAGTGAAGTC TGCCGTATCC TGGATGCGTG CTCGTCTGTC TGCCATCTCA CTGGCAGATA TCCAAAAACA CCTGGCGAAA ATCATCATTC TGGCACCGAT GGCGGTGCTG CTGATCTATC TGGCTATCTT CAGCCAGCCT CGCTATATGA GCGAGTCGAA AGTCGCCATT AAACGCTCGG ATGATTTAAA CAGCGGCAGC CTGAATTTTG GTCTGCTTCT GGGTGCCTCT AACCCCAGTT CCGCAGAAGA TGCGTTGTAT CTGAAAGAGT ACATCAACTC GCCGGATATG CTGGCGGCGC TGGATAAGCA ACTAAATTTT CGTGAAGCGT TTAGCCACAG CGGGCTCGAT TTTCTTAATC ATCTTAGCAA GGATGAAACC GCAGAAGGCT TCCTGAAGTA CTACAAGGAC CGTATCAACG TCTCGTATGA CGATAAAACC GGATTACTGA ATATTCAGAC GCAGGGCTTT AGCCCGGAGT TTGCGCTTAA GTTTAACCAG ACCGTGCTGA AAGAGTCAGA GCGCTTTATC AATGAGATGT CACATCGCAT CGCGCGTGAC CAGCTTGCCT TTGCAGAAAC GGAGATGGAA AAGGCACGCC AGCGTCTGGA CGCCAGCAAA GCGGAATTGC TCTCTTATCA GGACAACAAC AACGTTCTGG ATCCACAGGC ACAGGCACAG GCGGCGAGCA CGTTAGTGAA TACGCTGATG GGCCAGAAGA TCCAGATGGA AGCGGACCTG CGGAACTTGC TGACGTATCT GCGTGAGGAC GCCCCGCAAG TTGTGAGTGC GCGTAATGCG ATTCAGTCAT TGCAGGCACA AATTGACGAA GAAAAAAGCA AAATCACAGC GCCGCAGGGT GACAAGCTAA ACCGTATGGC GGTGGATTTT GAAGAAATCA AATCAAAAGT AGAGTTCAAC ACCGAGCTGT ACAAACTGAC CCTGACCTCC ATTGAAAAGA CCCGTGTAGA AGCGGCTCGT AAGCTCAAGG TGCTGTCAGT GATCAGTTCG CCACAGTTGC CGCAGGAATC CTCTTTTCCA AATATCCCTT ATTTGATCGC CTGCTGGTTA CTGGTGTGCT GCCTGCTGTT CGGCACCCTG AAACTGTTGC TGGCTGTTAT TGAAGATCAC CGAGACTAA
|
Protein sequence | MLIKVKSAVS WMRARLSAIS LADIQKHLAK IIILAPMAVL LIYLAIFSQP RYMSESKVAI KRSDDLNSGS LNFGLLLGAS NPSSAEDALY LKEYINSPDM LAALDKQLNF REAFSHSGLD FLNHLSKDET AEGFLKYYKD RINVSYDDKT GLLNIQTQGF SPEFALKFNQ TVLKESERFI NEMSHRIARD QLAFAETEME KARQRLDASK AELLSYQDNN NVLDPQAQAQ AASTLVNTLM GQKIQMEADL RNLLTYLRED APQVVSARNA IQSLQAQIDE EKSKITAPQG DKLNRMAVDF EEIKSKVEFN TELYKLTLTS IEKTRVEAAR KLKVLSVISS PQLPQESSFP NIPYLIACWL LVCCLLFGTL KLLLAVIEDH RD
|
| |