Gene EcSMS35_3220 details

Gene Information

Locus tag: EcSMS35_3220
Symbol: kpsE
ID: 6144874
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3291344
End bp: 3292492
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 50%
IMG OID: 641618053
Product: polysialic acid capsule export inner-membrane protein KpsE
Protein accession: YP_001745203
Protein GI: 170680931
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3524] Capsule polysaccharide export protein
TIGRFAM ID: [TIGR01010] polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family
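
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length(3291344, 3292492)
print(length)                  # 1149
print(protein_length(length))  # 382
```

Both results agree with the Gene Length and Protein Length fields, which is consistent with the inclusive-coordinate assumption.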


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGATAA AAGTGAAGTC TGCCGTATCC TGGATGCGTG CTCGTCTGTC TGCCATCTCA 
CTGGCAGATA TCCAAAAACA CCTGGCGAAA ATCATCATTC TGGCACCGAT GGCGGTGCTG
CTGATCTATC TGGCTATCTT CAGCCAGCCT CGCTATATGA GCGAGTCGAA AGTCGCCATT
AAACGCTCGG ATGATTTAAA CAGCGGCAGC CTGAATTTTG GTCTGCTTCT GGGTGCCTCT
AACCCCAGTT CCGCAGAAGA TGCGTTGTAT CTGAAAGAGT ACATCAACTC GCCGGATATG
CTGGCGGCGC TGGATAAGCA ACTAAATTTT CGTGAAGCGT TTAGCCACAG CGGGCTCGAT
TTTCTTAATC ATCTTAGCAA GGATGAAACC GCAGAAGGCT TCCTGAAGTA CTACAAGGAC
CGTATCAACG TCTCGTATGA CGATAAAACC GGATTACTGA ATATTCAGAC GCAGGGCTTT
AGCCCGGAGT TTGCGCTTAA GTTTAACCAG ACCGTGCTGA AAGAGTCAGA GCGCTTTATC
AATGAGATGT CACATCGCAT CGCGCGTGAC CAGCTTGCCT TTGCAGAAAC GGAGATGGAA
AAGGCACGCC AGCGTCTGGA CGCCAGCAAA GCGGAATTGC TCTCTTATCA GGACAACAAC
AACGTTCTGG ATCCACAGGC ACAGGCACAG GCGGCGAGCA CGTTAGTGAA TACGCTGATG
GGCCAGAAGA TCCAGATGGA AGCGGACCTG CGGAACTTGC TGACGTATCT GCGTGAGGAC
GCCCCGCAAG TTGTGAGTGC GCGTAATGCG ATTCAGTCAT TGCAGGCACA AATTGACGAA
GAAAAAAGCA AAATCACAGC GCCGCAGGGT GACAAGCTAA ACCGTATGGC GGTGGATTTT
GAAGAAATCA AATCAAAAGT AGAGTTCAAC ACCGAGCTGT ACAAACTGAC CCTGACCTCC
ATTGAAAAGA CCCGTGTAGA AGCGGCTCGT AAGCTCAAGG TGCTGTCAGT GATCAGTTCG
CCACAGTTGC CGCAGGAATC CTCTTTTCCA AATATCCCTT ATTTGATCGC CTGCTGGTTA
CTGGTGTGCT GCCTGCTGTT CGGCACCCTG AAACTGTTGC TGGCTGTTAT TGAAGATCAC
CGAGACTAA
 
Protein sequence
MLIKVKSAVS WMRARLSAIS LADIQKHLAK IIILAPMAVL LIYLAIFSQP RYMSESKVAI 
KRSDDLNSGS LNFGLLLGAS NPSSAEDALY LKEYINSPDM LAALDKQLNF REAFSHSGLD
FLNHLSKDET AEGFLKYYKD RINVSYDDKT GLLNIQTQGF SPEFALKFNQ TVLKESERFI
NEMSHRIARD QLAFAETEME KARQRLDASK AELLSYQDNN NVLDPQAQAQ AASTLVNTLM
GQKIQMEADL RNLLTYLRED APQVVSARNA IQSLQAQIDE EKSKITAPQG DKLNRMAVDF
EEIKSKVEFN TELYKLTLTS IEKTRVEAAR KLKVLSVISS PQLPQESSFP NIPYLIACWL
LVCCLLFGTL KLLLAVIEDH RD
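
The gene and protein sequences above are related by the record's translation table (11). The sketch below shows one way that relationship and the GC content field could be recomputed from the space-separated 10-base blocks. It assumes the sequence is given 5' to 3' on the coding strand, and it relies on the fact that table 11 assigns the same amino acids to codons as the standard code; alternative start codons are not handled specially. All function names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First seven codons and last three codons of the gene sequence above.
print(translate("ATGTTGATAA AAGTGAAGTC T"))  # MLIKVKS
print(translate("CGAGACTAA"))                # RD
```

The two fragments reproduce the first residues (MLIKVKS) and the last residues (RD) of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the whole record to be checked in the same way.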