Gene EcSMS35_3223 details

Gene Information

Locus tag: EcSMS35_3223
Symbol: kpsC
ID: 6144878
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3294939
End bp: 3296972
Gene Length: 2034 bp
Protein Length: 677 aa
Translation table: 11
GC content: 51%
IMG OID: 641618056
Product: polysialic acid capsule polysaccharide export protein KpsC
Protein accession: YP_001745206
Protein GI: 170680172
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3563] Capsule polysaccharide export protein
TIGRFAM ID:
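
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3294939
END_BP = 3296972
REPORTED_GENE_LENGTH = 2034      # bp
REPORTED_PROTEIN_LENGTH = 677    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming the CDS ends with one uncounted stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == REPORTED_GENE_LENGTH)
print("protein length matches:", computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions both reported lengths agree with the coordinates: 3296972 - 3294939 + 1 = 2034 bp, and 2034 / 3 - 1 = 677 aa.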


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGGCA TTTACTCGCC TGGCATCTGG CGTATTCCGC ATCTGGAGAA ATTTCTGGCG 
CAACCGTGCC AGAAACTTTC TCTGCTGCGT CCTGTTCCGC AAAACGTCGA TGCTATCGCC
GTGTGGGGAC ATCGTCCCAG CGCGGCGAAA CCAGTCGCCA TCGCCAAAGC CGCGGGAAAA
CCCGTCATTC GTCTGGAAGA TGGATTTGTG CGTTCGCTGG GTCTTGGCGT CAATGGCGAG
CCGCCGCTTT CTCTGGTGGT GGATGATTGT GGCATTTACT ACGATGCCAG CAAGCCTTCA
GCACTGGAGA AACTGGTACA GGATAAAGCC GGAAATACAG CTCTGATAAG CCAGGCCAGA
GAAGCGATGC ACACCATCGT GACCGGGGAT TTGTCGAAAT ATAACCTGGC ACCTGCGTTT
GTGGCTGATG AGTCTGAACG TTCAGACATC GTTCTGGTTG TCGATCAGAC ATTTAATGAT
ATGTCAGTGA CGTATGGCAA TGCTGGCCCG CATGAGTTTG CTGCCATGCT GGAAGCCGCG
ATGGCGGAAA ATCCTCAAGC CGAAATTTGG GTGAAGGTGC ACCCAGATGT ACTGGAAGGA
AAAAAAACAG GTTATTTCGC CGATCTGCGC GCCACGCAAC GAGTACGTTT AATTGCCGAG
AATGTCAGCC CGCAGTCGCT GTTGCGACAC GTTTCCCGGG TTTACGTCGT GACCTCCCAG
TACGGCTTTG AAGCCTTGCT GGCAGGAAAA CCAGTAACAT GTTTCGGCCA GCCCTGGTAT
GCAGGCTGGG GCTTAACCGA CGATCGCCAT CCGCAGTCCG CTTTGTTATC TGCCCGACGC
GGTTCTGCCA CGCTGGAGGA ACTTTTTGCC GCTGCATACC TGCGTTACTG TCGCTATATC
TACCCGCAAA CGGGAGAAGT AAGCAATCTA TTTACCGTGC TGCAATGGCT GCAATTACAA
CGTCGACATC TGCAACAGCG TAATGGTTAT TTATGGGCGC CAGGCTTAAC GCTGTGGAAG
TCAGCGATCC TGAAACCTTT CTTGCAAACG GCAACAAACC GGCTGAGTTT TTCACGTCGC
TGTACTGCGG CGAGCGCCTG CGTGGTATGG GGTGTAAAGG GAGAACAGCA ATGGCGAGCC
GAAGCGCAGC GAAAATCACT GCCGTTATGG CGAATGGAAG ATGGTTTTCT GCGTTCATCC
GGACTTGGCT CTGACCTGCT GCCGCCGCTA TCGCTGGTGC TGGATAAACG CGGTATTTAC
TATGACGCCA CGCGCCCCAG CGATCTGGAA GTGCTGCTGA ATCATAGCCA GCTAACGCTG
GCGCAGAAGA TGCGAGCTGA AAAATTACGC CAGCGACTGG TTGAAAGCAA ACTGAGTAAG
TACAACCTGG GAGCCGATTT CTCTCTACCA GCCGAAGCCA AAGATAAAAA AGTTATCCTG
GTGCCGGGTC AGGTAGAAGA CGATGCCTCA ATTAAAACCG GCACCGTTTC TATTAAGAGC
AACCTTGAGT TATTACGCAC AGTACGCGAG CGTAATCCGC ACGCTTACAT TATTTATAAA
CCGCACCCGG ATGTACTGGT GGGGAATCGC AAGGGCAATA TTCCGACAGA ACTAATTGCT
GAACTCGCTG ATTATCAGGC ACTGGACGCA GATATTATTC AATGCATTCT GCGCGCAGAT
GAAGTGCACA CCATGACATC ATTGTCCGGG TTTGAAGCGT TATTACATGG CAAGCACGTA
CATTGTTACG GCCTGCCCTT CTATGCCGGT TGGGGTTTAA CCGTCGATGA ACATCGCTGC
CCGCGTCGCG AGCGAAAATT AACGTTAGCG GATTTGATCT ATCAGGCGCT GATTGTTTAT
CCAACCTATA TCCACCCAAC ATGGCTACAA CCTATTACGG TAGAAGAAGC TGCGGAATAT
TTAATCAAGA CGCCGCGCAA GCCGATGTTT ATTACCCGAA AAAAAGCGGT AATACGCTAT
TACCGCAAAT TAATTATGTT CTGCAAGGTC AGATTTGGCC AAACAATTTC ATAG
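
The gene sequence is printed in space-separated blocks of ten bases, so the grouping has to be removed before the sequence is measured. The following is a minimal sketch of how the GC content field could be recomputed from the listing. The helper names are hypothetical, and the calculation assumes GC content is simply the fraction of G and C bases over the whole gene, which may differ from how the database derives its rounded figure.

```python
def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used for display and uppercase the bases."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a (possibly space-grouped) sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Only the first displayed line is used here; paste the full listing
# to recompute the value for the whole gene.
first_line = (
    "ATGATTGGCA TTTACTCGCC TGGCATCTGG CGTATTCCGC ATCTGGAGAA ATTTCTGGCG"
)
print(len(clean_sequence(first_line)), "bases")
print(round(gc_percent(first_line), 1), "% GC")
```

Applied to the complete gene sequence, the cleaned length should equal the reported gene length of 2034 bp, and the percentage can be compared with the reported GC content.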
 
Protein sequence
MIGIYSPGIW RIPHLEKFLA QPCQKLSLLR PVPQNVDAIA VWGHRPSAAK PVAIAKAAGK 
PVIRLEDGFV RSLGLGVNGE PPLSLVVDDC GIYYDASKPS ALEKLVQDKA GNTALISQAR
EAMHTIVTGD LSKYNLAPAF VADESERSDI VLVVDQTFND MSVTYGNAGP HEFAAMLEAA
MAENPQAEIW VKVHPDVLEG KKTGYFADLR ATQRVRLIAE NVSPQSLLRH VSRVYVVTSQ
YGFEALLAGK PVTCFGQPWY AGWGLTDDRH PQSALLSARR GSATLEELFA AAYLRYCRYI
YPQTGEVSNL FTVLQWLQLQ RRHLQQRNGY LWAPGLTLWK SAILKPFLQT ATNRLSFSRR
CTAASACVVW GVKGEQQWRA EAQRKSLPLW RMEDGFLRSS GLGSDLLPPL SLVLDKRGIY
YDATRPSDLE VLLNHSQLTL AQKMRAEKLR QRLVESKLSK YNLGADFSLP AEAKDKKVIL
VPGQVEDDAS IKTGTVSIKS NLELLRTVRE RNPHAYIIYK PHPDVLVGNR KGNIPTELIA
ELADYQALDA DIIQCILRAD EVHTMTSLSG FEALLHGKHV HCYGLPFYAG WGLTVDEHRC
PRRERKLTLA DLIYQALIVY PTYIHPTWLQ PITVEEAAEY LIKTPRKPMF ITRKKAVIRY
YRKLIMFCKV RFGQTIS