Gene EcSMS35_3224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3224 
SymbolkpsS 
ID6144338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3296995 
End bp3298200 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content45% 
IMG OID641618057 
Productpolysialic acid capsule biosynthesis protein KpsS 
Protein accessionYP_001745207 
Protein GI170680977 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3562] Capsule polysaccharide export protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGGTA ATGCACTAAC CGTTTTATTA TCCGGTAAAA AATATCTGCT ATTGCAGGGG 
CCAATGGGAC CTTTTTTCAA TGACGTCGCC GAATGGTTAG AGTCATTAGG CCGTAACGCT
GTGAATGTTG TCTTCAACGG TGGGGATCGT TTTTACTGCC GCCATCGACA ATATCTGGCT
TACTACCAAA CGCCGAAAGA GTTTCCTGGT TGGTTGCGAG ATCTCCACCG ACAATATGAC
TTTGATACCA TCCTCTGCTT TGGTGACTGC CGCCCATTGC ACAAAGAAGC AAAACGCTGG
GCAAAGTCGA AAGGGATCCG CTTTCTGGCA TTTGAGGAAG GATATTTACG CCCGCAATTT
ATTACCGTTG AAGAAGGCGG AGTGAACGCA TATTCATCGC TACCGCGCGA TCCGGATTTT
TATCGTAAGT TACCAGATAT GCCTACGCCG CACGTTGAGA ACTTAAAACC TTCAACGATG
AAACGTATAG GTCATGCGAT GTGGTATTAC CTGATGGGCT GGCATTACCG TCATGAGTTC
CCTCGCTACC GCCACCACAA ATCATTTTCC CCCTGGTATG AAGCTCGTTG CTGGGTTCGT
GCGTACTGGC GCAAGCAACT TTACAAGGTA ACACAGCGTA AAGTATTGCC GCGGTTAATG
AATGAGCTGG ATCAGCGTTA TTATCTTGCC GTTTTGCAGG TGTATAACGA TAGCCAGATT
CGTAACCACA GCAATTATAA TGATGTGCGA GATTATATTA ATGAAGTCAT GTACTCATTT
TCGCGTAAAG CGCCAAAAGA AAGTTATTTG GTGATCAAAC ATCATCCGAT GGATCGTGGT
CACAGACTCT ATCGACCATT AATTAAGCGG TTGAGTAAGG AATATGGCTT AGGGGAACGA
GTCATTTATG TGCACGATCT CCCGATGCCG GAATTATTAC GCCACGCAAA AGCGGTGGTG
ACAATTAACA GTACGGCGGG GATCTCTGCG CTGATTCATA ACAAACCACT CAAAGTGATG
GGCAATGCCC TGTACGACAT CAAGGGATTG ACGTATCAAG GGCATTTGCA CCAGTTCTGG
CAGGCCGATT TTAAACCGGA TATGAAACTG TTTAAGAAGT TTCGGGGGTA TTTATTGGTG
AAGACGCAGG TTAATGGGGT TTATTATGGG GGGAACATAA CAAACCGCCA ACATAATATA
TATTAA
 
Protein sequence
MQGNALTVLL SGKKYLLLQG PMGPFFNDVA EWLESLGRNA VNVVFNGGDR FYCRHRQYLA 
YYQTPKEFPG WLRDLHRQYD FDTILCFGDC RPLHKEAKRW AKSKGIRFLA FEEGYLRPQF
ITVEEGGVNA YSSLPRDPDF YRKLPDMPTP HVENLKPSTM KRIGHAMWYY LMGWHYRHEF
PRYRHHKSFS PWYEARCWVR AYWRKQLYKV TQRKVLPRLM NELDQRYYLA VLQVYNDSQI
RNHSNYNDVR DYINEVMYSF SRKAPKESYL VIKHHPMDRG HRLYRPLIKR LSKEYGLGER
VIYVHDLPMP ELLRHAKAVV TINSTAGISA LIHNKPLKVM GNALYDIKGL TYQGHLHQFW
QADFKPDMKL FKKFRGYLLV KTQVNGVYYG GNITNRQHNI Y