Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3224 |
Symbol | kpsS |
ID | 6144338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3296995 |
End bp | 3298200 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641618057 |
Product | polysialic acid capsule biosynthesis protein KpsS |
Protein accession | YP_001745207 |
Protein GI | 170680977 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3562] Capsule polysaccharide export protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGGTA ATGCACTAAC CGTTTTATTA TCCGGTAAAA AATATCTGCT ATTGCAGGGG CCAATGGGAC CTTTTTTCAA TGACGTCGCC GAATGGTTAG AGTCATTAGG CCGTAACGCT GTGAATGTTG TCTTCAACGG TGGGGATCGT TTTTACTGCC GCCATCGACA ATATCTGGCT TACTACCAAA CGCCGAAAGA GTTTCCTGGT TGGTTGCGAG ATCTCCACCG ACAATATGAC TTTGATACCA TCCTCTGCTT TGGTGACTGC CGCCCATTGC ACAAAGAAGC AAAACGCTGG GCAAAGTCGA AAGGGATCCG CTTTCTGGCA TTTGAGGAAG GATATTTACG CCCGCAATTT ATTACCGTTG AAGAAGGCGG AGTGAACGCA TATTCATCGC TACCGCGCGA TCCGGATTTT TATCGTAAGT TACCAGATAT GCCTACGCCG CACGTTGAGA ACTTAAAACC TTCAACGATG AAACGTATAG GTCATGCGAT GTGGTATTAC CTGATGGGCT GGCATTACCG TCATGAGTTC CCTCGCTACC GCCACCACAA ATCATTTTCC CCCTGGTATG AAGCTCGTTG CTGGGTTCGT GCGTACTGGC GCAAGCAACT TTACAAGGTA ACACAGCGTA AAGTATTGCC GCGGTTAATG AATGAGCTGG ATCAGCGTTA TTATCTTGCC GTTTTGCAGG TGTATAACGA TAGCCAGATT CGTAACCACA GCAATTATAA TGATGTGCGA GATTATATTA ATGAAGTCAT GTACTCATTT TCGCGTAAAG CGCCAAAAGA AAGTTATTTG GTGATCAAAC ATCATCCGAT GGATCGTGGT CACAGACTCT ATCGACCATT AATTAAGCGG TTGAGTAAGG AATATGGCTT AGGGGAACGA GTCATTTATG TGCACGATCT CCCGATGCCG GAATTATTAC GCCACGCAAA AGCGGTGGTG ACAATTAACA GTACGGCGGG GATCTCTGCG CTGATTCATA ACAAACCACT CAAAGTGATG GGCAATGCCC TGTACGACAT CAAGGGATTG ACGTATCAAG GGCATTTGCA CCAGTTCTGG CAGGCCGATT TTAAACCGGA TATGAAACTG TTTAAGAAGT TTCGGGGGTA TTTATTGGTG AAGACGCAGG TTAATGGGGT TTATTATGGG GGGAACATAA CAAACCGCCA ACATAATATA TATTAA
|
Protein sequence | MQGNALTVLL SGKKYLLLQG PMGPFFNDVA EWLESLGRNA VNVVFNGGDR FYCRHRQYLA YYQTPKEFPG WLRDLHRQYD FDTILCFGDC RPLHKEAKRW AKSKGIRFLA FEEGYLRPQF ITVEEGGVNA YSSLPRDPDF YRKLPDMPTP HVENLKPSTM KRIGHAMWYY LMGWHYRHEF PRYRHHKSFS PWYEARCWVR AYWRKQLYKV TQRKVLPRLM NELDQRYYLA VLQVYNDSQI RNHSNYNDVR DYINEVMYSF SRKAPKESYL VIKHHPMDRG HRLYRPLIKR LSKEYGLGER VIYVHDLPMP ELLRHAKAVV TINSTAGISA LIHNKPLKVM GNALYDIKGL TYQGHLHQFW QADFKPDMKL FKKFRGYLLV KTQVNGVYYG GNITNRQHNI Y
|
| |