Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3219 |
Symbol | kpsF |
ID | 6145537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3290253 |
End bp | 3291272 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641618052 |
Product | polysialic acid capsule expression protein KpsF |
Protein accession | YP_001745202 |
Protein GI | 170680699 |
COG category | [M] Cell wall/membrane/envelope biogenesis [T] Signal transduction mechanisms |
COG ID | [COG0794] Predicted sugar phosphate isomerase involved in capsule formation [COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains |
TIGRFAM ID | [TIGR00393] KpsF/GutQ family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACTGTGG CATTATTTCC GTGCAAAGGA GCTGATATGT CTGAAAGACA TTTACCTGAT GACCAGAGCA GTACTATCGA TCCATATCTA ATTACCTCTG TTCGCCAGAC TCTGGCAGAA CAAGGCGCAG CATTACAAAA CTTGTCTAAA CAACTGGATT CCGGGCAGTA CCAGCGTGTC CTTAATTTGA TAATGAACTG TAAAGGGCAC GTTATTCTTT CGGGAATGGG TAAATCAGGG CATGTCGGTC GTAAAATGTC AGCGACGCTG GCCTCTACGG GTACGCCTAG TTTCTTTATT CATCCTGCAG AAGCTTTCCA TGGCGATCTG GGCATGATTA CGCCTTACGA TCTTCTGATC CTTATTTCTG CCAGCGGTGA AACGGATGAA ATCCTCAAGC TAGTTCCTTC ACTGAAAAAT TTCGGCAACC GAATTATCGC CATTACCAAT AATGGAAATT CCACGCTGGC GAAAAATGCT GATGCCGTGC TGGAACTCCA CATGGCGAAT GAAACCTGCC CGAATAATCT TGCACCAACA ACGTCTACCA CGCTGACGAT GGCGATCGGC GATGCGCTGG CGATTGCCAT GATCCACCAA CGCAAATTTA TGCCGAATGA TTTTGCGCGC TATCACCCGG GCGGTTCATT AGGTCGTCGC CTGCTGACCC GCGTTGCTGA TGTCATGCAG CATGATGTTC CTGCGGTACA GCTGGATGCG TCATTTAAAA CCGTGATTCA ACGTATCACC AGCGGATGCC AGGGAATGGT GATGGTAGAA GACGCAGAAG GTGGGCTAGC GGGCATTATC ACCGACGGTG ACCTGCGTCG CTTTATGGAA AAAGAGGATT CTCTGACATC CGCTACGGCT GCGCAGATGA TGACACGTGA ACCGCTGACG CTACCGGAAG ACACCATGAT CATTGAAGCG GAAGAAAAAA TGCAAAAGCA CCGCGTCTCA ACGTTATTGG TGACCAACAA GGCAAATAAA GTCACTGGCC TTGTGCGCAT TTTCGACTAA
|
Protein sequence | MTVALFPCKG ADMSERHLPD DQSSTIDPYL ITSVRQTLAE QGAALQNLSK QLDSGQYQRV LNLIMNCKGH VILSGMGKSG HVGRKMSATL ASTGTPSFFI HPAEAFHGDL GMITPYDLLI LISASGETDE ILKLVPSLKN FGNRIIAITN NGNSTLAKNA DAVLELHMAN ETCPNNLAPT TSTTLTMAIG DALAIAMIHQ RKFMPNDFAR YHPGGSLGRR LLTRVADVMQ HDVPAVQLDA SFKTVIQRIT SGCQGMVMVE DAEGGLAGII TDGDLRRFME KEDSLTSATA AQMMTREPLT LPEDTMIIEA EEKMQKHRVS TLLVTNKANK VTGLVRIFD
|
| |