Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3221 |
Symbol | kpsD |
ID | 6145112 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3292516 |
End bp | 3294192 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641618054 |
Product | polysialic acid capsule transport protein KpsD |
Protein accession | YP_001745204 |
Protein GI | 170681559 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1596] Periplasmic protein involved in polysaccharide export |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.364488 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTAT TTAAATCAAT TTTACTGATT GCCGCCTGTC ACGCGGCGCA GGCCAGCGCG GCCATTGATA TTAACGCTGA CCCAAACCTG ACAGGAGCCG CGCCGCTTAC CGGTATTCTG AACGGGCAAC AGTCGGATAC GCAAAACATG AGCGGCTTCG ACAATACCCC GCCGCCCGCA CCGCCGGTGG TCATGAGCCG TATGTTTGGT GCTCAACTTT TCAACGGCAC CAGCGCGGAT AGCGGTGCGA CGGTAGGATT CAACCCTGAT TATATTCTGA ATCCGGGCGA TAGCATTCAG GTCCGCTTAT GGGGTGCGTT CACCTTTGAT GGTGCGTTAC AGATTGATCC TAAAGGTAAT ATTTTCCTGC CGAACGTTGG TCCGGTGAAA GTTGCTGGCG TCAGTAATAG TCAGTTAAAT GCCCTGGTCA CATCCAAAGT GAAGGAAGTA TACCAGTCCA ACGTCAACGT CTACGCCTCC TTATTACAGG CGCAGCCAGT AAAAGTGTAC GTGACCGGAT TTGTGCGTAA TCCTGGTCTG TATGGCGGTG TGACGTCTGA TTCGTTACTC AATTATCTGA TCAAGGCTGG CGGCGTTGAT CCAGAGCGCG GAAGTTACGT TGATATTGTG GTCAAGCGCG GTAACCGCGT GCGCTCCAAC GTCAACCTGT ACGACTTCCT GCTGAACGGC AAACTGGGGC TTTCGCAGTT CGCCGATGGT GACACCATCA TCGTCGGGCC GCGTCAGCAT ACTTTCAGCG TTCAGGGCGA TGTCTTTAAC AGCTACGACT TTGAGTTCCG CGAAAGCAGC ATTCCCGTAA CGGAAGCGTT GAGCTGGGCG CGCCCTAAGC CTGGCGCGAC TCACATTACG ATTATGCGTA AACAGGGGCT GCAAAAACGC AGCGAATACT ATCCGATCAG TTCTGCGCCA GGCCGTATGT TGCAAAATGG CGATACCTTA ATCGTGAGCA CTGACCGCTA TGCCGGCACC ATTCAGGTGC GGGTTGAAGG CGCACACTCC GGTGAACATG CCATGGTACT GCCTTATGGT TCCACTATGC GTGCGGTTCT GGAAAAAGTC CGCCCGAACA GCATGTCGCA GATGAACGCG GTTCAGCTTT ATCGCCCATC AGTAGCTCAG CGTCAGAAAG AGATGCTGAA TCTCTCGCTG CAAAAACTGG AGGAAGCATC ACTTTCTGCC CAGTCCTCCA CCAAAGAAGA AGCCAGCCTG CGAATGCAGG AAGCGCAACT GATCAGCCGC TTTGTGGCGA AAGCGCGCAC CGTGGTTCCG AAAGGTGAAG TGATCCTCAA CGAATCCAAT ATTGATTCTG TTCTGCTTGA AGATGGCGAC GTCATCAATA TTCCGGAGAA AACATCGCTG GTTATGGTTC ATGGCGAAGT GCTGTTCCCG AACGCGGTGA GCTGGCAGAA GGGTATGACC ACCGAGGATT ACATCGAGAA ATGTGGTGGC CTGACGCAAA AATCGGGTAA CGCCAGAATT ATCGTCATTC GTCAGAACGG TGCGGCAGTC AACGCTGAAG ATGTGGATTC ACTCAAACCG GGCGATGAGA TTATGGTTCT GCCGAAATAT GAATCGAAAA ACATTGAAGT TACCCGTGGT ATTTCCACCA TCCTCTATCA GCTGGCGGTG GGTGCAAAAG TGATTCTGTC TTTGTAA
|
Protein sequence | MKLFKSILLI AACHAAQASA AIDINADPNL TGAAPLTGIL NGQQSDTQNM SGFDNTPPPA PPVVMSRMFG AQLFNGTSAD SGATVGFNPD YILNPGDSIQ VRLWGAFTFD GALQIDPKGN IFLPNVGPVK VAGVSNSQLN ALVTSKVKEV YQSNVNVYAS LLQAQPVKVY VTGFVRNPGL YGGVTSDSLL NYLIKAGGVD PERGSYVDIV VKRGNRVRSN VNLYDFLLNG KLGLSQFADG DTIIVGPRQH TFSVQGDVFN SYDFEFRESS IPVTEALSWA RPKPGATHIT IMRKQGLQKR SEYYPISSAP GRMLQNGDTL IVSTDRYAGT IQVRVEGAHS GEHAMVLPYG STMRAVLEKV RPNSMSQMNA VQLYRPSVAQ RQKEMLNLSL QKLEEASLSA QSSTKEEASL RMQEAQLISR FVAKARTVVP KGEVILNESN IDSVLLEDGD VINIPEKTSL VMVHGEVLFP NAVSWQKGMT TEDYIEKCGG LTQKSGNARI IVIRQNGAAV NAEDVDSLKP GDEIMVLPKY ESKNIEVTRG ISTILYQLAV GAKVILSL
|
| |