Gene EcSMS35_3221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3221 
SymbolkpsD 
ID6145112 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3292516 
End bp3294192 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content52% 
IMG OID641618054 
Productpolysialic acid capsule transport protein KpsD 
Protein accessionYP_001745204 
Protein GI170681559 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1596] Periplasmic protein involved in polysaccharide export 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.364488 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTAT TTAAATCAAT TTTACTGATT GCCGCCTGTC ACGCGGCGCA GGCCAGCGCG 
GCCATTGATA TTAACGCTGA CCCAAACCTG ACAGGAGCCG CGCCGCTTAC CGGTATTCTG
AACGGGCAAC AGTCGGATAC GCAAAACATG AGCGGCTTCG ACAATACCCC GCCGCCCGCA
CCGCCGGTGG TCATGAGCCG TATGTTTGGT GCTCAACTTT TCAACGGCAC CAGCGCGGAT
AGCGGTGCGA CGGTAGGATT CAACCCTGAT TATATTCTGA ATCCGGGCGA TAGCATTCAG
GTCCGCTTAT GGGGTGCGTT CACCTTTGAT GGTGCGTTAC AGATTGATCC TAAAGGTAAT
ATTTTCCTGC CGAACGTTGG TCCGGTGAAA GTTGCTGGCG TCAGTAATAG TCAGTTAAAT
GCCCTGGTCA CATCCAAAGT GAAGGAAGTA TACCAGTCCA ACGTCAACGT CTACGCCTCC
TTATTACAGG CGCAGCCAGT AAAAGTGTAC GTGACCGGAT TTGTGCGTAA TCCTGGTCTG
TATGGCGGTG TGACGTCTGA TTCGTTACTC AATTATCTGA TCAAGGCTGG CGGCGTTGAT
CCAGAGCGCG GAAGTTACGT TGATATTGTG GTCAAGCGCG GTAACCGCGT GCGCTCCAAC
GTCAACCTGT ACGACTTCCT GCTGAACGGC AAACTGGGGC TTTCGCAGTT CGCCGATGGT
GACACCATCA TCGTCGGGCC GCGTCAGCAT ACTTTCAGCG TTCAGGGCGA TGTCTTTAAC
AGCTACGACT TTGAGTTCCG CGAAAGCAGC ATTCCCGTAA CGGAAGCGTT GAGCTGGGCG
CGCCCTAAGC CTGGCGCGAC TCACATTACG ATTATGCGTA AACAGGGGCT GCAAAAACGC
AGCGAATACT ATCCGATCAG TTCTGCGCCA GGCCGTATGT TGCAAAATGG CGATACCTTA
ATCGTGAGCA CTGACCGCTA TGCCGGCACC ATTCAGGTGC GGGTTGAAGG CGCACACTCC
GGTGAACATG CCATGGTACT GCCTTATGGT TCCACTATGC GTGCGGTTCT GGAAAAAGTC
CGCCCGAACA GCATGTCGCA GATGAACGCG GTTCAGCTTT ATCGCCCATC AGTAGCTCAG
CGTCAGAAAG AGATGCTGAA TCTCTCGCTG CAAAAACTGG AGGAAGCATC ACTTTCTGCC
CAGTCCTCCA CCAAAGAAGA AGCCAGCCTG CGAATGCAGG AAGCGCAACT GATCAGCCGC
TTTGTGGCGA AAGCGCGCAC CGTGGTTCCG AAAGGTGAAG TGATCCTCAA CGAATCCAAT
ATTGATTCTG TTCTGCTTGA AGATGGCGAC GTCATCAATA TTCCGGAGAA AACATCGCTG
GTTATGGTTC ATGGCGAAGT GCTGTTCCCG AACGCGGTGA GCTGGCAGAA GGGTATGACC
ACCGAGGATT ACATCGAGAA ATGTGGTGGC CTGACGCAAA AATCGGGTAA CGCCAGAATT
ATCGTCATTC GTCAGAACGG TGCGGCAGTC AACGCTGAAG ATGTGGATTC ACTCAAACCG
GGCGATGAGA TTATGGTTCT GCCGAAATAT GAATCGAAAA ACATTGAAGT TACCCGTGGT
ATTTCCACCA TCCTCTATCA GCTGGCGGTG GGTGCAAAAG TGATTCTGTC TTTGTAA
 
Protein sequence
MKLFKSILLI AACHAAQASA AIDINADPNL TGAAPLTGIL NGQQSDTQNM SGFDNTPPPA 
PPVVMSRMFG AQLFNGTSAD SGATVGFNPD YILNPGDSIQ VRLWGAFTFD GALQIDPKGN
IFLPNVGPVK VAGVSNSQLN ALVTSKVKEV YQSNVNVYAS LLQAQPVKVY VTGFVRNPGL
YGGVTSDSLL NYLIKAGGVD PERGSYVDIV VKRGNRVRSN VNLYDFLLNG KLGLSQFADG
DTIIVGPRQH TFSVQGDVFN SYDFEFRESS IPVTEALSWA RPKPGATHIT IMRKQGLQKR
SEYYPISSAP GRMLQNGDTL IVSTDRYAGT IQVRVEGAHS GEHAMVLPYG STMRAVLEKV
RPNSMSQMNA VQLYRPSVAQ RQKEMLNLSL QKLEEASLSA QSSTKEEASL RMQEAQLISR
FVAKARTVVP KGEVILNESN IDSVLLEDGD VINIPEKTSL VMVHGEVLFP NAVSWQKGMT
TEDYIEKCGG LTQKSGNARI IVIRQNGAAV NAEDVDSLKP GDEIMVLPKY ESKNIEVTRG
ISTILYQLAV GAKVILSL