Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3247 |
Symbol | gspD |
ID | 6142627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3318117 |
End bp | 3320177 |
Gene Length | 2061 bp |
Protein Length | 686 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641618077 |
Product | general secretion pathway protein GspD |
Protein accession | YP_001745227 |
Protein GI | 170681179 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1450] Type II secretory pathway, component PulD |
TIGRFAM ID | [TIGR02517] general secretion pathway protein D |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTTTGGC GTGATATGAC GTTGTCGGTC TGGCGTAAGA AGACAACTGG CCTCAAAACA AAAAAGCGTT TACTGCCGCT GGTGCTGGCA GCGGCATTAT GCAGTTCACC GGTCTGGGCG GAAGAAGCCA CTTTCACCGC TAATTTTAAA GATACCGACC TGAAATCGTT CATCGAAACC GTCGGCGCTA ACCTTAATAA AACCATCATT ATGGGGCCGG GCGTTCAGGG GAAAGTGAGT ATTCGCACTA TGACTCCACT CAATGAACGC CAGTATTACC AGCTATTCCT TAACCTGCTG GAAGCGCAGG GGTATGCCGT CGTACCGATG GAAAACGACG TGCTGAAGGT GGTGAAATCA AGCGCCGCGA AAGTCGAGCC GCTGCCGCTG GTCGGTGAAG GCAGCGACAA CTACGCGGGC GATGAAATGG TCACCAAAGT CGTGCCGGTA CGTAATGTGT CGGTACGCGA GCTGGCACCG ATTCTGCGCC AGATGATTGA CAGCGCAGGC TCAGGCAACG TTGTTAATTA CGATCCCTCC AACGTGATTA TGCTCACCGG ACGCGCCTCA GTCGTGGAGC GACTGACGGA AGTGATCCAG CGCGTGGATC ACGCAGGAAA CCGCACCGAA GAGGTGATCC CGCTGGATAA CGCCTCAGCC TCGGAAATTG CCCGCGTGCT GGAAAGCCTG ACCAAAAACA GCGGCGAGAA CCAGCCAGCA ACGCTGAAAT CTCAAATTGT CGCCGATGAA CGCACCAACA GCGTGATTGT CAGTGGTGAC CCCGCCACGC GAGACAAAAT GCGCCGTCTG ATCCGTCGGC TGGATTCAGA AATGGAGCGC AGAGGCAACA GCCAGGTGTT CTATCTCAAA TACAGCAAAG CCGAAGATCT GGTCGATGTG CTGAAGCAGG TCAGCGGTAC GCTCACGGCG GCTAAAGAAG AGGCGGAAGG CACGGTTGGT AGCGGGCGTG AGGTTGTCTC CATCGCCGCC AGCAAACACA GTAATGCCCT GATTGTTACT GCGCCGCAGG ACATTATGCA GTCGCTGCAA AGCGTGATTG AACAACTGGA TATTCGCCGT GCTCAGGTGC ATGTCGAGGC GTTAATCGTG GAAGTTGCCG AAGGCAGCAA TATCAACTTC GGCGTGCAGT GGGGGTCAAA AGATGCCGGA TTAATGCAGT TTGCCAACGG TACGCAAATC CCTATTGGTA CGCTGGGGGC AGCGATTTCT GCAGCCAAGC CGCAGAAAGG TTCCACGGTG ATCAGCGAAA ACGGTGCTAC CACCATAAAT CCGGATACTA ACGGCGATCT CTCCACGCTC GCTCAGCTTC TTTCTGGCTT TAGCGGTACG GCGGTTGGCG TGGTGAAAGG CGACTGGATG GCGCTGGTAC AGGCGGTCAA AAACGACTCC AGCTCGAACG TGCTCTCCAC GCCGAGTATC ACCACGCTGG ACAACCAGGA AGCCTTCTTC ATGGTGGGCC AGGACGTTCC GGTATTAACT GGATCTACCG TTGGCTCCAA TAACAGCAAT CCTTTCAATA CGGTAGAGAG GAAAAAAGTC GGCATCATGC TGAAAGTCAC GCCGCAGATT AACGAAGGTA ACGCGGTACA GATGGTGATT GAGCAGGAAG TCTCGAAAGT GGAAGGGCAG ACCAGCCTCG ATGTCGTGTT TGGCGAGCGC AAACTGAAAA CCACCGTGCT GGCTAACGAT GGTGAGCTGA TCGTGCTTGG CGGTCTGATG GACGACCAGG CGGGAGAAAG CGTGGCGAAA GTGCCGCTGC TGGGTGATAT CCCGTTGATT GGTAACCTGT TTAAATCGAC GGCGGATAAA AAAGAAAAAC GTAACCTGAT GGTGTTTATC CGCCCGACCA TTTTGCGTGA CGGTATGGCG GCAGACGGCG TGTCGCAGCG CAAATATAAC TACATGCGCG CTGAGCAAAT CTACCGCGAT GAGCAAGGCT TAAGCCTGAT GCCGCACACC GCGCAGCCGA TATTGCCAGC GCAAAATCAG GCCTTACCGC CGGAAGTACG CGCGTTCCTC AATGCCGGGA GAACGCGTTA A
|
Protein sequence | MFWRDMTLSV WRKKTTGLKT KKRLLPLVLA AALCSSPVWA EEATFTANFK DTDLKSFIET VGANLNKTII MGPGVQGKVS IRTMTPLNER QYYQLFLNLL EAQGYAVVPM ENDVLKVVKS SAAKVEPLPL VGEGSDNYAG DEMVTKVVPV RNVSVRELAP ILRQMIDSAG SGNVVNYDPS NVIMLTGRAS VVERLTEVIQ RVDHAGNRTE EVIPLDNASA SEIARVLESL TKNSGENQPA TLKSQIVADE RTNSVIVSGD PATRDKMRRL IRRLDSEMER RGNSQVFYLK YSKAEDLVDV LKQVSGTLTA AKEEAEGTVG SGREVVSIAA SKHSNALIVT APQDIMQSLQ SVIEQLDIRR AQVHVEALIV EVAEGSNINF GVQWGSKDAG LMQFANGTQI PIGTLGAAIS AAKPQKGSTV ISENGATTIN PDTNGDLSTL AQLLSGFSGT AVGVVKGDWM ALVQAVKNDS SSNVLSTPSI TTLDNQEAFF MVGQDVPVLT GSTVGSNNSN PFNTVERKKV GIMLKVTPQI NEGNAVQMVI EQEVSKVEGQ TSLDVVFGER KLKTTVLAND GELIVLGGLM DDQAGESVAK VPLLGDIPLI GNLFKSTADK KEKRNLMVFI RPTILRDGMA ADGVSQRKYN YMRAEQIYRD EQGLSLMPHT AQPILPAQNQ ALPPEVRAFL NAGRTR
|
| |