Gene Information |
Locus tag | EcSMS35_3246 |
Symbol | gspE |
ID | 6145654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3316624 |
End bp | 3318117 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641618076 |
Product | general secretory pathway protein GspE |
Protein accession | YP_001745226 |
Protein GI | 170683342 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | [TIGR02533] general secretory pathway protein E |
|
|
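The coordinate and length fields above are tied together by simple arithmetic, which the minimal Python sketch below checks. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3316624
end_bp = 3318117

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons (remainder 0).
codon_count, remainder = divmod(gene_length_bp, 3)

# Assumption: the final codon is a stop codon and is not translated.
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 1494 0 497
```

Under these assumptions the computed values agree with the reported Gene Length (1494 bp) and Protein Length (497 aa).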
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.827332 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGCCTG TAGCACAGGA AACCACCGCT AACACCGTGC GTCTGCCCTA CAGTTTCAGC CGTCGGTTTA GCCTGGTGGC ATGGTGCGAA GCGTCGCTGG AGATCCTCCA TGTTCATCCG TTGTCGCTCT CTGTTTTGCA GGAGCTACAA CGGGGGCTGA ACGCGCCCTT TACGCTGCGG CAAATCGACG AGGCCGAATT TGAACAGCGG CTGAATGCGG TCTGGCAGCG GGACTCTTCC GAAGCTCGCC AGCTGATGGA AGATCTCGGT TCCGCCGAGG ACTTTTTTAC CCTCGCTGAA GAACTGCCGG AAACGGAAGA TCTGCTGGAA AGTGACGACG ATGCGCCGAT CATCAAACTG ATCAACGCCA TGCTGGCAGA GGCAATCAAA GAAGGCGCTT CGGATATCCA CATCGAGACG TTTGAAAAGA GTCTGGTGAT CCGTTTTCGT GTTGACGGCA CATTACATGA AATGTTGCGC CCCGGTCGTA AACTGGCCTC GCTGCTGGTC TCGCGTATCA AGGTGATGGC GCGGCTGGAT ATCGCCGAAA AGCGCGTACC GCAGGATGGC CGTATTGCGC TGCTGCTGGG CGGTCGGGCG ATTGACGTCC GTGTATCTAC CATGCCTTCC GCCTGGGGGG AACGGGTGGT GCTGCGACTG CTGGACAAAA ACCAGGCCCG CCTGACGCTG GAGCGTCTGG GGCTTAGCCA GCAACTGACC GCGCAGTTGC GCCAGCTGTT ACACAAACCG CACGGCATCT TTCTGGTGAC GGGGCCGACG GGTTCCGGCA AAAGCACCAC GCTGTACGCT GGATTGCAGG AGCTGAACAA CCACTCGCGT AACATTCTCA CGGTTGAAGA CCCTATCGAA TACATGATTG AAGGGATCGG TCAGACGCAG GTTAACACCC GCGTCGGCAT GACCTTCGCC CGTGGCCTGC GCGCGATTTT GCGTCAGGAC CCGGATGTGG TGATGGTCGG TGAAATCCGC GATACCGAAA CCGCAGAAAT CGCTGTTCAG GCTTCACTGA CCGGACACCT GGTACTTTCC ACCCTGCATA CCAACACAGC GGTGGGGGCG ATCACGCGTT TGCAGGATAT GGGCGTGGAG CCTTTCCTGC TCTCTTCCAG TTTGACGGGC GTGATGGCGC AGCGACTGGT TCGCACGCTG TGTCCCGATT GCCGCCAGTC CGCGCCTGCC ACCAACGAAG AAAAACGCCT GCTGGGGATT ACCGATGCGC ATGCCGTCAC GCTGTACCAT CCGCAGGGCT GCCCCGCCTG TAATCACAAA GGTTTTCGCG GACGTACTGC CATCCATGAG CTGATTGTGG TGGACGCCAC ATTGCGTGAT TTGATCCACC GTCAGGCCGG GGAACTGGAG CTGGAACGTT ATGTCCGGCA ACACTCTGCG GGTATCCGCA GCAACGGCAT TGAGAAAGTG CTCGCCGGAG AAACCTCTCT CGATGAAGTT CTGCGGGTAA CCATGGAGGC GTAA
|
Protein sequence | MVPVAQETTA NTVRLPYSFS RRFSLVAWCE ASLEILHVHP LSLSVLQELQ RGLNAPFTLR QIDEAEFEQR LNAVWQRDSS EARQLMEDLG SAEDFFTLAE ELPETEDLLE SDDDAPIIKL INAMLAEAIK EGASDIHIET FEKSLVIRFR VDGTLHEMLR PGRKLASLLV SRIKVMARLD IAEKRVPQDG RIALLLGGRA IDVRVSTMPS AWGERVVLRL LDKNQARLTL ERLGLSQQLT AQLRQLLHKP HGIFLVTGPT GSGKSTTLYA GLQELNNHSR NILTVEDPIE YMIEGIGQTQ VNTRVGMTFA RGLRAILRQD PDVVMVGEIR DTETAEIAVQ ASLTGHLVLS TLHTNTAVGA ITRLQDMGVE PFLLSSSLTG VMAQRLVRTL CPDCRQSAPA TNEEKRLLGI TDAHAVTLYH PQGCPACNHK GFRGRTAIHE LIVVDATLRD LIHRQAGELE LERYVRQHSA GIRSNGIEKV LAGETSLDEV LRVTMEA
|
| |
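The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of one way to do that. The helper names `clean_sequence`, `gc_fraction`, and `ends_with_stop` are hypothetical, and the sample strings are only the first two and last two blocks of the gene sequence, not the full record.

```python
# Stop codons of translation table 11 (the bacterial genetic code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """Check whether the last three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence, as displayed in the record.
head = clean_sequence("ATGGTGCCTG TAGCACAGGA")
# Last two blocks of the gene sequence, as displayed in the record.
tail = clean_sequence("CCATGGAGGC GTAA")

print(len(head), head.startswith("ATG"))  # prints: 20 True
print(gc_fraction(head))                  # GC fraction of these 20 bases only
print(ends_with_stop(tail))               # prints: True
```

Passing the complete Gene sequence field through `clean_sequence` and `gc_fraction` should let a reader compare the result with the reported Gene Length and GC content. Although the gene lies on the minus strand, the displayed sequence starts with ATG and ends with TAA, which suggests it is already given in coding orientation.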