Gene Information |
Locus tag | ECH74115_B0004 |
Symbol | gspE |
ID | 6966471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011350 |
Strand | + |
Start bp | 84177 |
End bp | 85682 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643384020 |
Product | general secretory pathway protein E |
Protein accession | YP_002268499 |
Protein GI | 209395608 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | [TIGR02533] general secretory pathway protein E |
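
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. Both are assumptions, although they agree with the values listed. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (locus tag ECH74115_B0004).
length_bp = gene_length(84177, 85682)
print(length_bp, protein_length(length_bp))  # 1506 501
```

With the listed coordinates this reproduces the 1506 bp gene length and the 501 aa protein length.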
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0631782 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGAG TAGTACAGAA TGTCAGTGAA TCACGTCCCC TTCTGCCTTT TTCCTTTTCC CGTACGCAGA GAATTTTGTT GCTGCGTGAA CAGGAGGGGA ATCGAGTGTT CTGCATGGAG GACACACCTG CCAGTGCCCT GTTGGAGGTG CGCAGGGTGG CTGAAGGTCC TCTGAATGTC ACAACAGTGT CGGCGGAAGC GTTTGAAAAA CAGCTGGTGA GCAGCTATCA ACGTGATTCT GATGAAGCGC GACAAATGAT GGCTGAGATT GGTAATGAAA TGGATTTTTA TACCGTCGCA GGGGAGTTGC CTGACCGTGA GGACTTACTG GATGCGAATG ACGATGCGCC AATTATCCGC CTGATTAACG CAATGCTGAC AGAGGCAATT AAAGAGAAAG CCTCAGATAT TCATATTGAA ACTTACGAAC GCCATCTGCA GGTTCGCTTT CGCATTGATG GCGTTCTGCG GGAGATCCTC CGTTTACATA GGAATCTGGC TTCGTTACTG ATTTCACGCA TTAAAGTCAT GGCCCGGTTG GATATTGCCG AAAAACGCGT TCCCCAAGAT GGCCGCATGG TACTGCGTAT CGGTGGTCGG GCGGTGGATG TGCGTGTTTC AACGTTGCCT TCAAATCATG GTGAACGCAT CGTGCTGCGT TTACTAGACA AAAATAGCGT TAGTCTCGAT CTTGCTGCAC TCGGCATGTC GCAGCAGAAT CAGCGACACA TTGATGCACT GATCCGCCGT CCTCACGGCA TTATTTTGGT GACCGGGCCT ACAGGGTCGG GTAAAAGCAC GACGCTTTAT GCCGCGTTAA GCCTGCTGAA TCCCCGAGAC CGCAACATTA TGACGGTCGA GGATCCTGTT GAATATGAAC TGGACGGTAT CAGTCAGACC CAGGTCAACC CGAAGGTGGA CATGACCTTT GCCCGGAGTC TGCGCGCTAT TTTACGTCAG GATCCGGACG TGGTGCTTGT GGGGGAGATC CGTGACGGTG AAACAGCCCA GATTGCTGTG CAGGCCTCGC TGACAGGGCA TTTGGTGCTG TCAACGTTAC ATACGAATAG TGCAGCAGGC GCGCTGTCGC GTCTGCAGGA TATGGGCATC CCTCCTTTTC TGCTTTCCAC CTCATTACTG GCTGTTCTGG CCCAGCGACT GGTTCGCACG TTGTGTCCGC GCTGTCGTCA GCCCTGTCAG GTCAGTACTG AGTTAGCGAT GGACATGGAC ATTCCCCCTG AAACCACGAT CTGGCAGCCT GCCGGGTGTC AGCACTGCAG TTTCACGGGC TATCACGGGC GCACCGGGAT CCACGAGCTG TTGCTGATTG ACGATCGCAT TCGGACGGCA ATCTATCAGG GGGAGGGGGA GCTGGGTATT ACCCGCTTGG CAGGGAGTCG CTATCTGACG CTGCGTGGCG ACGGGCGACA AAAGGTACTA GCCGGTGAGA CCAGTTGGGA GGAGGTGGTT CGCGTTACTG AAAGCAGATT GCAGGAAGAG GAATGA
|
Protein sequence | MSRVVQNVSE SRPLLPFSFS RTQRILLLRE QEGNRVFCME DTPASALLEV RRVAEGPLNV TTVSAEAFEK QLVSSYQRDS DEARQMMAEI GNEMDFYTVA GELPDREDLL DANDDAPIIR LINAMLTEAI KEKASDIHIE TYERHLQVRF RIDGVLREIL RLHRNLASLL ISRIKVMARL DIAEKRVPQD GRMVLRIGGR AVDVRVSTLP SNHGERIVLR LLDKNSVSLD LAALGMSQQN QRHIDALIRR PHGIILVTGP TGSGKSTTLY AALSLLNPRD RNIMTVEDPV EYELDGISQT QVNPKVDMTF ARSLRAILRQ DPDVVLVGEI RDGETAQIAV QASLTGHLVL STLHTNSAAG ALSRLQDMGI PPFLLSTSLL AVLAQRLVRT LCPRCRQPCQ VSTELAMDMD IPPETTIWQP AGCQHCSFTG YHGRTGIHEL LLIDDRIRTA IYQGEGELGI TRLAGSRYLT LRGDGRQKVL AGETSWEEVV RVTESRLQEE E
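
Both sequences above are printed in space-separated blocks of ten characters. The following is a rough sketch of how such a block-formatted gene sequence could be cleaned, checked for GC fraction, and translated. It assumes translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in permitted start codons, which are not modelled here) and that the sequence contains only A, C, G, and T. All names are hypothetical, and only the first 30 nt of the gene sequence are used as an excerpt.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on ambiguous bases
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
excerpt = "ATGAGCAGAG TAGTACAGAA TGTCAGTGAA"
print(translate(excerpt))  # MSRVVQNVSE
```

The translated excerpt matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field.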