Gene ECH74115_B0004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_B0004 
SymbolgspE 
ID6966471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011350 
Strand
Start bp84177 
End bp85682 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content54% 
IMG OID643384020 
Productgeneral secretory pathway protein E 
Protein accessionYP_002268499 
Protein GI209395608 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02533] general secretory pathway protein E 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0631782 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGAG TAGTACAGAA TGTCAGTGAA TCACGTCCCC TTCTGCCTTT TTCCTTTTCC 
CGTACGCAGA GAATTTTGTT GCTGCGTGAA CAGGAGGGGA ATCGAGTGTT CTGCATGGAG
GACACACCTG CCAGTGCCCT GTTGGAGGTG CGCAGGGTGG CTGAAGGTCC TCTGAATGTC
ACAACAGTGT CGGCGGAAGC GTTTGAAAAA CAGCTGGTGA GCAGCTATCA ACGTGATTCT
GATGAAGCGC GACAAATGAT GGCTGAGATT GGTAATGAAA TGGATTTTTA TACCGTCGCA
GGGGAGTTGC CTGACCGTGA GGACTTACTG GATGCGAATG ACGATGCGCC AATTATCCGC
CTGATTAACG CAATGCTGAC AGAGGCAATT AAAGAGAAAG CCTCAGATAT TCATATTGAA
ACTTACGAAC GCCATCTGCA GGTTCGCTTT CGCATTGATG GCGTTCTGCG GGAGATCCTC
CGTTTACATA GGAATCTGGC TTCGTTACTG ATTTCACGCA TTAAAGTCAT GGCCCGGTTG
GATATTGCCG AAAAACGCGT TCCCCAAGAT GGCCGCATGG TACTGCGTAT CGGTGGTCGG
GCGGTGGATG TGCGTGTTTC AACGTTGCCT TCAAATCATG GTGAACGCAT CGTGCTGCGT
TTACTAGACA AAAATAGCGT TAGTCTCGAT CTTGCTGCAC TCGGCATGTC GCAGCAGAAT
CAGCGACACA TTGATGCACT GATCCGCCGT CCTCACGGCA TTATTTTGGT GACCGGGCCT
ACAGGGTCGG GTAAAAGCAC GACGCTTTAT GCCGCGTTAA GCCTGCTGAA TCCCCGAGAC
CGCAACATTA TGACGGTCGA GGATCCTGTT GAATATGAAC TGGACGGTAT CAGTCAGACC
CAGGTCAACC CGAAGGTGGA CATGACCTTT GCCCGGAGTC TGCGCGCTAT TTTACGTCAG
GATCCGGACG TGGTGCTTGT GGGGGAGATC CGTGACGGTG AAACAGCCCA GATTGCTGTG
CAGGCCTCGC TGACAGGGCA TTTGGTGCTG TCAACGTTAC ATACGAATAG TGCAGCAGGC
GCGCTGTCGC GTCTGCAGGA TATGGGCATC CCTCCTTTTC TGCTTTCCAC CTCATTACTG
GCTGTTCTGG CCCAGCGACT GGTTCGCACG TTGTGTCCGC GCTGTCGTCA GCCCTGTCAG
GTCAGTACTG AGTTAGCGAT GGACATGGAC ATTCCCCCTG AAACCACGAT CTGGCAGCCT
GCCGGGTGTC AGCACTGCAG TTTCACGGGC TATCACGGGC GCACCGGGAT CCACGAGCTG
TTGCTGATTG ACGATCGCAT TCGGACGGCA ATCTATCAGG GGGAGGGGGA GCTGGGTATT
ACCCGCTTGG CAGGGAGTCG CTATCTGACG CTGCGTGGCG ACGGGCGACA AAAGGTACTA
GCCGGTGAGA CCAGTTGGGA GGAGGTGGTT CGCGTTACTG AAAGCAGATT GCAGGAAGAG
GAATGA
 
Protein sequence
MSRVVQNVSE SRPLLPFSFS RTQRILLLRE QEGNRVFCME DTPASALLEV RRVAEGPLNV 
TTVSAEAFEK QLVSSYQRDS DEARQMMAEI GNEMDFYTVA GELPDREDLL DANDDAPIIR
LINAMLTEAI KEKASDIHIE TYERHLQVRF RIDGVLREIL RLHRNLASLL ISRIKVMARL
DIAEKRVPQD GRMVLRIGGR AVDVRVSTLP SNHGERIVLR LLDKNSVSLD LAALGMSQQN
QRHIDALIRR PHGIILVTGP TGSGKSTTLY AALSLLNPRD RNIMTVEDPV EYELDGISQT
QVNPKVDMTF ARSLRAILRQ DPDVVLVGEI RDGETAQIAV QASLTGHLVL STLHTNSAAG
ALSRLQDMGI PPFLLSTSLL AVLAQRLVRT LCPRCRQPCQ VSTELAMDMD IPPETTIWQP
AGCQHCSFTG YHGRTGIHEL LLIDDRIRTA IYQGEGELGI TRLAGSRYLT LRGDGRQKVL
AGETSWEEVV RVTESRLQEE E