Gene ECH74115_5071 details


Gene Information

Locus tag: ECH74115_5071
Symbol:
ID: 6969853
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4718361
End bp: 4719398
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 34%
IMG OID: 643388750
Product: secretion system apparatus protein SsaU
Protein accession: YP_002273176
Protein GI: 209398003
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4792] Type III secretory pathway, component EscU
TIGRFAM ID: [TIGR01404] type III secretion protein, YscU/HrpY family
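
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues in a complete CDS: number of codons minus one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(4718361, 4719398)
print(length, protein_length_aa(length))  # 1038 345
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields.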


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.796147
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.0581283
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGAAA AAACAGAAAA GCCCACACCC AAAAAACTAA GGGATCTAAA AAAGAAGGGC 
GATGTAACAA AAAGTGAAGA GGTAATGGCT GCAGTGCAGT CATTAATCTT ATTTTCATTT
TTTTCTTTAT ATGGCATGAG TTTTTTTGTT GATATAGTTG GATTAGTTAA TACGACAATA
GACTCGCTAA ATAGACCGTT TTTGTATGCC ATTCGAGAAA TATTAGGTGC GGTATTAAAT
ATATTTTTAT TATATATTTT GCCAATTTCT TTGATTGTCT TTGTTGGAAC TGTTACGACT
GGTGTATCAC AAATAGGATT CATCTTTGCG GTTGAAAAAA TAAAACCATC GGCTCAGAAG
ATTAGTGTAA AAAATAACCT GAAAAATATT TTTTCTGTAA AGAGCATTTT TGAGCTACTT
AAATCAGTAT TTAAGTTAGT GATAATTGTT CTCATTTTTT ATTTTATGGG GCATTCATAT
GCAAATGAGT TTGCTAATTT CACAGGACTG AACGCATATC AAGCTCTTGT CGTTGTTGCC
TTTTTTGTTT TTCTTTTATG GAAAGGCGTG CTATTCGGAT ATCTACTCTT TTCAGTATTT
GATTTCTGGT TCCAGAAGCA TGAGGGACTG AAGAAAATGA AAATGAGTAA AGATGAGGTG
AAACGAGAAG CCAAGGATAC TGATGGTAAC CCTGAAATTA AAGGGGAGCG CCGTCGCCTT
CATTCCGAGA TACAAAGTGG AAGTTTGGCT AATAACATCA AAAAATCAAC CGTTATTGTT
AAAAACCCGA CTCACATTGC GATTTGCCTA TACTATAAAC TTGGGGAGAC TCCATTACCT
TTAGTTATTG AAACAGGAAA AGATGCCAAA GCTCTACAGA TCATTAAACT GGCTGAACTC
TATGATATTC CAGTGATTGA AGATATTCCT TTAGCAAGAA CTCTCTATAA GAATATACAT
AAAGGACAAT ATATAACAGA AGACTTTTTT GAACCTGTGG CACAATTGAT TCGTATTGCG
ATAGACCTTG ATTATTAA
 
Protein sequence
MSEKTEKPTP KKLRDLKKKG DVTKSEEVMA AVQSLILFSF FSLYGMSFFV DIVGLVNTTI 
DSLNRPFLYA IREILGAVLN IFLLYILPIS LIVFVGTVTT GVSQIGFIFA VEKIKPSAQK
ISVKNNLKNI FSVKSIFELL KSVFKLVIIV LIFYFMGHSY ANEFANFTGL NAYQALVVVA
FFVFLLWKGV LFGYLLFSVF DFWFQKHEGL KKMKMSKDEV KREAKDTDGN PEIKGERRRL
HSEIQSGSLA NNIKKSTVIV KNPTHIAICL YYKLGETPLP LVIETGKDAK ALQIIKLAEL
YDIPVIEDIP LARTLYKNIH KGQYITEDFF EPVAQLIRIA IDLDY
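
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of that cleanup plus a simple GC fraction, (G + C) divided by total length. The record does not state how its GC content value was calculated, so this definition is an assumption, and the function names are hypothetical.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence listing, copied from this record.
first_row = "ATGAGTGAAA AAACAGAAAA GCCCACACCC AAAAAACTAA GGGATCTAAA AAAGAAGGGC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Passing the full gene sequence listing through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field.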