Gene ECH74115_4137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4137 
Symbol 
ID6967744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3827232 
End bp3828353 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content33% 
IMG OID643387889 
Productsurface presentation of antigens protein SpaS 
Protein accessionYP_002272329 
Protein GI209399671 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1377] Flagellar biosynthesis pathway, component FlhB 
TIGRFAM ID[TIGR01404] type III secretion protein, YscU/HrpY family 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.000465227 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAAATA AAACTGAAAA GCCAACACAA AAGAAACTGC AGGATGCTTC TAAAAAAGGG 
CAAATTCTAA AAAGTAGAGA CTTAACCGTT TCTGTAATAA TGCTTGTGGG TACTTTATAT
CTCGGATATG TTTTTGATGT GCATCACATT ATGTCGATTC TTGAATATAT CCTTGATCAT
AACGCTAAAC CGGATATTTG GGACTATTTT AAAGCTATGG GGATTGGTTG GTTGAAAACG
ATCATTCCTT TTTTGCTGGT TTGCATGTTC ACAACAATAC TTGTCTCCTG GTTTCAAAGT
AAAATGCAAT TAGCAACTGA AGCTGTAAAA TTAAAGTTTG ATTCATTGAA TCCAGTAAAT
GGTTTAAAGC GTATATTTGG CTTAAAAACC GTAAAAGAAT TTGTTAAAGC AATTCTTTAT
ATTATTTTTT TTGCATTGGA GATCAAAGTA TTTTGGAGTA ATCATAAATC ACTGCTTTTT
AAAACTCTTG ATGGAGATAT CATATCTTTA TTATCAGATT GGGGAGAGAT GCTATTCCTT
CTCATACTGT ATTGTCTCGG CAGTATGATA ATTGTCTTAA TTTTTGATTT TATCGCTGAA
TATTTTTTAT TTATGAAAGA TATGAAAATG GATAAACAAG AAGTTAAAAG AGAATACAAG
GAACAAGAAG GAAATCCTGA AATTAAGTCT AAACGCAGAG AGCGCCATCA GGAAATTCTT
TCTGAGCAAT TGAAATCTGA TGTCAGTAAT AGCCGTTTGA TGATTGCCAA CCCTACTCAC
ATTGCAATAG GGATATACTT TAAGCCACAT CTGTCACCTA TTCCATTGAT TTCTGTAAGA
GAAACTAATG AGGTAGCATT AGCTGTAAGG AAATATGCAA AGGAAATCGG GATACCAATT
ATTACAGATA AAAAATTAGC ACGAAAAATT TATGCTACCC ATCGTCGCTA CGATTATGTT
AGCTTCGAAA ATATAGATGA AATATTACGT CTTCTGCTGT GGCTTGAAGA TGTGGAGAAT
GCTGGACAAC CTGTTCCAGA TGAACAGCTC TCTTCAGAAG ATAAATATAT TGAGGGTGAA
GACACAAAAA GCGAGAATAA TGACAATAAT TTAAAAAATT AA
 
Protein sequence
MANKTEKPTQ KKLQDASKKG QILKSRDLTV SVIMLVGTLY LGYVFDVHHI MSILEYILDH 
NAKPDIWDYF KAMGIGWLKT IIPFLLVCMF TTILVSWFQS KMQLATEAVK LKFDSLNPVN
GLKRIFGLKT VKEFVKAILY IIFFALEIKV FWSNHKSLLF KTLDGDIISL LSDWGEMLFL
LILYCLGSMI IVLIFDFIAE YFLFMKDMKM DKQEVKREYK EQEGNPEIKS KRRERHQEIL
SEQLKSDVSN SRLMIANPTH IAIGIYFKPH LSPIPLISVR ETNEVALAVR KYAKEIGIPI
ITDKKLARKI YATHRRYDYV SFENIDEILR LLLWLEDVEN AGQPVPDEQL SSEDKYIEGE
DTKSENNDNN LKN