Gene ECH74115_5803 details


Gene Information

Locus tag: ECH74115_5803
Symbol:
ID: 6967837
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 5435668
End bp: 5437320
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 38%
IMG OID: 643389431
Product: hypothetical protein
Protein accession: YP_002273823
Protein GI: 209396327
COG category:
COG ID:
TIGRFAM ID:
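
The start, end, gene length, and protein length fields in this record agree with each other if the coordinates are 1-based and inclusive and the terminal stop codon is not counted in the protein length. Both of those are assumptions about the record, not something it states. A minimal Python sketch of that arithmetic follows; the function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes an unspliced CDS whose length is a multiple of 3 and, by default,
    one terminal stop codon that is not counted as a residue.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(5435668, 5437320)
print(length)                     # 1653
print(protein_length_aa(length))  # 550
```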


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.485501
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAGCGC AGCTTTTTGA GCAGTTGTTT CAATCGATAG ACTCTACACT GATCACCAAT 
ATTTTCATCT GGGCTGTTAT ATTCGTATTT TTATCAGCGT GGTGGTGTGA CAAAAAAAAT
ATACATAGTA AGTTTAGAGA ATATGCTCCA ACCTTAATGG GGGCATTAGG TATTCTGGGT
ACTTTCATTG GTATTATTAT TGGTTTACTC AATTTTAATA CCGAAAGTAT TGATACCAGC
ATCCCCGTAT TATTAGGTGG CCTAAAAACA GCATTCATTA CAAGCATTGT AGGTATGTTT
TTTGCCATTT TATTTAATGG AATGGATGCT TTCTTTTTTG CCAATAAACG AAGTGCGTTA
GCAGAAAATA ACCCTGAATC TGTTACACCT GAACATATCT ATCATGAATT AAAAGAGCAG
AACCAGACTC TGACTAAATT AGTCTCGGGT ATTAACGGTG ATAGTGAAGG TTCTCTTATT
GCTCAAATAA AATTACTACG TACTGAGATT AGCGATTCCT CGCAGGCACA ATTAGCTAAT
CACACTCATT TCAGTAATAA GCTTTGGGAA CAACTTGAAC AATTTGCAGA TCTAATGGCA
AAAGGTGCTA CAGAACAAAT TATTGATGCT TTGCGACAAG TCATTATTGA TTTTAATCAA
AATTTAACTG AACAGTTTGG TGAAAACTTT AAAGCTCTTG ATGCCTCTGT AAAAAAACTT
GTTGAGTGGC AGGGAAATTA TAAAACGCAA ATTGAGCAGA TGTCAGAACA ATATCAACAA
AGTGTCGAGT CCCTGGTTGA AACAAAAACT GCGGTTGCAG GGATTTGGGA AGAATGTAAA
GAAATTCCTC TGGCTATGTC TGAACTGCGT GAAGTGCTTC AGGTGAACCA ACATCAAATC
AGCGAACTCT CCCGCCATTT AGAAACCTTT GTCGCCATCC GCGATAAAGC TACAACCGTA
TTACCTGAAA TACAGAACAA AATGGCTGAA GTGGGTGAAC TGCTGAAATC CGGAGCTGCA
AATGTTAGTG CATCTCTTGA GCAAACCAGC CAGCAAATAC TTCTTAATGC AGATTCAATG
CGCGTTGCCC TGGATGAAGG TACCGAAGGA TTCAGACAAT CGGTTACCCA AACACAACAA
GCATTTGCCT CGATGGCACA TGATGTCAGC AATTCCTCCG AAACCCTAAC CAGCACGTTA
GGTGAAACAA TTACTGAAAT GAAACAAAGT GGTGAAGAAT TCCTGAAATC ACTAGAGTCG
CACTCGAAAG AATTGCATAG AAATATGGAA CAAAATACGA CGAATGTGAT TGATATGTTC
AGTAAGACTG GTGAAAAGAT TAACCATCAA CTATCCAGTA ATGCCGATAA TATGTTTGAT
TCAATCCAGA CATCATTTGA TAAGGCAAGT GCAGGGCTGA CTTCTCAAGT CAGAGAATCA
ATTGAAAAAT TTGCTCTATC CATCAACGAG CAGTTACATG CTTTTGAGCA AGCAACTGAA
CGTGAAATGA ACCGTGAAAT GCAATCATTA GGTAATGCTC TGCTTTCAAT CAGCAAAGGT
TTTGTCGGTA ACTATGAAAA ACTTATTAAA GATTACCAAA TAGTTATGGG GCAGTTACAA
GCATTAATTT CTGCTAATAA ACATCGCGGG TAA
 
Protein sequence
MLAQLFEQLF QSIDSTLITN IFIWAVIFVF LSAWWCDKKN IHSKFREYAP TLMGALGILG 
TFIGIIIGLL NFNTESIDTS IPVLLGGLKT AFITSIVGMF FAILFNGMDA FFFANKRSAL
AENNPESVTP EHIYHELKEQ NQTLTKLVSG INGDSEGSLI AQIKLLRTEI SDSSQAQLAN
HTHFSNKLWE QLEQFADLMA KGATEQIIDA LRQVIIDFNQ NLTEQFGENF KALDASVKKL
VEWQGNYKTQ IEQMSEQYQQ SVESLVETKT AVAGIWEECK EIPLAMSELR EVLQVNQHQI
SELSRHLETF VAIRDKATTV LPEIQNKMAE VGELLKSGAA NVSASLEQTS QQILLNADSM
RVALDEGTEG FRQSVTQTQQ AFASMAHDVS NSSETLTSTL GETITEMKQS GEEFLKSLES
HSKELHRNME QNTTNVIDMF SKTGEKINHQ LSSNADNMFD SIQTSFDKAS AGLTSQVRES
IEKFALSINE QLHAFEQATE REMNREMQSL GNALLSISKG FVGNYEKLIK DYQIVMGQLQ
ALISANKHRG
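
The gene and protein sequences above are printed in blocks of 10 characters, so the whitespace has to be removed before any calculation. The Python sketch below shows one way to do that, compute a GC fraction, and translate the coding sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in permitted start codons). The helper names are illustrative, and the codon table is built by hand, not taken from a bioinformatics library.

```python
from itertools import product

# Amino acids in the conventional TCAG codon order; these assignments are
# shared by the standard code and translation table 11.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean_sequence(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean_sequence(seq)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    dna = clean_sequence(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence, as printed in the record.
print(translate("ATGTTAGCGC AGCTTTTTGA G"))  # MLAQLFE
```

Applied to the full gene sequence, `translate` can be compared against the listed protein sequence, and `gc_fraction` against the GC content field.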