Gene ECH74115_5805 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5805 
Symbol 
ID6969043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5437955 
End bp5440891 
Gene Length2937 bp 
Protein Length978 aa 
Translation table11 
GC content39% 
IMG OID643389433 
Producthypothetical protein 
Protein accessionYP_002273825 
Protein GI209397351 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGAAA ATGAGAGCGG TTATTTGATT ACCTATTTCA GAAGGATATA TCAGGCGCTT 
CGTGGGCAAT CATCTCAATA TCCTGCCGCT AAAATTGCAC AAGAATTAGG CGATACGATT
CCTGTCACAA TGCAAAACGA ACTGGTCTAT GAAATAGCCG GTGAATTCTG CGAGACTCTC
TGCCGATTAC TTAGTGAGCA TCCTCCTCAT AGCAGTGATC CCGTTTCTGC ATTACGTAAG
CTCTCTCCAG ACTGGCATCT TCAACTTCCA CTCGTTCTCC CCGAAGCGAA TGCTGCAGAG
ATTGTAAGGC GATTACTTTC TCAATCTTCT GAAATACGCA GTGCAAGTAG TCTTCAGGTT
GAAAGAATCT GGGTCGATGT TGATGACAGT TGGTATTGCG ATGCCCGGTT CCGATTCCCG
GCCACAATGC GTACAGAACA GTTAACCTCT TTGTTTGAAT GCCATATTCA GCCGGAGCAG
ACCCGACTCA TCATTTCAGG AAAATGGAAA AATGGCGGGG CAAGGTTGGC AATGCTAAGC
CGTTATGAGC AGCAAGATTG GCGGGTCGAG TTATTACCTA TTGCGATGCA AAAACTCTCT
GGTGCAGATG CAATGGCCGA AATCTCATTG TCGCTTCATG AAGGTCCAAT TCTGTTAGGT
CACACAATTC CAAAAGGCGG TTATGAACTT ACTGAAGAAC TTCCCTGGGT TTTTGAAGCG
ATGAATGAGA GTGAATCGCA GCTAAAACTT GTTGGTATGG GGTCTGTGAG TTCCAGGCTA
AATGCTCTGT TTATTTCGCT ACCGAAAAAT AGCCATTTAG ATATTAGTGG AGAAGGTGAG
TTTGATATCC CAAGGTTGCT GAAAAACAGT GAACGCAGCT TAACTAAAAT AAGTGGAGTA
TTTAGCGTTG TTTTACATGA TGGTGCTGTT TGTACAATTC GCACCCAGCA ACTTTATGAT
TCTGCGATTG AATATTATAT TAAATCGACA GAAGTTGAAC TGGTTAAATC GGATTATCCT
GTCCATCGAG CCTGGCCGAA GATCGGTTGG AAAAAGGATC TGCAATATGG CATTGTACCG
GAGAAAGAAC TTTTTTGGCG TTCCATTCGT TCAGGTAATA ATGCCTGGTA TTCTGTCGCA
TCAGAAATGC CAAAAGGACA GATAGAAGTC CGGCGAATAG TTAATGATGA GGTTTTATTT
AGCGGTAAAG TTGTAGTTTT ACCAGCAGAC TTCGATATTA ATATTATACC TGAGAGTGCT
CAGCAAGGCA TTATTATGCT TTCTGGTATA ACGGATACTA GAATTGATAA ATATTCAAAC
AATGAAAAAG TAACACTTAA GTCCGATTAT TCACAAAACG AGTGTGCTAT TTATTATAAT
TCATCGCTGA TGCTGGAAAA TACCGTTGAT TTACGGGTCT CCTGGAAAGA TGGTTCTAAT
CTTAAACTAT TATTACCTAA GCCGGTTAGC GGTGGCCGAT TTGTAACTAA TGATGGTTCT
GTTCATTTTG ATGGTGTGGC ATCTATAGCA CATTTGCATG GAATAGATGC TGAGTTATTA
ACCATATCAT GTGCTGGAAG AGGATATCTT AATATCGAGT TATTGGATGA AAATCCAGTA
GCTGAAAAAT TCCGCTATTT ACATGCCGAC CTTCCTCTTT TATCAGGACG CAATGACAAA
TTACAACAAA TTTCACTTTA TGAGAACTAT AATTTGCTAA ATGCTATGCT GGCATGTGCA
TGGAACAGCA ATAGTACATT ATGTGTTGAT TTTTACTCTG ACAGATTTGG AAAGGATAAA
GCAACACTCA ATATTAAACG CTACGATGGT AGTTTTATTG AACACGATCA AGGGTTACTG
GTTGATATAA AAAATTCTGT TGTTTTTCCT GCAAATAGAA TAGACGAACT GGTTGTTGAT
GCTATTTCTC TTAAGAATCC TGGCTTACAC ATATCATTGT TAAAAAAAGA TGAGTTTGCT
TATGATCTTT CAGCTCTGAA TGTTCAGGAT AGTCCATGGT TAATTGTGGG AAAACTTGAT
GGTACAGCTC GCATTGCACC AGTAATTAAA TGGATGCTAC CTGTATTGCA GACAAATGAT
TTATTACTAA ATGCTCTATG CGAAGCAGAC CCAGAACAGC GCAAAAAAAA TTTTAATGAG
CTAATTTTTG AAATAGATAA CAACCCATTG CAAAATTATT GCTGTTTATT AACAGAGTAT
ATTAAGAAAT ACAAAATGAA TAATGGCTTA TCCTTGCTGG ATCTGGACTT GTTCAGAGGT
ATTTCGAGTA ATTACCGCGT GGTTGTTCAA TTGTTAATAT CATCATGTCT TTCTGGTGAT
AGCGATACGA TTTATGATAT ACAGGAAGAA TTACCCTTTT CATGGGGATG GATTCCCGTT
TCAATCTGGA AAGATGTTTT CCAAAAATGT TGGACTTATC TGGAGAAACA GATTAACGAT
AAAACATTAG CATTACATAT ATTGCAACCC TTTATTGCTT TTATGAACCA TCGTGCACAT
ATCGATCGTC GGCTGGCTCC GATTGCGAAT ATGTTACTTA CATATAGTGA GAGCCTACCA
ACCGGTTGTG ATGTATTGCC AACTGTTAGT CGTGAGCAGT TTAATGAAGC TAAACAGATG
CTATTAAGGA ACCCCGACAG CTTTGGGCGT ATCAGTATCT TCCCTAAAGA ACTTTGGTCT
AGTGCTATTA CTCCAGAGTT AAAATCTGTT TTTAATAAGC TTTGGATTGA AGATAAATAT
CACTCACGGC TTGAAAAACG TTTTAATTTG ATGTTAGTCG CAGCGCTGTT AACCCAAAAA
GATAATAACC TGATACATCA ACTGTCTGCG CTTTTTGAAT TTCACTATCA GCAAGCCCCA
CAGCAATTAG GGGTAATCTA TCAATATTAT TTTGAACAAG CAGGAGTATG TCATTGA
 
Protein sequence
MIENESGYLI TYFRRIYQAL RGQSSQYPAA KIAQELGDTI PVTMQNELVY EIAGEFCETL 
CRLLSEHPPH SSDPVSALRK LSPDWHLQLP LVLPEANAAE IVRRLLSQSS EIRSASSLQV
ERIWVDVDDS WYCDARFRFP ATMRTEQLTS LFECHIQPEQ TRLIISGKWK NGGARLAMLS
RYEQQDWRVE LLPIAMQKLS GADAMAEISL SLHEGPILLG HTIPKGGYEL TEELPWVFEA
MNESESQLKL VGMGSVSSRL NALFISLPKN SHLDISGEGE FDIPRLLKNS ERSLTKISGV
FSVVLHDGAV CTIRTQQLYD SAIEYYIKST EVELVKSDYP VHRAWPKIGW KKDLQYGIVP
EKELFWRSIR SGNNAWYSVA SEMPKGQIEV RRIVNDEVLF SGKVVVLPAD FDINIIPESA
QQGIIMLSGI TDTRIDKYSN NEKVTLKSDY SQNECAIYYN SSLMLENTVD LRVSWKDGSN
LKLLLPKPVS GGRFVTNDGS VHFDGVASIA HLHGIDAELL TISCAGRGYL NIELLDENPV
AEKFRYLHAD LPLLSGRNDK LQQISLYENY NLLNAMLACA WNSNSTLCVD FYSDRFGKDK
ATLNIKRYDG SFIEHDQGLL VDIKNSVVFP ANRIDELVVD AISLKNPGLH ISLLKKDEFA
YDLSALNVQD SPWLIVGKLD GTARIAPVIK WMLPVLQTND LLLNALCEAD PEQRKKNFNE
LIFEIDNNPL QNYCCLLTEY IKKYKMNNGL SLLDLDLFRG ISSNYRVVVQ LLISSCLSGD
SDTIYDIQEE LPFSWGWIPV SIWKDVFQKC WTYLEKQIND KTLALHILQP FIAFMNHRAH
IDRRLAPIAN MLLTYSESLP TGCDVLPTVS REQFNEAKQM LLRNPDSFGR ISIFPKELWS
SAITPELKSV FNKLWIEDKY HSRLEKRFNL MLVAALLTQK DNNLIHQLSA LFEFHYQQAP
QQLGVIYQYY FEQAGVCH