Gene ECH74115_5355 details


Gene Information

Locus tag: ECH74115_5355
Symbol: rhaA
ID: 6972016
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4997796
End bp: 4999055
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 56%
IMG OID: 643389011
Product: L-rhamnose isomerase
Protein accession: YP_002273420
Protein GI: 209395912
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4806] L-rhamnose isomerase
TIGRFAM ID: [TIGR01748] L-rhamnose isomerase
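
The length fields in this record are consistent with the coordinates. The following is a minimal Python sketch of that arithmetic, assuming Start bp and End bp are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4997796, 4999055)
print(length, protein_length(length))  # 1260 419
```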


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACTC AACTGGAACA GGCCTGGGAG CTGGCGAAAC AGCGTTTCGC GGCAGTAGGG 
ATTGATGTCG AGGAGGCGCT GCGCCAACTT GATCGTTTAC CCGTTTCAAT GCACTGCTGG
CAGGGTGATG ATGTTTCCGG TTTTGAAAAC CCGGAAGGTT CGCTGACCGG AGGGATTCAG
GCTACTGGCA ATTATCCGGG CAAAGCGCGT AATGCCAGTG AGCTACGTGC CGATCTGGAA
CAGGCTATGC GGCTGATTCC GGGGCCAAAA CGGCTTAATT TACATGCCAT CTATCTGGAA
TCAGACACGC CAGTCTCGCG CGACCAGATC AAACCTGAGC ACTTCAAAAA CTGGGTTGAA
TGGGCGAAAG CCAATCAGCT CGGTCTGGAT TTTAACCCCT CCTGTTTCTC GCATCCGCTA
AGCGCCGATG GCTTTACGCT TTCCCATGCC GACGACAGCA TTCGCCAGTT CTGGATTGAT
CACTGCAAGG CCAGCCGCCG CGTTTCGGCC TATTTTGGTG AGCAACTCGG CACACCGTCG
GTGATGAACA TCTGGATCCC GGATGGCATG AAAGATATCA CCGTTGACCG TCTCGCTCCG
CGCCAGCGTC TGCTGGCAGC TCTGGATGAG GTGATCAGCG AGAAGCTGGA TCCGGCGCAC
CATATCGACG CCGTTGAGAG CAAATTGTTT GGCATTGGCG CAGAGAGCTA CACGGTTGGC
TCCAATGAGT TTTACATGGG GTATGCCACC AGCCGCCAGA CTGCGCTGTG CCTGGACGCC
GGGCATTTCC ACCCGACTGA AGTGATTTCC GACAAGATTT CCGCCGCCAT GCTGTATGTG
CCGCAGTTGC TGCTGCACGT CAGCCGTCCG GTTCGCTGGG ACAGCGATCA CGTAGTGCTG
CTGGATGATG AAACCCAGGC GATTGCCAGT GAGATTGTTC GTCACGATCT GTTTGACCGG
GTGCATATCG GCCTCGACTT CTTTGATGCC TCTATCAACC GTATTGCTGC GTGGATCATT
GGTACACGCA ATATGAAAAA AGCCCTGCTG CGTGCGTTGC TGGAACCTAC CGCTGAGCTG
CGCAAGCTGG AAGCGGCGGG CGATTACACT GCGCGTCTGG CACTGCTGGA AGAGCAGAAA
TCGTTGCCGT GGCAGGCGGT CTGGGAAATG TATTGCCAAC GTCACGATAC GCCAGCAGGT
AGCGAATGGC TGGAGAGCGT GCGGGCATAT GAGAAAGAAA CTTTGAGTCG CCGCGGGTAA
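
The GC content reported under Gene Information can be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as a string with the page's spaces and line breaks left in. `gc_fraction` is an illustrative helper name, and only the first row of the sequence above is used as sample input, so its result is not the whole-gene value.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    dna = "".join(sequence.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence, copied with its block spacing.
first_row = "ATGACCACTC AACTGGAACA GGCCTGGGAG CTGGCGAAAC AGCGTTTCGC GGCAGTAGGG"
print(f"{gc_fraction(first_row):.0%}")
```

Passing the full 1260 bp sequence in place of `first_row` gives the figure to compare with the GC content field.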
 
Protein sequence
MTTQLEQAWE LAKQRFAAVG IDVEEALRQL DRLPVSMHCW QGDDVSGFEN PEGSLTGGIQ 
ATGNYPGKAR NASELRADLE QAMRLIPGPK RLNLHAIYLE SDTPVSRDQI KPEHFKNWVE
WAKANQLGLD FNPSCFSHPL SADGFTLSHA DDSIRQFWID HCKASRRVSA YFGEQLGTPS
VMNIWIPDGM KDITVDRLAP RQRLLAALDE VISEKLDPAH HIDAVESKLF GIGAESYTVG
SNEFYMGYAT SRQTALCLDA GHFHPTEVIS DKISAAMLYV PQLLLHVSRP VRWDSDHVVL
LDDETQAIAS EIVRHDLFDR VHIGLDFFDA SINRIAAWII GTRNMKKALL RALLEPTAEL
RKLEAAGDYT ARLALLEEQK SLPWQAVWEM YCQRHDTPAG SEWLESVRAY EKETLSRRG
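
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The sketch below builds the codon table in the usual compact layout, with bases ordered T, C, A, G. Table 11 assigns the same amino acids to sense codons as the standard code and differs in which codons may act as start codons, which this sketch does not model. `translate` is an illustrative name, and only the first row of the gene sequence is used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a CDS, stopping at the first stop codon; unknown codons give 'X'."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_row = "ATGACCACTC AACTGGAACA GGCCTGGGAG CTGGCGAAAC AGCGTTTCGC GGCAGTAGGG"
print(translate(first_row))  # MTTQLEQAWELAKQRFAAVG
```

The output matches the first 20 residues of the protein sequence above.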