Gene EcSMS35_4294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4294 
SymbolrhaA 
ID6143792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4397151 
End bp4398410 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content55% 
IMG OID641619115 
ProductL-rhamnose isomerase 
Protein accessionYP_001746239 
Protein GI170680183 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4806] L-rhamnose isomerase 
TIGRFAM ID[TIGR01748] L-rhamnose isomerase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.140419 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACTC AACTGGAACA GGCCTGGGAA CTGGCGAAAC AGCGTTTCGC GGCGGTGGGG 
ATTGATGTCG AGGAGGCGCT GCGCCAACTT GATCGTTTAC CCGTTTCAAT GCACTGCTGG
CAGGGCGATG ATGTTTCCGG TTTTGAAAAC CCGGAAGGTT CGCTGACCGG TGGGATTCAG
GCTACTGGTA ATTATCCGGG CAAAGCGCGT AATGCCAGCG AGCTACGTGC CGATCTGGAA
CAAGCTATGC GGCTGATTCC GGGTCCGAAA CGGCTTAATT TACATGCCAT CTATCTGGAA
TCAGACACGC CAGTCTCGCG CGACCAAATC AAACCAGAGC ACTTCAAAAA CTGGGTTGAA
TGGGCGAAAG CCAATCAGCT CGGTCTGGAT TTTAACCCCT CCTGTTTTTC GCATCCGCTA
AGCGCCGATG GCTTTACGCT TTCCCATGCC GACGACAGTA TTCGCCAGTT CTGGATTGAT
CACTGCAAAG CCAGCCGCCG TGTTTCAGCC TATTTTGGCG AGCAACTTGG CACACCGTCG
GTGATGAACA TCTGGATCCC GGATGGCATG AAAGATATCA CCGTTGACCG TCTCGCCCCA
CGTCAGCGTC TGCTGGCAGC ACTGGATGAG GTGATCAGCG AGAAGCTGGA TCCGGCGCAC
CATATCGACG CCGTTGAGAG CAAATTGTTT GGCATTGGCG CGGAGAGCTA CACGGTTGGC
TCCAATGAGT TTTACATGGG GTATGCCACC AGCCGCCAGA CGGCGCTGTG CCTGGACGCC
GGGCATTTCC ACCCGACTGA AGTGATTTCC GACAAGATTT CCGCCGCCAT GCTGTATGTG
CCGCAGTTGC TGCTGCACGT CAGCCGTCCG GTTCGCTGGG ACAGCGATCA CGTAGTGCTG
CTGGATGATG AAACCCAGGC GATTGCCAGT GAGATTGTTC GTCACGATCT GTTTGACCGG
GTACATATCG GCCTGGATTT CTTTGATGCC TCTATCAACC GCATTGCCGC ATGGGTCATT
GGTACGCGCA ATATGAAAAA AGCCCTGCTA CGTGCGTTGC TGGAACCTAC CACTGAACTG
CGCAAGCTGG AAGCAGCGGG CGATTACACC GCACGACTGG CACTGCTGGA AGAACAAAAA
TCATTGCCGT GGCAGGCGGT CTGGGAAATG TATTGCCAAC GTCACGATAC CCCAGTGGGT
AGCGAATGGC TGGAAAGCGT GCGGGCTTAT GAGAAAGCGA TTTTGAGCCA GCGTGGGTAA
 
Protein sequence
MTTQLEQAWE LAKQRFAAVG IDVEEALRQL DRLPVSMHCW QGDDVSGFEN PEGSLTGGIQ 
ATGNYPGKAR NASELRADLE QAMRLIPGPK RLNLHAIYLE SDTPVSRDQI KPEHFKNWVE
WAKANQLGLD FNPSCFSHPL SADGFTLSHA DDSIRQFWID HCKASRRVSA YFGEQLGTPS
VMNIWIPDGM KDITVDRLAP RQRLLAALDE VISEKLDPAH HIDAVESKLF GIGAESYTVG
SNEFYMGYAT SRQTALCLDA GHFHPTEVIS DKISAAMLYV PQLLLHVSRP VRWDSDHVVL
LDDETQAIAS EIVRHDLFDR VHIGLDFFDA SINRIAAWVI GTRNMKKALL RALLEPTTEL
RKLEAAGDYT ARLALLEEQK SLPWQAVWEM YCQRHDTPVG SEWLESVRAY EKAILSQRG