Gene EcHS_A4132 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4132 
SymbolrhaA 
ID5592695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4122732 
End bp4123991 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content56% 
IMG OID640923234 
ProductL-rhamnose isomerase 
Protein accessionYP_001460693 
Protein GI157163375 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4806] L-rhamnose isomerase 
TIGRFAM ID[TIGR01748] L-rhamnose isomerase 


Plasmid Coverage information

Num covering plasmid clones56 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACTC AACTGGAACA GGCCTGGGAG CTGGCGAAAC AGCGTTTCGC GGCAGTAGGG 
ATTGATGTCG AGGAGGCGCT GCGCCAACTT GATCGTTTAC CCGTTTCAAT GCACTGCTGG
CAGGGCGATG ATGTTTCCGG TTTTGAAAAC CCGGAAGGTT CGCTGACCGG TGGGATTCAG
GCTACTGGTA ATTATCCGGG CAAAGCGCGT AATGCCAGTG AGCTACGAGC CGATCTGGAA
CAAGCTATGC GGCTGATTCC GGGGCCGAAA CGGCTTAATT TACATGCCAT CTATCTGGAA
TCAGACACGC CTGTCTCGCG CGATCAGATC AAACCAGAGC ACTTCAAAAA CTGGGTTGAA
TGGGCGAAAG CCAATCAGCT CGGTCTGGAT TTTAACCCCT CCTGTTTTTC GCATCCGCTA
AGCGCCGATG GCTTTACGCT TTCCCATGCC GACGACAGCA TTCGCCAGTT CTGGATTGAT
CACTGCAAAG CTAGCCGTCG CGTTTCGGCC TATTTTGGCG AGCAACTCGG CACACCATCG
GTGATGAACA TCTGGATCCC GGATGGTATG AAAGATATCA CCGTTGACCG TCTCGCCCCG
CGTCAGCGTC TGCTGGCAGC ACTGGATGAG GTGATCAGCG AGAAGCTAAA CCCTGCGCAC
CATATCGACG CCGTTGAGAG CAAATTGTTT GGCATTGGCG CGGAGAGCTA CACGGTTGGC
TCCAATGAGT TTTACCTGGG GTATGCCACC AGCCGCCAGA CGGCGCTGTG CCTGGACGCC
GGGCATTTCC ACCCGACTGA AGTGATTTCC GACAAGATTT CCGCCGCCAT GCTGTATGTG
CCGCAGTTGC TGCTGCACGT CAGCCGTCCG GTTCGCTGGG ACAGCGATCA CGTAGTTCTG
CTGGATGATG AAACCCAGGC AATTGCCAGT GAGATTGTGC GTCACGATCT GTTTGACCGG
GTGCATATCG GCCTTGACTT CTTCGATGCC TCTATCAACC GCATTGCCGC GTGGGTCATT
GGTACACGCA ATATGAAAAA AGCCCTGCTG CGTGCGTTGC TGGAACCTAC CGCTGAGCTG
CGCAAGCTGG AAGCGGCGGG CGATTACACT GCGCGTCTGG CACTGCTGGA AGAGCAGAAA
TCGTTGCCGT GGCAGGCGGT CTGGGAAATG TATTGCCAAC GTCACGATAC GCCAGCAGGT
AGCGAATGGC TGGAGAGCGT GCGGGCTTAT GAGAAAGAAA TTTTGAGTCG CCGCGGGTAA
 
Protein sequence
MTTQLEQAWE LAKQRFAAVG IDVEEALRQL DRLPVSMHCW QGDDVSGFEN PEGSLTGGIQ 
ATGNYPGKAR NASELRADLE QAMRLIPGPK RLNLHAIYLE SDTPVSRDQI KPEHFKNWVE
WAKANQLGLD FNPSCFSHPL SADGFTLSHA DDSIRQFWID HCKASRRVSA YFGEQLGTPS
VMNIWIPDGM KDITVDRLAP RQRLLAALDE VISEKLNPAH HIDAVESKLF GIGAESYTVG
SNEFYLGYAT SRQTALCLDA GHFHPTEVIS DKISAAMLYV PQLLLHVSRP VRWDSDHVVL
LDDETQAIAS EIVRHDLFDR VHIGLDFFDA SINRIAAWVI GTRNMKKALL RALLEPTAEL
RKLEAAGDYT ARLALLEEQK SLPWQAVWEM YCQRHDTPAG SEWLESVRAY EKEILSRRG