Gene Information |
Locus tag | EcSMS35_4294 |
Symbol | rhaA |
ID | 6143792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4397151 |
End bp | 4398410 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641619115 |
Product | L-rhamnose isomerase |
Protein accession | YP_001746239 |
Protein GI | 170680183 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4806] L-rhamnose isomerase |
TIGRFAM ID | [TIGR01748] L-rhamnose isomerase |
|
|
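The coordinate and length fields in the record above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4397151
end_bp = 4398410

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Each codon is three bases; the last codon is the stop codon
# and does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1260 419
```

Both results match the Gene Length (1260 bp) and Protein Length (419 aa) fields.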
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.140419 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACTC AACTGGAACA GGCCTGGGAA CTGGCGAAAC AGCGTTTCGC GGCGGTGGGG ATTGATGTCG AGGAGGCGCT GCGCCAACTT GATCGTTTAC CCGTTTCAAT GCACTGCTGG CAGGGCGATG ATGTTTCCGG TTTTGAAAAC CCGGAAGGTT CGCTGACCGG TGGGATTCAG GCTACTGGTA ATTATCCGGG CAAAGCGCGT AATGCCAGCG AGCTACGTGC CGATCTGGAA CAAGCTATGC GGCTGATTCC GGGTCCGAAA CGGCTTAATT TACATGCCAT CTATCTGGAA TCAGACACGC CAGTCTCGCG CGACCAAATC AAACCAGAGC ACTTCAAAAA CTGGGTTGAA TGGGCGAAAG CCAATCAGCT CGGTCTGGAT TTTAACCCCT CCTGTTTTTC GCATCCGCTA AGCGCCGATG GCTTTACGCT TTCCCATGCC GACGACAGTA TTCGCCAGTT CTGGATTGAT CACTGCAAAG CCAGCCGCCG TGTTTCAGCC TATTTTGGCG AGCAACTTGG CACACCGTCG GTGATGAACA TCTGGATCCC GGATGGCATG AAAGATATCA CCGTTGACCG TCTCGCCCCA CGTCAGCGTC TGCTGGCAGC ACTGGATGAG GTGATCAGCG AGAAGCTGGA TCCGGCGCAC CATATCGACG CCGTTGAGAG CAAATTGTTT GGCATTGGCG CGGAGAGCTA CACGGTTGGC TCCAATGAGT TTTACATGGG GTATGCCACC AGCCGCCAGA CGGCGCTGTG CCTGGACGCC GGGCATTTCC ACCCGACTGA AGTGATTTCC GACAAGATTT CCGCCGCCAT GCTGTATGTG CCGCAGTTGC TGCTGCACGT CAGCCGTCCG GTTCGCTGGG ACAGCGATCA CGTAGTGCTG CTGGATGATG AAACCCAGGC GATTGCCAGT GAGATTGTTC GTCACGATCT GTTTGACCGG GTACATATCG GCCTGGATTT CTTTGATGCC TCTATCAACC GCATTGCCGC ATGGGTCATT GGTACGCGCA ATATGAAAAA AGCCCTGCTA CGTGCGTTGC TGGAACCTAC CACTGAACTG CGCAAGCTGG AAGCAGCGGG CGATTACACC GCACGACTGG CACTGCTGGA AGAACAAAAA TCATTGCCGT GGCAGGCGGT CTGGGAAATG TATTGCCAAC GTCACGATAC CCCAGTGGGT AGCGAATGGC TGGAAAGCGT GCGGGCTTAT GAGAAAGCGA TTTTGAGCCA GCGTGGGTAA
|
Protein sequence | MTTQLEQAWE LAKQRFAAVG IDVEEALRQL DRLPVSMHCW QGDDVSGFEN PEGSLTGGIQ ATGNYPGKAR NASELRADLE QAMRLIPGPK RLNLHAIYLE SDTPVSRDQI KPEHFKNWVE WAKANQLGLD FNPSCFSHPL SADGFTLSHA DDSIRQFWID HCKASRRVSA YFGEQLGTPS VMNIWIPDGM KDITVDRLAP RQRLLAALDE VISEKLDPAH HIDAVESKLF GIGAESYTVG SNEFYMGYAT SRQTALCLDA GHFHPTEVIS DKISAAMLYV PQLLLHVSRP VRWDSDHVVL LDDETQAIAS EIVRHDLFDR VHIGLDFFDA SINRIAAWVI GTRNMKKALL RALLEPTTEL RKLEAAGDYT ARLALLEEQK SLPWQAVWEM YCQRHDTPVG SEWLESVRAY EKAILSQRG
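The sequences above are printed in space-separated blocks of ten, so the spacing has to be stripped before any calculation. The following is a minimal sketch of two sanity checks one might run on the gene sequence: the GC percentage, and whether the sequence has the shape of a complete coding sequence. The function names `gc_percent` and `looks_like_cds` are hypothetical helpers written for this example, not part of any database tool, and the start codon check only covers ATG even though translation table 11 also permits alternative start codons.

```python
def _clean(seq: str) -> str:
    """Remove whitespace between blocks and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    bases = _clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


def looks_like_cds(seq: str) -> bool:
    """Check for a length divisible by 3, an ATG start, and a stop codon.

    Assumption: only ATG is accepted as a start codon here; table 11
    allows others, but the gene in this record begins with ATG.
    """
    bases = _clean(seq)
    stop_codons = {"TAA", "TAG", "TGA"}  # stop codons in table 11
    return (
        len(bases) % 3 == 0
        and bases.startswith("ATG")
        and bases[-3:] in stop_codons
    )


# Usage on a toy sequence written in the same blocked style.
toy = "ATGAAACCCG GGTAA"
print(gc_percent(toy), looks_like_cds(toy))
```

Passing the full Gene sequence field to `gc_percent` should give a value close to the GC content listed in the record, which is presumably rounded.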