Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_03738 |
Symbol | rhaA |
ID | 8113846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 4000736 |
End bp | 4001983 |
Gene Length | 1248 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 644849898 |
Product | hypothetical protein |
Protein accession | YP_003001471 |
Protein GI | 251787167 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4806] L-rhamnose isomerase |
TIGRFAM ID | [TIGR01748] L-rhamnose isomerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACTC AACTGGAACA GGCCTGGGAG CTGGCGAAAC AGCGTTTCGC GGCGGTGGGG ATTGATGTCG AGGAGGCGCT GCGCCAACTT GATCGTTTAC CCGTTTCAAT GCACTGCTGG CAGGGCGATG ATGTTTCCGG TTTTGAAAAC CCGGAAGGTT CGCTGACCGG TGGGATTCAG GCTACTGGCA ATTATCCGGG CAAAGCGCGT AATGCCAGTG AGCTGCGTGC CGATCTGGAA CAGGCTATGC GGCTGATTCC GGGGCCGAAA CGGCTTAATT TACATGCCAT CTATCTGGAA TCAGACACGC CAGTCTCGCG CGACCAGATC AAACCAGAGC ACTTCAAAAA CTGGGTTGAA TGGGCGAAAG CCAATCAGCT CGGTCTGGAT TTTAACCCCT CCTGTTTTTC GCATCCGCTA AGCGCCGATG GCTTTACGCT CTCGCATCCC GACGACAGCA TTCGCCAGTT CTGGATTGAT CACTGCAAGG CCAGCCGTCG CGTTTCGGCC TATTTTGGTG AGCAACTCGG CACACCGTCG GTGATGAACA TCTGGATCCC GGATGGCATG AAAGATATCA CCGTTGACCG TCTCGCTCCG CGTCAGCGTC TGCTGGCTGC TCTGGATGAG GTAATCAGCG AGAAGCTGGA TCCGGCGCAC CATATCGACG CTGTTGAGAG CAAATTGTTT GGCATTGGTG CAGAGAGCTA CACGGTTGGC TCCAATGAGT TTTACATGGG GTATGCCACC AGCCGCCAGA CTGCGCTGTG CCTGGACGCC GGGCACTTCC ACCCGACGGA AGTTATTTCC GACAAGATTT CCGCCGCCAT GCTGTATGTG CCGCAGTTGC TGCTGCACGT CAGCCGTCCG GTTCGCTGGG ACAGCGATCA CGTGGTGCTG CTGGATGATG AAACCCAGGC GATTGCCAGT GAGATTGTGC GTCACGATCT GTTCGACCGG GTGCATATCG GCCTTGACTT CTTCGATGCC TCTATCAACC GCATTGCCGC GTGGGTCATT GGTACACGCA ATATGAAAAA AGCCCTGCTG CGTGCGTTGC TGGAACCTAG CGCTGAGCTG CGCAAGCTGG AAGCGGCGGG CGATTACACT GCGCGTCTGG CACTGCTGGA AGAGCAGAAA TCGTTGCCGT GGCAGGCAGT ATGGGAAATG TATTGCCAAC GTCACGATAC CCCAGCGGGT AGCGAATGGC TGGAGAGCGT GCGGGCTTAT GAGAAAGCGA TTTTGAGC
|
Protein sequence | MTTQLEQAWE LAKQRFAAVG IDVEEALRQL DRLPVSMHCW QGDDVSGFEN PEGSLTGGIQ ATGNYPGKAR NASELRADLE QAMRLIPGPK RLNLHAIYLE SDTPVSRDQI KPEHFKNWVE WAKANQLGLD FNPSCFSHPL SADGFTLSHP DDSIRQFWID HCKASRRVSA YFGEQLGTPS VMNIWIPDGM KDITVDRLAP RQRLLAALDE VISEKLDPAH HIDAVESKLF GIGAESYTVG SNEFYMGYAT SRQTALCLDA GHFHPTEVIS DKISAAMLYV PQLLLHVSRP VRWDSDHVVL LDDETQAIAS EIVRHDLFDR VHIGLDFFDA SINRIAAWVI GTRNMKKALL RALLEPSAEL RKLEAAGDYT ARLALLEEQK SLPWQAVWEM YCQRHDTPAG SEWLESVRAY EKAILS
|
| |