Gene Information |
Locus tag | Rleg2_2070 |
Symbol | |
ID | 6980809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2130090 |
End bp | 2131265 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643396792 |
Product | Alkanesulfonate monooxygenase |
Protein accession | YP_002281580 |
Protein GI | 209549663 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03565] alkanesulfonate monooxygenase, FMNH(2)-dependent |
|
|
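The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Protein Length excludes the terminal stop codon; neither convention is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 2130090
end_bp = 2131265

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon that is not counted as a residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # 1176 bp
print(protein_length, "aa")   # 391 aa
```

Both results agree with the Gene Length and Protein Length fields. The strand (here `-`) does not affect the length calculation.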
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCCA CATCCGATCC CATCAATTTC CTCTGGTTCA TCCCGACGTC GGGCGACGGC ACCTATCTCG GTTCCGCCGA TCTCAACCGC GCGCCCGAAA TCGGCTATCT CACGCAGATC GCCCAGGCCG TCGATCGTCT CGGCTATTCC GGCGTGCTGC TGCCGACCGG GGTTGCCTGC GAGGAGTCCT TCGTGATGGC GGCGGCGCTC GCCGCCAAGA CCGAGAAACT GCAGTTCCTG GTGGCGATCC GTCCCGGCAC GGCGTCACCG GCCTATTACG CGCGTCTGGC AACGACGCTC GACCGTATTT CCAACGGCCG CCTGCTGCTT AACATCGTCG TCGGCGGCAG CCCGGCCGAG CTTGCCGGTG ACGGCATCCA TCTCGAGCAT GACGAGCGTT ATGCCCATGC CGAGGAGTTT TTCACCGTTT TCGAGGAACT GCTGGATAAG GGAACGGCGA GTTTCGACGG CAAATATATC AAGGCGACCA ATGCGCGCCT CGGCTTTCCC TCGGTGCAGA ACCCGCGTCC GCCGCTCTAT TTCGGCGGCT CGTCGGATGC CGGCATCGAT TTCTCGGTCG GCCGCGTCGA CAAGTATCTG ACCTGGGGCG AGCCGCCGGC GCAGGTGGCG GAAAAGGTCG CCAAGGTGCG CAAGGCGGCG GCCGAGCGCG GCCGCGAGGT GAGCTTCGGC ATCCGCCTGC ACTTCATCGT GCGCGAAACC GACGAGGAGG CATGGGAGGC GGCGGAGCGG CTGATCCGTC ATCTCGACGA CGACACGATC CGCGAGGCGC AGGAGCGTTT CGTTCACGAG TCCGACTCGG TCGGCCAGAA GCGGATGGCC GCCCTTCACG GCGGCCGCCG TGACAAACTC GAGGTCTCGC CGAATCTTTG GGCCGGCGTC GGCCTGGTAC GCGCTGGTGC CGGCACGGCG CTTGTGGGCT CGCCCAAGGC GGTGGCCGCA CGCCTTGCCG AATATCAGGA CATCGGCATC GATACGGTGA TCGGCTCCGG CTATCCGCAC CTCGAAGAAG CCTATCGTGT CGCCGAACTG CTCTTCCCCG AACTCGGCAT CACTCGCGAG CAACAGCGCC TGGCCTTCAA CAACGAATTT GGCCGCAAGC AGGTTTTCGC AGGCGGCAGC CATGGCGGCA ATCTGAAGGT CGTTTCCGGT TCCTGA
|
Protein sequence | MTATSDPINF LWFIPTSGDG TYLGSADLNR APEIGYLTQI AQAVDRLGYS GVLLPTGVAC EESFVMAAAL AAKTEKLQFL VAIRPGTASP AYYARLATTL DRISNGRLLL NIVVGGSPAE LAGDGIHLEH DERYAHAEEF FTVFEELLDK GTASFDGKYI KATNARLGFP SVQNPRPPLY FGGSSDAGID FSVGRVDKYL TWGEPPAQVA EKVAKVRKAA AERGREVSFG IRLHFIVRET DEEAWEAAER LIRHLDDDTI REAQERFVHE SDSVGQKRMA ALHGGRRDKL EVSPNLWAGV GLVRAGAGTA LVGSPKAVAA RLAEYQDIGI DTVIGSGYPH LEEAYRVAEL LFPELGITRE QQRLAFNNEF GRKQVFAGGS HGGNLKVVSG S
|
| |
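The gene and protein sequences above are related by translation table 11, which uses the standard codon-to-amino-acid assignments. A minimal sketch of that translation, together with a GC-content helper, is given below. The function names (`clean`, `translate`, `gc_content`) are hypothetical, and the sketch does not handle alternative start codons such as GTG or TTG (this gene begins with ATG). The example uses only the first 30 bases of the gene sequence as printed.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments; it differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the spaces that separate 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()

def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)

# First three blocks of the gene sequence from the record.
fragment = "ATGACCGCCA CATCCGATCC CATCAATTTC"
print(translate(fragment))   # MTATSDPINF
print(gc_content(fragment))  # 0.5
```

The translated fragment matches the first block of the protein sequence. Because the gene sequence is already given in coding orientation (it starts with ATG even though the gene lies on the minus strand), no reverse complement is needed before translating.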