Gene Information |
Locus tag | Rleg_0174 |
Symbol | |
ID | 8011404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 171262 |
End bp | 172221 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644822766 |
Product | 2-nitropropane dioxygenase NPD |
Protein accession | YP_002974024 |
Protein GI | 241202928 |
COG category | [R] General function prediction only |
COG ID | [COG2070] Dioxygenases related to 2-nitropropane dioxygenase |
TIGRFAM ID | |
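
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS length counts the stop codon; the function names are hypothetical and are not part of the database record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes the stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Rleg_0174.
length_bp = gene_length(171262, 172221)
print(length_bp)                  # 960
print(protein_length(length_bp))  # 319
```

Under these assumptions the computed values agree with the Gene Length (960 bp) and Protein Length (319 aa) fields listed in the record.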
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.111735 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCTGC CCCCGATCCT CAAGGACAGA TTGAGACTGC CGGTCATCGG TTCGCCGCTG TTCATCATTT CGCATCCAGC GCTGACGCTG GCGCAATGCA AGGCCGGCAT CGTCGGAGCG TTTCCGGCGC TGAACGTCCG GCCGGAAAGC CAGCTCGACG AATGGCTGGC GGAGATTACC GAGGAGCTCG CCCGCCACGA CGCCGCCCAT CCGGAGCGGC CGGCCGCCCC CTTTGCCGTC AACCAGATCG TCCACATGTC GAACAAGCGG CTGGAGCACG ACCTCTCGCT CTGCGTCAAA TACAAGGTGC CGATCGTCAT CTCCTCGCTC GGCGCCGTGC CCGAAGTCAA CGCCGCCGTG CACTCCTATG GCGGCATCGT GCTGCACGAC ATTATCAATA ACCGCCACGC CCATTCGGCG ATCCGCAAGG GTGCGGACGG GCTGATCGCG GTGGCGGCCG GCGCCGGCGG CCATGCCGGG ACGTTGTCGC CCTTTGCGCT CGTCCAGGAA ATCCGTGAAT GGTTCGACGG GCCGCTGCTG CTGGCAGGCG CCATCGCCAC CGGCGGCGCT ATTCTCGCCG CAGAAGCGAT GGGCGCCGAC ATGGCCTATA TCGGCTCGCC CTTCATCGCC ACCGAGGAAG CCCGCGCCGC AGCCGCCTAC AAGCAGGCGA TCGTCGAAGG GGCCGCCAGC GATATCGTCT ATTCCAACTA TTTCACCGGC GTGCACGGCA ACTATCTCAA GCCCTCGATC CTCGCCGCCG GCATGGACCC CGACAACCTG CCGCTTGCCG ATGTCTCGAA GATGGATTTC GAGCAGGCTG TCGGCGGCGC CAAGGCCTGG AAGGACATAT GGGGCAGCGG CCAGGGCATT AGCGCCGTCA AGGCCGTCGA GCCGGTGGCA AAACTCGTCG ACCGGTTGGA GGCCGAATAC AGGGCGGCGC GCACCCGGCT GGCGCTCTGA
|
Protein sequence | MALPPILKDR LRLPVIGSPL FIISHPALTL AQCKAGIVGA FPALNVRPES QLDEWLAEIT EELARHDAAH PERPAAPFAV NQIVHMSNKR LEHDLSLCVK YKVPIVISSL GAVPEVNAAV HSYGGIVLHD IINNRHAHSA IRKGADGLIA VAAGAGGHAG TLSPFALVQE IREWFDGPLL LAGAIATGGA ILAAEAMGAD MAYIGSPFIA TEEARAAAAY KQAIVEGAAS DIVYSNYFTG VHGNYLKPSI LAAGMDPDNL PLADVSKMDF EQAVGGAKAW KDIWGSGQGI SAVKAVEPVA KLVDRLEAEY RAARTRLAL
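
The gene and protein sequences above can be related with a short script. This is a sketch under stated assumptions: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the codon assignments are those of the standard genetic code (which translation table 11 shares for amino-acid assignments), and alternative start codons permitted by table 11 are not handled because this gene begins with ATG. Only the first and last few bases of the gene sequence are used here for brevity.

```python
from itertools import product

# Codon table built in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in reading frame 1; trailing bases are ignored."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 18 bases and last 9 bases of the gene sequence shown above.
print(translate("ATGGCCCTGC CCCCGATC"))  # MALPPI
print(translate("GCGCTCTGA"))            # AL*
```

Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.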