Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5952 |
Symbol | |
ID | 6977338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | + |
Start bp | 365679 |
End bp | 367574 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643393404 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_002278222 |
Protein GI | 209546332 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism [R] General function prediction only |
COG ID | [COG1082] Sugar phosphate isomerases/epimerases [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0955687 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00791612 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAGACCT CGATTGCGAC TGTGACGATC AGCGGCGAAC TGCCTGAGAA GCTCGAGGCG ATCGCCCGGG CGGGGTTCGA TGGCGTCGAG ATCTTCGAAA ACGATTTCCT GGCTTTCGAC GGCAGCCCGG CCGATGTCGG AAAACTGGTG CGCGACCATG GCCTTGAGAT CACGCTGTTT CAGCCGTTTC GGGATTTCGA GGGCATGCCG GAGCCGCTGC GCAGCCGCAC CTTCGATCGG GCGGAACGGA AGTTCGACGT GATGCAGCAG CTTGGAACAG ATCTGGTGCT GGTCTGCTCG AACATCTCGC CGGCCGCCAT CGGCGGCATC GACAGGGCGG CTGAGGATTT CCGCGAACTT GGCGAGCGCG CCGCCCGGCG CGGACTGAGG GTGGGTTACG AGGCGCTCGC CTGGGGTCGC CATATCAGTG ATCACCGCGA CGCCTGGGAA ATCGTTCGGC GCGCCGATCA TCCGAATGTC GGCCTCATCC TCGACAGCTT CCACACTCTG TCGCGCAAGA TCGACGTTAA TTCGATCCGC TCGATCCCCA AGGAGAAGAT CTTCATCGTC CAGCTTGCCG ATGCGCCGCT GATCGACATG GACCTGCTCT ACTGGAGCCG CCACTTCCGC AACATGCCTG GAGAAGGCGA GCTTCCCGTC ACGGCGTTTA CCGAAGCCGT CGCTGCCACA GGTTATGACG GCTACTTCTC GCTGGAGATT TTCAACGACC AGTTTCGAGG CGGCCTGTCG CGGCCGATCG CCGCCGACGG CCATCGCTCG CTGATCTATC TCGGCGATCA GGTGCGCCGC CATCTCGGCA CCGGCAGCAT GACCGGGGCG GCGATGCCGG AACGGGCGGC CGTCCAGGGC GTCGGCTTCG TGGAGTTTGC CACGGACGAA CAGGATGAAG TCGAGCTGGT GGCCTTGCTC CGCACCCTCG GCTTCAGACA GACCGCCGTC CACCGCACCA AGAAAGTCTC TCTGTTCGAG CAGGGCGAGA TCCGGATCCT CGTCAATGTC GATCAGGCGG GATTTGCCAA TGCGGCCTAC GCCGTGCATG GGACGTTTGC CTATGCCATG GCACTGGTCG TCGACGACGC CGCAAAGGCA TATGCGCGTG CGCTTGCTCT CGATGCCGAG CCCTTCGCCC AGCCGGTCGC GGAGGGTGAA CTCGAGCTGC CCGCCATTCG TGGCGTCGGC GGCGGCATCG TCTATCTGAT CGACGACAAG AGCGCCCTCG GCCGCTTTTC CGAAATCGAT TTTCGCCCGG TCACCGACGA CACCGACACG GCGTCTGCCG GCCTTATTCG CGTCGATCAC GTCGCCCAGA CGGTCGGCTA TGATGAAATG CTCACCTGGC TTTTGTTCTA CACGTCGATC TTCGAGACCC ATAAGACCCC GATGGTCGAC ATCATCGATC CAGCCGGCGT GGTGCGCAGC CAGGTCGTCG AGAACCAGTC GGGCGCGCTG CGCATCACCA TGAACGGAGC CGAAAACCGC CGCACCCTGG CCGGCCATTT CATTGCCGAA AAGTTCGGAT CCGGCATCCA GCACCTGGCG TTTTCGACCG ACGATATCTT TGCAACCGCC GAAAAACTTC GCAGCTGCGG ATTCCGGTCC CTGCACATTT CGCCGAACTA TTACGACGAC GTCGAAGCCC GCTTCGGGCT CGATCCCGTC ATGACCGAGC GGCTGAAAGC GGAGAACATC CTCTATGACC GGGACGAGCA CGGCGAGTAT TTCCAGCTCT ACAGCGGCAC CTATGGCGAG GGTTTCTTTT TCGAGATCGT CGAGCGCCGC GGCTATCGCG GCTATGGCGC CCCGAACGCA ATTTTCCGGA TCGCCGCGCT GAAAAGACAA ATGCGCCCGG AAGGAATTCC GAAAGACGTC GATTAA
|
Protein sequence | MKTSIATVTI SGELPEKLEA IARAGFDGVE IFENDFLAFD GSPADVGKLV RDHGLEITLF QPFRDFEGMP EPLRSRTFDR AERKFDVMQQ LGTDLVLVCS NISPAAIGGI DRAAEDFREL GERAARRGLR VGYEALAWGR HISDHRDAWE IVRRADHPNV GLILDSFHTL SRKIDVNSIR SIPKEKIFIV QLADAPLIDM DLLYWSRHFR NMPGEGELPV TAFTEAVAAT GYDGYFSLEI FNDQFRGGLS RPIAADGHRS LIYLGDQVRR HLGTGSMTGA AMPERAAVQG VGFVEFATDE QDEVELVALL RTLGFRQTAV HRTKKVSLFE QGEIRILVNV DQAGFANAAY AVHGTFAYAM ALVVDDAAKA YARALALDAE PFAQPVAEGE LELPAIRGVG GGIVYLIDDK SALGRFSEID FRPVTDDTDT ASAGLIRVDH VAQTVGYDEM LTWLLFYTSI FETHKTPMVD IIDPAGVVRS QVVENQSGAL RITMNGAENR RTLAGHFIAE KFGSGIQHLA FSTDDIFATA EKLRSCGFRS LHISPNYYDD VEARFGLDPV MTERLKAENI LYDRDEHGEY FQLYSGTYGE GFFFEIVERR GYRGYGAPNA IFRIAALKRQ MRPEGIPKDV D
|
| |