Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_7092 |
Symbol | |
ID | 8022378 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | + |
Start bp | 501904 |
End bp | 503799 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644833929 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_002985063 |
Protein GI | 241666979 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.26168 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGACCT CGATTGCGAC TGTAACGATC AGCGGCGAAC TTCCCGAGAA GCTCGAGGCG ATCGCCCGAG CGGGCTTCGA CGGCGTCGAG ATCTTCGAAA ATGATTTCCT CGCCTTCGAC GGAAGCCCGG CCGATGTCGG AAAACTCGTC CGCGATCATG GCCTGGAGAT CACCCTGTTT CAGCCGTTTC GTGATTTCGA GGGCATGCCG GAGCAGCTGC GAAGCCGAAC GTTTGATCGC GCGGAACGGA AGTTCGACGT GATGCAGCAG CTCGGAACGG ATCTGGTGCT GGTCTGCTCC AACGTCTCGC CGGCAGCCAT CGGCGGCATC GACCGGGCGG CGGCGGACTT TCACGAACTC GGCGAGCGTG CCGCCCGGCG GGGACTGAGG GTGGGCTACG AGGCGCTTGC CTGGGGCCGC CATATCAGCG ACCACCGCGA CGCCTGGGAG ATCGTCCGTC GCGCCGATCA TCCGAATATC GGTCTTATCC TCGACAGCTT CCACACTTTG TCGCGCAAGA TCGACGTCAA TTCGATCCGC TCAATTCCCA AGGAGAAAAT CTTCATCGTC CAGCTTGCCG ATGCCCCTGA TATCGACATG GACCTGCTCT ATTGGAGCCG CCACTTCCGA AACATGCCAG GGGAAGGCGA TCTTCCCGTC ACGGCGTTTA CCGAAGCCGT CGCGGCAACA GGTTATGATG GCTATTTCTC CCTGGAGATC TTCAACGACC AGTTTCGAGG CGGTCTGTCG CGGGCGATTG CCGCCGATGG CCACCGCTCG CTGATCTATC TTGGCGATCA GGTGCGCCGT CATCTCGGCA TCGGCAGCAT GACCGGAGCG GCGATGCCGG AAAGGCCTTC CGTCAAGGGC GTCGGTTTCG TCGAGTTCGC CACGGACGAG GAAGACGAGG TCGAGCTGGT TGCATTGCTG CGCACCCTCG GTTTCAAAAG GACGGCAATC CACCGCACGA AGAAAGTCTC TCTTTTCGAG CAGGGCGAGA TCCGGATCCT CGTCAATGTC GATGAGGCAG GATTCGCGAA CGCGGCTTAC GCCGTTCACG GCACTTTTGC CTATGCCATG GCCCTGGTCG TCGACGATGC CGCAAAGGCC TATGAGCGCG CGCTCGCCTT GGATGCCGAG CCATTCACCC AGCCTGTCGC GGACGGTGAA CTCGAGCTGC CCGCCATTCG GGGTGTCGGG GGCGGGATCG TCTATCTCAT CGACGACAAG AGCGCATTGG GTCGCTTCTC CGAAATCGAC TTTCAGCCGG TCACGGACGA TACCGACGCG CCGTCCGCGG GGCTGTTGCG CGTTGATCAC GTCGCCCAGA CGGTCGGTTA CGATGAAATG CTCACCTGGC TTCTGTTCTA CACGTCGATT TTCGAGACGC GGAAGACCCC AATGGTCGAT ATCATCGATC CGGCCGGAGT GGTGCGCAGC CAAGTCGTCG AGAACGACAC CGGGGCGCTG CGCATTACCA TGAACGGCGC CGAGAATCGC CGCACCCTGG CGGGACATTT CATCGCCGAA AAATTCGGGG CCGGCATCCA GCACCTGGCG TTTTTGACCG ACGACATCTT CGCGACCGCC GAAAGCCTTC GTGTTTGCGG TTTCAGATCG CTGCACATTT CGCCGAACTA CTACGACGAC GTCGAAGCAC GCTTCGGCCT CGATCCGGCC CGGACCGAGC GGTTGAAGGC GGAAAACATC CTCTATGACC GCGACGAGCA TGGCGAATAT TTCCAGCTCT ATAGCGGAAC CTATGGAGAG GGGTTCTTCT TCGAGATCGT CGAACGCCGC GGCTACCGCG GCTATGGCGC CCCGAACGCG ATTTTCCGGA TCGCCGCCCT GAAGAAACAG ATGCGTCCGG AAGGAATTCC GAAAGACGCC TTTTGA
|
Protein sequence | MRTSIATVTI SGELPEKLEA IARAGFDGVE IFENDFLAFD GSPADVGKLV RDHGLEITLF QPFRDFEGMP EQLRSRTFDR AERKFDVMQQ LGTDLVLVCS NVSPAAIGGI DRAAADFHEL GERAARRGLR VGYEALAWGR HISDHRDAWE IVRRADHPNI GLILDSFHTL SRKIDVNSIR SIPKEKIFIV QLADAPDIDM DLLYWSRHFR NMPGEGDLPV TAFTEAVAAT GYDGYFSLEI FNDQFRGGLS RAIAADGHRS LIYLGDQVRR HLGIGSMTGA AMPERPSVKG VGFVEFATDE EDEVELVALL RTLGFKRTAI HRTKKVSLFE QGEIRILVNV DEAGFANAAY AVHGTFAYAM ALVVDDAAKA YERALALDAE PFTQPVADGE LELPAIRGVG GGIVYLIDDK SALGRFSEID FQPVTDDTDA PSAGLLRVDH VAQTVGYDEM LTWLLFYTSI FETRKTPMVD IIDPAGVVRS QVVENDTGAL RITMNGAENR RTLAGHFIAE KFGAGIQHLA FLTDDIFATA ESLRVCGFRS LHISPNYYDD VEARFGLDPA RTERLKAENI LYDRDEHGEY FQLYSGTYGE GFFFEIVERR GYRGYGAPNA IFRIAALKKQ MRPEGIPKDA F
|
| |