Gene Rleg_7092 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_7092 
Symbol 
ID8022378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012858 
Strand
Start bp501904 
End bp503799 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content61% 
IMG OID644833929 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_002985063 
Protein GI241666979 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.26168 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGACCT CGATTGCGAC TGTAACGATC AGCGGCGAAC TTCCCGAGAA GCTCGAGGCG 
ATCGCCCGAG CGGGCTTCGA CGGCGTCGAG ATCTTCGAAA ATGATTTCCT CGCCTTCGAC
GGAAGCCCGG CCGATGTCGG AAAACTCGTC CGCGATCATG GCCTGGAGAT CACCCTGTTT
CAGCCGTTTC GTGATTTCGA GGGCATGCCG GAGCAGCTGC GAAGCCGAAC GTTTGATCGC
GCGGAACGGA AGTTCGACGT GATGCAGCAG CTCGGAACGG ATCTGGTGCT GGTCTGCTCC
AACGTCTCGC CGGCAGCCAT CGGCGGCATC GACCGGGCGG CGGCGGACTT TCACGAACTC
GGCGAGCGTG CCGCCCGGCG GGGACTGAGG GTGGGCTACG AGGCGCTTGC CTGGGGCCGC
CATATCAGCG ACCACCGCGA CGCCTGGGAG ATCGTCCGTC GCGCCGATCA TCCGAATATC
GGTCTTATCC TCGACAGCTT CCACACTTTG TCGCGCAAGA TCGACGTCAA TTCGATCCGC
TCAATTCCCA AGGAGAAAAT CTTCATCGTC CAGCTTGCCG ATGCCCCTGA TATCGACATG
GACCTGCTCT ATTGGAGCCG CCACTTCCGA AACATGCCAG GGGAAGGCGA TCTTCCCGTC
ACGGCGTTTA CCGAAGCCGT CGCGGCAACA GGTTATGATG GCTATTTCTC CCTGGAGATC
TTCAACGACC AGTTTCGAGG CGGTCTGTCG CGGGCGATTG CCGCCGATGG CCACCGCTCG
CTGATCTATC TTGGCGATCA GGTGCGCCGT CATCTCGGCA TCGGCAGCAT GACCGGAGCG
GCGATGCCGG AAAGGCCTTC CGTCAAGGGC GTCGGTTTCG TCGAGTTCGC CACGGACGAG
GAAGACGAGG TCGAGCTGGT TGCATTGCTG CGCACCCTCG GTTTCAAAAG GACGGCAATC
CACCGCACGA AGAAAGTCTC TCTTTTCGAG CAGGGCGAGA TCCGGATCCT CGTCAATGTC
GATGAGGCAG GATTCGCGAA CGCGGCTTAC GCCGTTCACG GCACTTTTGC CTATGCCATG
GCCCTGGTCG TCGACGATGC CGCAAAGGCC TATGAGCGCG CGCTCGCCTT GGATGCCGAG
CCATTCACCC AGCCTGTCGC GGACGGTGAA CTCGAGCTGC CCGCCATTCG GGGTGTCGGG
GGCGGGATCG TCTATCTCAT CGACGACAAG AGCGCATTGG GTCGCTTCTC CGAAATCGAC
TTTCAGCCGG TCACGGACGA TACCGACGCG CCGTCCGCGG GGCTGTTGCG CGTTGATCAC
GTCGCCCAGA CGGTCGGTTA CGATGAAATG CTCACCTGGC TTCTGTTCTA CACGTCGATT
TTCGAGACGC GGAAGACCCC AATGGTCGAT ATCATCGATC CGGCCGGAGT GGTGCGCAGC
CAAGTCGTCG AGAACGACAC CGGGGCGCTG CGCATTACCA TGAACGGCGC CGAGAATCGC
CGCACCCTGG CGGGACATTT CATCGCCGAA AAATTCGGGG CCGGCATCCA GCACCTGGCG
TTTTTGACCG ACGACATCTT CGCGACCGCC GAAAGCCTTC GTGTTTGCGG TTTCAGATCG
CTGCACATTT CGCCGAACTA CTACGACGAC GTCGAAGCAC GCTTCGGCCT CGATCCGGCC
CGGACCGAGC GGTTGAAGGC GGAAAACATC CTCTATGACC GCGACGAGCA TGGCGAATAT
TTCCAGCTCT ATAGCGGAAC CTATGGAGAG GGGTTCTTCT TCGAGATCGT CGAACGCCGC
GGCTACCGCG GCTATGGCGC CCCGAACGCG ATTTTCCGGA TCGCCGCCCT GAAGAAACAG
ATGCGTCCGG AAGGAATTCC GAAAGACGCC TTTTGA
 
Protein sequence
MRTSIATVTI SGELPEKLEA IARAGFDGVE IFENDFLAFD GSPADVGKLV RDHGLEITLF 
QPFRDFEGMP EQLRSRTFDR AERKFDVMQQ LGTDLVLVCS NVSPAAIGGI DRAAADFHEL
GERAARRGLR VGYEALAWGR HISDHRDAWE IVRRADHPNI GLILDSFHTL SRKIDVNSIR
SIPKEKIFIV QLADAPDIDM DLLYWSRHFR NMPGEGDLPV TAFTEAVAAT GYDGYFSLEI
FNDQFRGGLS RAIAADGHRS LIYLGDQVRR HLGIGSMTGA AMPERPSVKG VGFVEFATDE
EDEVELVALL RTLGFKRTAI HRTKKVSLFE QGEIRILVNV DEAGFANAAY AVHGTFAYAM
ALVVDDAAKA YERALALDAE PFTQPVADGE LELPAIRGVG GGIVYLIDDK SALGRFSEID
FQPVTDDTDA PSAGLLRVDH VAQTVGYDEM LTWLLFYTSI FETRKTPMVD IIDPAGVVRS
QVVENDTGAL RITMNGAENR RTLAGHFIAE KFGAGIQHLA FLTDDIFATA ESLRVCGFRS
LHISPNYYDD VEARFGLDPA RTERLKAENI LYDRDEHGEY FQLYSGTYGE GFFFEIVERR
GYRGYGAPNA IFRIAALKKQ MRPEGIPKDA F