Gene Rleg2_5952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5952 
Symbol 
ID6977338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011366 
Strand
Start bp365679 
End bp367574 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content62% 
IMG OID643393404 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_002278222 
Protein GI209546332 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG1082] Sugar phosphate isomerases/epimerases
[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0955687 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00791612 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAGACCT CGATTGCGAC TGTGACGATC AGCGGCGAAC TGCCTGAGAA GCTCGAGGCG 
ATCGCCCGGG CGGGGTTCGA TGGCGTCGAG ATCTTCGAAA ACGATTTCCT GGCTTTCGAC
GGCAGCCCGG CCGATGTCGG AAAACTGGTG CGCGACCATG GCCTTGAGAT CACGCTGTTT
CAGCCGTTTC GGGATTTCGA GGGCATGCCG GAGCCGCTGC GCAGCCGCAC CTTCGATCGG
GCGGAACGGA AGTTCGACGT GATGCAGCAG CTTGGAACAG ATCTGGTGCT GGTCTGCTCG
AACATCTCGC CGGCCGCCAT CGGCGGCATC GACAGGGCGG CTGAGGATTT CCGCGAACTT
GGCGAGCGCG CCGCCCGGCG CGGACTGAGG GTGGGTTACG AGGCGCTCGC CTGGGGTCGC
CATATCAGTG ATCACCGCGA CGCCTGGGAA ATCGTTCGGC GCGCCGATCA TCCGAATGTC
GGCCTCATCC TCGACAGCTT CCACACTCTG TCGCGCAAGA TCGACGTTAA TTCGATCCGC
TCGATCCCCA AGGAGAAGAT CTTCATCGTC CAGCTTGCCG ATGCGCCGCT GATCGACATG
GACCTGCTCT ACTGGAGCCG CCACTTCCGC AACATGCCTG GAGAAGGCGA GCTTCCCGTC
ACGGCGTTTA CCGAAGCCGT CGCTGCCACA GGTTATGACG GCTACTTCTC GCTGGAGATT
TTCAACGACC AGTTTCGAGG CGGCCTGTCG CGGCCGATCG CCGCCGACGG CCATCGCTCG
CTGATCTATC TCGGCGATCA GGTGCGCCGC CATCTCGGCA CCGGCAGCAT GACCGGGGCG
GCGATGCCGG AACGGGCGGC CGTCCAGGGC GTCGGCTTCG TGGAGTTTGC CACGGACGAA
CAGGATGAAG TCGAGCTGGT GGCCTTGCTC CGCACCCTCG GCTTCAGACA GACCGCCGTC
CACCGCACCA AGAAAGTCTC TCTGTTCGAG CAGGGCGAGA TCCGGATCCT CGTCAATGTC
GATCAGGCGG GATTTGCCAA TGCGGCCTAC GCCGTGCATG GGACGTTTGC CTATGCCATG
GCACTGGTCG TCGACGACGC CGCAAAGGCA TATGCGCGTG CGCTTGCTCT CGATGCCGAG
CCCTTCGCCC AGCCGGTCGC GGAGGGTGAA CTCGAGCTGC CCGCCATTCG TGGCGTCGGC
GGCGGCATCG TCTATCTGAT CGACGACAAG AGCGCCCTCG GCCGCTTTTC CGAAATCGAT
TTTCGCCCGG TCACCGACGA CACCGACACG GCGTCTGCCG GCCTTATTCG CGTCGATCAC
GTCGCCCAGA CGGTCGGCTA TGATGAAATG CTCACCTGGC TTTTGTTCTA CACGTCGATC
TTCGAGACCC ATAAGACCCC GATGGTCGAC ATCATCGATC CAGCCGGCGT GGTGCGCAGC
CAGGTCGTCG AGAACCAGTC GGGCGCGCTG CGCATCACCA TGAACGGAGC CGAAAACCGC
CGCACCCTGG CCGGCCATTT CATTGCCGAA AAGTTCGGAT CCGGCATCCA GCACCTGGCG
TTTTCGACCG ACGATATCTT TGCAACCGCC GAAAAACTTC GCAGCTGCGG ATTCCGGTCC
CTGCACATTT CGCCGAACTA TTACGACGAC GTCGAAGCCC GCTTCGGGCT CGATCCCGTC
ATGACCGAGC GGCTGAAAGC GGAGAACATC CTCTATGACC GGGACGAGCA CGGCGAGTAT
TTCCAGCTCT ACAGCGGCAC CTATGGCGAG GGTTTCTTTT TCGAGATCGT CGAGCGCCGC
GGCTATCGCG GCTATGGCGC CCCGAACGCA ATTTTCCGGA TCGCCGCGCT GAAAAGACAA
ATGCGCCCGG AAGGAATTCC GAAAGACGTC GATTAA
 
Protein sequence
MKTSIATVTI SGELPEKLEA IARAGFDGVE IFENDFLAFD GSPADVGKLV RDHGLEITLF 
QPFRDFEGMP EPLRSRTFDR AERKFDVMQQ LGTDLVLVCS NISPAAIGGI DRAAEDFREL
GERAARRGLR VGYEALAWGR HISDHRDAWE IVRRADHPNV GLILDSFHTL SRKIDVNSIR
SIPKEKIFIV QLADAPLIDM DLLYWSRHFR NMPGEGELPV TAFTEAVAAT GYDGYFSLEI
FNDQFRGGLS RPIAADGHRS LIYLGDQVRR HLGTGSMTGA AMPERAAVQG VGFVEFATDE
QDEVELVALL RTLGFRQTAV HRTKKVSLFE QGEIRILVNV DQAGFANAAY AVHGTFAYAM
ALVVDDAAKA YARALALDAE PFAQPVAEGE LELPAIRGVG GGIVYLIDDK SALGRFSEID
FRPVTDDTDT ASAGLIRVDH VAQTVGYDEM LTWLLFYTSI FETHKTPMVD IIDPAGVVRS
QVVENQSGAL RITMNGAENR RTLAGHFIAE KFGSGIQHLA FSTDDIFATA EKLRSCGFRS
LHISPNYYDD VEARFGLDPV MTERLKAENI LYDRDEHGEY FQLYSGTYGE GFFFEIVERR
GYRGYGAPNA IFRIAALKRQ MRPEGIPKDV D