Gene Rleg_1510 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1510 
Symbol 
ID8012594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1491658 
End bp1492767 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content58% 
IMG OID644824098 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_002975340 
Protein GI241204244 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00348363 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.317162 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCCTT TCCCGCATGA CGCGCCGCCC TCGGAAATCA CCGCCGACAA TCCGGCCGGC 
ACCGACGGCT TCGAATTCGT CGAATTCGCC CATCCCGAGC CCGAAACGCT GCGCGAACTC
TTCACACGCA TGGGCTATGT TGCGGTCGCC AAACACAAGA CGAAAGATAT CACCGTCTGG
CGACAAGGCG ACATCAACTA TGTCCTGAAC GCCGAACCCG GCAGCCATGC CGCGCGCTTC
GTCGCAAATC ACGGTCCCTG CGCCCCATCG ATGGCCTGGC GCGTCGTCGA CGCCCAACAC
GCGTTCAAGC ATGCGGTGTC AAAAGGCGCT GTCCCCTATG AAGGGGACGA CAAGATGCTC
GATGTTCCGG CGATTGTCGG CATCGGCGGC TCGCTGCTCT ATTTCGTCGA GACCTATGGT
GCCAAGGGAT CGGCCTACGA GGCCGAGTTC GATTGGCTCG GCGAGCGCGA CCCGCATCCC
CAAGGGATCG GCTTCTATTA TCTCGACCAT CTGACGCACA ACGTCTTTCG CGGCAACATG
GACAAGTGGT GGGATTTTTA CCGCAACCTG TTCAATTTCA AGCAGATCCA CTTCTTCGAT
ATCGATGGGC GCATCACCGG TCTGGTGAGC CGCGCCATCA CCTCGCCCTG CGGCAAAATC
CGCATTCCGC TGAATGAATC GAAGGACGAC ACCAGCCAGA TCGAGGAATA TCTGAAGAAG
TACAAGGGCG AGGGCATCCA GCATATCGCC GTTGGAACGG AGGATATTTA CGACGCCACC
GACAGGCTGG CCGACAACGG CCTGCGTTTC ATGCCGGGTC CGCCGGAGAC CTATTACGAC
ATGTCCTATG AACGCGTGAA CGGCCATAGT GAACCTGTCG AACGCATGAA GAAACACGGT
ATCCTCATCG ATGGCGAAGG TGTGGTGAAT GGCGGCATGA CGAAAATCCT GCTGCAGATC
TTCTCCAAGA CCGTCATCGG CCCGATCTTC TTCGAATTCA TCCAGCGCAA GGGCGACGAG
GGTTTCGGCG AAGGCAATTT CCGGGCTCTC TTTGAGTCGA TCGAGGCCGA TCAGATCAAG
CGTGGTGTCA TCGGGACTGC AGCGGAGTAA
 
Protein sequence
MGPFPHDAPP SEITADNPAG TDGFEFVEFA HPEPETLREL FTRMGYVAVA KHKTKDITVW 
RQGDINYVLN AEPGSHAARF VANHGPCAPS MAWRVVDAQH AFKHAVSKGA VPYEGDDKML
DVPAIVGIGG SLLYFVETYG AKGSAYEAEF DWLGERDPHP QGIGFYYLDH LTHNVFRGNM
DKWWDFYRNL FNFKQIHFFD IDGRITGLVS RAITSPCGKI RIPLNESKDD TSQIEEYLKK
YKGEGIQHIA VGTEDIYDAT DRLADNGLRF MPGPPETYYD MSYERVNGHS EPVERMKKHG
ILIDGEGVVN GGMTKILLQI FSKTVIGPIF FEFIQRKGDE GFGEGNFRAL FESIEADQIK
RGVIGTAAE