Gene Rleg2_1406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1406 
Symbol 
ID6980134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1426208 
End bp1427317 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content59% 
IMG OID643396127 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_002280926 
Protein GI209549009 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.584202 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCCTT TCCCGCACGA TGCGCCGCCA TCCGAAATCA CCGCGGACAA CCCGGCCGGC 
ACCGACGGTT TCGAATTCGT CGAGTTCGCA CATCCCGAAC CCGAGAAGCT CAGCGAACTT
TTCACGCGCA TGGGCTATGT CGCGGTCGCC AGGCACAAGA CGAAAGACAT CACCGTCTGG
CGCCAGGGTG ACATCAACTA TGTCGTCAAT GCCGAGCCCG GCAGCCATGC CGCCCGCTTT
GTCGGGCAAC ATGGTCCCTG CGCCGCATCG ATGGCCTGGC GTGTCGTCGA TGCCAAACAT
GCTTTCGACC ATGCGGTGTC GAAAGGCGCC GTTCCCTATG AAGGCGACGA CAAGATGCTC
GACGTTCCGG CCATCACAGG CATCGGCGGC TCGCTGCTCT ATTTCGTCGA GACCTATGGG
GCGAAGGGGT CGGCTTACGA GGCCGAGTTC GATTGGCTGG GTGAGCGCAA TCCGCGACCC
GAGGGGATCG GTTTCTATTA TCTCGACCAT CTCACCCACA ACGTCTTCCG CGGCAATATG
GACAAGTGGT GGGATTTTTA CCGCGAGCTG TTCAACTTCA AGCAGATCCA CTTCTTCGAC
ATTGACGGCC GCATCACCGG CCTGGTCAGC CGCGCCATCA CCTCGCCCTG CGGCAAGATC
CGCATTCCGC TGAATGAATC GAAGGACGAC ACCAGCCAGA TCGAGGAATA TCTGAAGAAG
TACCGGGGCG AGGGCATCCA GCACATCGCC GTCGGCACCG AGGACATTTA TGGGGCGACC
GACAAGCTCG CCGATAACGG CCTGCGCTTC ATGCCGGGTC CGCCGGAGAC CTATTACAAC
ATGTCCTATG AGCGGGTGAA CGGCCATAGC GAACCCATCG AACGCATGAA GAAACACGGC
ATCCTGATCG ACGGGGAGGG CGTGGTGAAT GGCGGCATGA CGAAAATCCT GCTGCAGATC
TTTTCCAAAA CCGTCATCGG CCCGATCTTC TTCGAATTTA TCCAGCGCAA GGGCGACGAG
GGTTTCGGCG AAGGCAACTT CCGGGCGCTG TTCGAGTCGA TCGAAGCCGA TCAGATCAAG
CGTGGTGTCA TCGGGACGGC AGCGGAGTAA
 
Protein sequence
MGPFPHDAPP SEITADNPAG TDGFEFVEFA HPEPEKLSEL FTRMGYVAVA RHKTKDITVW 
RQGDINYVVN AEPGSHAARF VGQHGPCAAS MAWRVVDAKH AFDHAVSKGA VPYEGDDKML
DVPAITGIGG SLLYFVETYG AKGSAYEAEF DWLGERNPRP EGIGFYYLDH LTHNVFRGNM
DKWWDFYREL FNFKQIHFFD IDGRITGLVS RAITSPCGKI RIPLNESKDD TSQIEEYLKK
YRGEGIQHIA VGTEDIYGAT DKLADNGLRF MPGPPETYYN MSYERVNGHS EPIERMKKHG
ILIDGEGVVN GGMTKILLQI FSKTVIGPIF FEFIQRKGDE GFGEGNFRAL FESIEADQIK
RGVIGTAAE