Gene Information |
Locus tag | Rleg2_1406 |
Symbol | |
ID | 6980134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1426208 |
End bp | 1427317 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643396127 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_002280926 |
Protein GI | 209549009 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
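The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(1426208, 1427317)
print(length)                  # 1110
print(protein_length(length))  # 369
```

Both results agree with the Gene Length (1110 bp) and Protein Length (369 aa) rows of the record.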
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.584202 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCCTT TCCCGCACGA TGCGCCGCCA TCCGAAATCA CCGCGGACAA CCCGGCCGGC ACCGACGGTT TCGAATTCGT CGAGTTCGCA CATCCCGAAC CCGAGAAGCT CAGCGAACTT TTCACGCGCA TGGGCTATGT CGCGGTCGCC AGGCACAAGA CGAAAGACAT CACCGTCTGG CGCCAGGGTG ACATCAACTA TGTCGTCAAT GCCGAGCCCG GCAGCCATGC CGCCCGCTTT GTCGGGCAAC ATGGTCCCTG CGCCGCATCG ATGGCCTGGC GTGTCGTCGA TGCCAAACAT GCTTTCGACC ATGCGGTGTC GAAAGGCGCC GTTCCCTATG AAGGCGACGA CAAGATGCTC GACGTTCCGG CCATCACAGG CATCGGCGGC TCGCTGCTCT ATTTCGTCGA GACCTATGGG GCGAAGGGGT CGGCTTACGA GGCCGAGTTC GATTGGCTGG GTGAGCGCAA TCCGCGACCC GAGGGGATCG GTTTCTATTA TCTCGACCAT CTCACCCACA ACGTCTTCCG CGGCAATATG GACAAGTGGT GGGATTTTTA CCGCGAGCTG TTCAACTTCA AGCAGATCCA CTTCTTCGAC ATTGACGGCC GCATCACCGG CCTGGTCAGC CGCGCCATCA CCTCGCCCTG CGGCAAGATC CGCATTCCGC TGAATGAATC GAAGGACGAC ACCAGCCAGA TCGAGGAATA TCTGAAGAAG TACCGGGGCG AGGGCATCCA GCACATCGCC GTCGGCACCG AGGACATTTA TGGGGCGACC GACAAGCTCG CCGATAACGG CCTGCGCTTC ATGCCGGGTC CGCCGGAGAC CTATTACAAC ATGTCCTATG AGCGGGTGAA CGGCCATAGC GAACCCATCG AACGCATGAA GAAACACGGC ATCCTGATCG ACGGGGAGGG CGTGGTGAAT GGCGGCATGA CGAAAATCCT GCTGCAGATC TTTTCCAAAA CCGTCATCGG CCCGATCTTC TTCGAATTTA TCCAGCGCAA GGGCGACGAG GGTTTCGGCG AAGGCAACTT CCGGGCGCTG TTCGAGTCGA TCGAAGCCGA TCAGATCAAG CGTGGTGTCA TCGGGACGGC AGCGGAGTAA
|
Protein sequence | MGPFPHDAPP SEITADNPAG TDGFEFVEFA HPEPEKLSEL FTRMGYVAVA RHKTKDITVW RQGDINYVVN AEPGSHAARF VGQHGPCAAS MAWRVVDAKH AFDHAVSKGA VPYEGDDKML DVPAITGIGG SLLYFVETYG AKGSAYEAEF DWLGERNPRP EGIGFYYLDH LTHNVFRGNM DKWWDFYREL FNFKQIHFFD IDGRITGLVS RAITSPCGKI RIPLNESKDD TSQIEEYLKK YRGEGIQHIA VGTEDIYGAT DKLADNGLRF MPGPPETYYN MSYERVNGHS EPIERMKKHG ILIDGEGVVN GGMTKILLQI FSKTVIGPIF FEFIQRKGDE GFGEGNFRAL FESIEADQIK RGVIGTAAE
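The sequence fields can be related to the GC content and Protein sequence rows with a short script. This is a minimal sketch, not the method used by the source database: it assumes the sequence is pasted as a string with the space-separated blocks of ten shown above, and it relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two tables differ in which codons may serve as starts). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); table 11 uses the same ones.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces between ten-letter blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGGGTCCTT TCCCGCACGA TGCGCCGCCA"
print(translate(prefix))  # MGPFPHDAPP
```

The translated prefix matches the first block of the Protein sequence row. Passing the full Gene sequence string to `gc_percent` and `translate` would allow the same comparison against the GC content and complete protein fields.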