Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1510 |
Symbol | |
ID | 8012594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 1491658 |
End bp | 1492767 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644824098 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_002975340 |
Protein GI | 241204244 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00348363 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.317162 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCCTT TCCCGCATGA CGCGCCGCCC TCGGAAATCA CCGCCGACAA TCCGGCCGGC ACCGACGGCT TCGAATTCGT CGAATTCGCC CATCCCGAGC CCGAAACGCT GCGCGAACTC TTCACACGCA TGGGCTATGT TGCGGTCGCC AAACACAAGA CGAAAGATAT CACCGTCTGG CGACAAGGCG ACATCAACTA TGTCCTGAAC GCCGAACCCG GCAGCCATGC CGCGCGCTTC GTCGCAAATC ACGGTCCCTG CGCCCCATCG ATGGCCTGGC GCGTCGTCGA CGCCCAACAC GCGTTCAAGC ATGCGGTGTC AAAAGGCGCT GTCCCCTATG AAGGGGACGA CAAGATGCTC GATGTTCCGG CGATTGTCGG CATCGGCGGC TCGCTGCTCT ATTTCGTCGA GACCTATGGT GCCAAGGGAT CGGCCTACGA GGCCGAGTTC GATTGGCTCG GCGAGCGCGA CCCGCATCCC CAAGGGATCG GCTTCTATTA TCTCGACCAT CTGACGCACA ACGTCTTTCG CGGCAACATG GACAAGTGGT GGGATTTTTA CCGCAACCTG TTCAATTTCA AGCAGATCCA CTTCTTCGAT ATCGATGGGC GCATCACCGG TCTGGTGAGC CGCGCCATCA CCTCGCCCTG CGGCAAAATC CGCATTCCGC TGAATGAATC GAAGGACGAC ACCAGCCAGA TCGAGGAATA TCTGAAGAAG TACAAGGGCG AGGGCATCCA GCATATCGCC GTTGGAACGG AGGATATTTA CGACGCCACC GACAGGCTGG CCGACAACGG CCTGCGTTTC ATGCCGGGTC CGCCGGAGAC CTATTACGAC ATGTCCTATG AACGCGTGAA CGGCCATAGT GAACCTGTCG AACGCATGAA GAAACACGGT ATCCTCATCG ATGGCGAAGG TGTGGTGAAT GGCGGCATGA CGAAAATCCT GCTGCAGATC TTCTCCAAGA CCGTCATCGG CCCGATCTTC TTCGAATTCA TCCAGCGCAA GGGCGACGAG GGTTTCGGCG AAGGCAATTT CCGGGCTCTC TTTGAGTCGA TCGAGGCCGA TCAGATCAAG CGTGGTGTCA TCGGGACTGC AGCGGAGTAA
|
Protein sequence | MGPFPHDAPP SEITADNPAG TDGFEFVEFA HPEPETLREL FTRMGYVAVA KHKTKDITVW RQGDINYVLN AEPGSHAARF VANHGPCAPS MAWRVVDAQH AFKHAVSKGA VPYEGDDKML DVPAIVGIGG SLLYFVETYG AKGSAYEAEF DWLGERDPHP QGIGFYYLDH LTHNVFRGNM DKWWDFYRNL FNFKQIHFFD IDGRITGLVS RAITSPCGKI RIPLNESKDD TSQIEEYLKK YKGEGIQHIA VGTEDIYDAT DRLADNGLRF MPGPPETYYD MSYERVNGHS EPVERMKKHG ILIDGEGVVN GGMTKILLQI FSKTVIGPIF FEFIQRKGDE GFGEGNFRAL FESIEADQIK RGVIGTAAE
|
| |