Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_0005 |
Symbol | |
ID | 6407646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 7213 |
End bp | 8331 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642709912 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_001989043 |
Protein GI | 192288438 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGACCAT TTCCGCACGA TGCGCAGCCG GCCACGATCA GCGCCGACAA TCCGATGGGC ACCGACGGAT TCGAGTTTGT CGAATTCGCT CATCCGGATC CGGCGCAACT TCATAGCCTG TTCAAGCTGA TGGGCTTCAG CTTGGTCGCA AAGCATCGCA CCAAGGCGAT TTCGGTCTAT CGCCAGGGCG ACGTCAACTA TCTGGTCAAC GAGCAGCCCG GCACCCACGG CACCGACTTC GTCACTGCAC ACGGCCCGTG CGCGCCGTCG ATGGCGTTCC GCGTGGTCGA TGCCAAGCAG GCCTACGAGC GCGCGATCTC GCTCGGCGCC GAACCGGCCG GCGTGACGGC CGCGGAGGCG ACACTGGACG TACCAGCCAT CAAGGGCATC GGCGGCAGTC TGCTGTATTT CGTCGACCGC TACGGTGCCA AGGGGTCGGC CTACGATGCT GAATTCGACT GGGTCGGTGC CCGCGACGTT CGGCCCGCCG GCGCCGGGCT CTATTACATC GATCATCTGA CTCACAACGT CCATCGCGGC CGTATGGATG TCTGGACCGG GTTCTATGCC AAACTGTTCA ACTTCCGGCA GATACGCTTC TTCGACATCG AAGGTCGCGC CTCCGGCCTG TTCTCGCGTG CGCTGACCAG CCCGGACGGC AAGATTCGGA TTCCGATCAA CGAGGACGCC GGTGATTCAG GCCAGATCGA AGAGTATCTC AGCCTGTATC GCGGCGAGGG CATCCAGCAT ATCGCCTGCG GCTGCCGCGA CATCTACAGC ACGGTCGAAG GACTTCGTGC CGCCGGCCTG CCATTCATGC CGTCGCCGCC GGAGACGTAT TTCGAACGCG TCGATGCACG TCTGCCGGAG CATGGTGAGG ACGTCGCCCG GCTGCAGCGG AACGGCATCC TGATCGACGG TGAAGGCGTC GTCGACGGCG GTCAGACCAA GGTGCTACTG CAGATCTTCT CGGCGAATGC GATCGGCCCG ATCTTCTTCG AATTCATCCA GCGCAAGGGC GACGACGGCT TCGGCGAAGG CAACTTCAAG GCGCTATTCG AGTCGATCGA AGAAGACCAG ATCCGCCGAG GCGTGCTGAA AGCGGAGCGT GCGGCGTAG
|
Protein sequence | MGPFPHDAQP ATISADNPMG TDGFEFVEFA HPDPAQLHSL FKLMGFSLVA KHRTKAISVY RQGDVNYLVN EQPGTHGTDF VTAHGPCAPS MAFRVVDAKQ AYERAISLGA EPAGVTAAEA TLDVPAIKGI GGSLLYFVDR YGAKGSAYDA EFDWVGARDV RPAGAGLYYI DHLTHNVHRG RMDVWTGFYA KLFNFRQIRF FDIEGRASGL FSRALTSPDG KIRIPINEDA GDSGQIEEYL SLYRGEGIQH IACGCRDIYS TVEGLRAAGL PFMPSPPETY FERVDARLPE HGEDVARLQR NGILIDGEGV VDGGQTKVLL QIFSANAIGP IFFEFIQRKG DDGFGEGNFK ALFESIEEDQ IRRGVLKAER AA
|
| |