Gene Rpal_0005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_0005 
Symbol 
ID6407646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp7213 
End bp8331 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content63% 
IMG OID642709912 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_001989043 
Protein GI192288438 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGACCAT TTCCGCACGA TGCGCAGCCG GCCACGATCA GCGCCGACAA TCCGATGGGC 
ACCGACGGAT TCGAGTTTGT CGAATTCGCT CATCCGGATC CGGCGCAACT TCATAGCCTG
TTCAAGCTGA TGGGCTTCAG CTTGGTCGCA AAGCATCGCA CCAAGGCGAT TTCGGTCTAT
CGCCAGGGCG ACGTCAACTA TCTGGTCAAC GAGCAGCCCG GCACCCACGG CACCGACTTC
GTCACTGCAC ACGGCCCGTG CGCGCCGTCG ATGGCGTTCC GCGTGGTCGA TGCCAAGCAG
GCCTACGAGC GCGCGATCTC GCTCGGCGCC GAACCGGCCG GCGTGACGGC CGCGGAGGCG
ACACTGGACG TACCAGCCAT CAAGGGCATC GGCGGCAGTC TGCTGTATTT CGTCGACCGC
TACGGTGCCA AGGGGTCGGC CTACGATGCT GAATTCGACT GGGTCGGTGC CCGCGACGTT
CGGCCCGCCG GCGCCGGGCT CTATTACATC GATCATCTGA CTCACAACGT CCATCGCGGC
CGTATGGATG TCTGGACCGG GTTCTATGCC AAACTGTTCA ACTTCCGGCA GATACGCTTC
TTCGACATCG AAGGTCGCGC CTCCGGCCTG TTCTCGCGTG CGCTGACCAG CCCGGACGGC
AAGATTCGGA TTCCGATCAA CGAGGACGCC GGTGATTCAG GCCAGATCGA AGAGTATCTC
AGCCTGTATC GCGGCGAGGG CATCCAGCAT ATCGCCTGCG GCTGCCGCGA CATCTACAGC
ACGGTCGAAG GACTTCGTGC CGCCGGCCTG CCATTCATGC CGTCGCCGCC GGAGACGTAT
TTCGAACGCG TCGATGCACG TCTGCCGGAG CATGGTGAGG ACGTCGCCCG GCTGCAGCGG
AACGGCATCC TGATCGACGG TGAAGGCGTC GTCGACGGCG GTCAGACCAA GGTGCTACTG
CAGATCTTCT CGGCGAATGC GATCGGCCCG ATCTTCTTCG AATTCATCCA GCGCAAGGGC
GACGACGGCT TCGGCGAAGG CAACTTCAAG GCGCTATTCG AGTCGATCGA AGAAGACCAG
ATCCGCCGAG GCGTGCTGAA AGCGGAGCGT GCGGCGTAG
 
Protein sequence
MGPFPHDAQP ATISADNPMG TDGFEFVEFA HPDPAQLHSL FKLMGFSLVA KHRTKAISVY 
RQGDVNYLVN EQPGTHGTDF VTAHGPCAPS MAFRVVDAKQ AYERAISLGA EPAGVTAAEA
TLDVPAIKGI GGSLLYFVDR YGAKGSAYDA EFDWVGARDV RPAGAGLYYI DHLTHNVHRG
RMDVWTGFYA KLFNFRQIRF FDIEGRASGL FSRALTSPDG KIRIPINEDA GDSGQIEEYL
SLYRGEGIQH IACGCRDIYS TVEGLRAAGL PFMPSPPETY FERVDARLPE HGEDVARLQR
NGILIDGEGV VDGGQTKVLL QIFSANAIGP IFFEFIQRKG DDGFGEGNFK ALFESIEEDQ
IRRGVLKAER AA