Gene Bpro_4496 details


Gene Information

Locus tag: Bpro_4496
Symbol:
ID: 4012810
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas sp. JS666
Kingdom: Bacteria
Replicon accession: NC_007948
Strand:
Start bp: 4750766
End bp: 4752682
Gene Length: 1917 bp
Protein Length: 638 aa
Translation table: 11
GC content: 67%
IMG OID: 637944148
Product: 4-hydroxyphenylpyruvate dioxygenase
Protein accession: YP_551280
Protein GI: 91790328
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
              [R] General function prediction only
COG ID: [COG1082] Sugar phosphate isomerases/epimerases
        [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase
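
The start, end, gene length, and protein length values in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that adds no residue.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


# Values copied from the record above.
length = gene_length_bp(4750766, 4752682)
print(length)                           # 1917
print(expected_protein_length(length))  # 638
```

Both results agree with the Gene Length (1917 bp) and Protein Length (638 aa) fields.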


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.460677
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCGTT CCATCGCCAC GGTTTCCCTT TCCGGCATGC TGCGCGAGAA GCTGCAGGCC 
GCTGCTGCGG CGCACTTCGA TGGCGTCGAG ATCTTCGAGA ACGACCTGTT GCAGTTCCCG
GGGTCGCCGC GCGAGGTGCG CCTCATGGCC GAAGACCTCG GGCTGAGCAT CGACATGTTC
CAGCCCTTCC GCGACTTTGA CGGCACCACG CCCGCGCAAC TGGCGCGCAA CCTGGAACGC
GCCGAACGCA AGTTCGACGT GATGCAGGAG CTTGGGACCC AGTTGATCCT GGTGTGTTCC
AATGTGCAGC CCGACGCACT GAGTGATGTC GACCGGCTTG CCGAGCAGTT CCAGCAGCTG
GCCGAACATG CCGGGCGGCG CGGCATGCGC ATTGCCTACG AGGCGCTGGC CTGGGGCAGC
AAGGTCAGGC TCTGGTCGCA GGCCTGGGCC GTGGTCGAGC GGGTCAGCCA TCCGCATCTG
GGCCTGGCGC TCGATAGCTT TCATACGCTG TCGCTGCGCG ATGACCCGAG CGGCATTGCC
CAGCTGCCGG GCGAGAAAAT CTTTTTCGTC CAGCTGGCCG ATGCCCCCTG GATCAATACC
GACGTGCTGA CCCACAGCCG GCACTACCGC TGCTTTCCCG GCCAAGGCGA ATTCGAGATG
GCGAAGTTCA CCGCGGCCGT GCTCGATGCC GGTTACAGCG GCCCGCTGTC GCTGGAAATT
TTCAACGACG AATTCCGCGC CGCGCCGGCC CGAGCGAACG CCGTCGATGC CATGCGCTCG
CTGCTGTGGC TGGAAGAGCA GGTGCAACTG GGCCGCGCCA GGGACGCACA GCCTTACAAG
GTGCCGCTGT TCACGCCGCC GGCCCCGCCG ACCTTCACCG GGTGGGCGTT CATCGAGTTC
GCCGTCGATC CGGCCGCCGG CGCGCGTCTG GCCGCGTGGC TTCGGGCCTG CGGCTTTCAG
TGCATCGGCC ACCACCGGTC CAAGAATGTG GACCTGTACG GCCAGGGCGA AGTCCGCATC
ATCGTCAACC TGGAAGAGGA TTCCTTCGCC CGCAGCCATT TCGAATTGCA CGGCACTTCG
GTCTGCGCCG TCGCGCTGGC CACGCCGGAT GTGGCCGGTG CGCTGGCGCG TGCCGAGGCG
CTGCAGTGCC CGCGCGTGCT CGGCCGCGTG GGCCAGCACG AGTTGACCAT CCCGGCGGTG
CGCGCGCCGG ACGGCAGCCT CGTTTATTTC TGCGAGACGG CTCAGAGTGG CCGCTACCCG
TTCGAGGCCG ATTTTGTGAT GGATGAAGCC GCGCTGCACG GTGGGGCGCT CGGTGACTCC
GCGCGCATCG ACCACCTGGT ACAGGCCGTG CCGGCAGGGC AGGTCGAGCC CTGGGTGTTG
TTCCATCGCG CGGTGCTGGG CCTGGTGCCT GAGCGCAACG TGGTGCTGCA TGACCCCTAT
GGCGTGATCA GGAGCCGCGA GATTGAATCA GCCGACCGCG CGGTGCGCGT GTCGATCACC
GTTTCCGAAC GCGACAACAC CTCGGTGTCT CGCGCCGTGT CCAGCTTCCG GGGTGCCGGC
ATGCAGCAGA TCGCGCTGGC CGTGAGTGAC CTGATTGCCA CGGCGCGCGC GCTCAAGGCC
GCGGGTGCGC CGCTGTTGCC GGTACCGGCC AATTACTACG ACGACCTGGC CGCCAAATAC
GACATCGACG CCGCTGAGCT GGCGGCTATG CGCGAGCTGG GCATCCTGTA TGACCGCGAA
GCCAATGGTG GCGAGTTCCT GCACCTGTAC CTCACGCCGT TTGACGACCG CTTCCATTTC
GAGCTGACCG AGCGGCGTGG CGGCTACACC GGTTACGGTG CGCCGAATGC CCCGGCGCGC
CTGGCAGCGA TGGCATTGTG GCGCCAGACA CGGGGCGAGT CCCAGGATCG GCCCTGA
 
Protein sequence
MRRSIATVSL SGMLREKLQA AAAAHFDGVE IFENDLLQFP GSPREVRLMA EDLGLSIDMF 
QPFRDFDGTT PAQLARNLER AERKFDVMQE LGTQLILVCS NVQPDALSDV DRLAEQFQQL
AEHAGRRGMR IAYEALAWGS KVRLWSQAWA VVERVSHPHL GLALDSFHTL SLRDDPSGIA
QLPGEKIFFV QLADAPWINT DVLTHSRHYR CFPGQGEFEM AKFTAAVLDA GYSGPLSLEI
FNDEFRAAPA RANAVDAMRS LLWLEEQVQL GRARDAQPYK VPLFTPPAPP TFTGWAFIEF
AVDPAAGARL AAWLRACGFQ CIGHHRSKNV DLYGQGEVRI IVNLEEDSFA RSHFELHGTS
VCAVALATPD VAGALARAEA LQCPRVLGRV GQHELTIPAV RAPDGSLVYF CETAQSGRYP
FEADFVMDEA ALHGGALGDS ARIDHLVQAV PAGQVEPWVL FHRAVLGLVP ERNVVLHDPY
GVIRSREIES ADRAVRVSIT VSERDNTSVS RAVSSFRGAG MQQIALAVSD LIATARALKA
AGAPLLPVPA NYYDDLAAKY DIDAAELAAM RELGILYDRE ANGGEFLHLY LTPFDDRFHF
ELTERRGGYT GYGAPNAPAR LAAMALWRQT RGESQDRP
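
The sequences above are printed in space-separated groups of ten, so they need cleaning before use. The following is a minimal sketch of stripping that layout and translating the gene sequence; the helper names are hypothetical. It uses the standard codon assignments, which translation table 11 shares for ordinary codons, and it does not handle the alternative start codons that table 11 permits (this gene begins with ATG, so that does not matter here).

```python
# Compact standard codon table: bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate a cleaned CDS, dropping a single trailing stop codon."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three groups of the gene sequence above.
first_groups = "ATGCGCCGTT CCATCGCCAC GGTTTCCCTT"
print(translate(clean_sequence(first_groups)))  # MRRSIATVSL
```

The printed residues match the first group of the protein sequence listed above.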