Gene Bpro_4213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_4213 
Symbol 
ID4013117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp4431761 
End bp4432891 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content61% 
IMG OID637943860 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_551003 
Protein GI91790051 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCAC CCCTATCTTT CGAAACAGGC GCCGCCTGGG ACAACCCCAT GGGCACCGAC 
GGTTTCGAGT TCATCGAATA CGCCGCGCCG GACCCCCAGG CCATGGGCGC GCTGTTCGAG
CGCATGGGCT TCAAGCCGAT TGCCAGGCAT CGCCATAAAG ATGTGACGCT GTACCGCCAG
GGCGGCATCA ACTTCATCCT CAATGCCGAG CCCGATTCAT TTGCGCAGCG CTTTGCACGC
CAGCACGGTC CCAGCGTCTG CGCCATTGCG TTCAGGGTGC GGGATGCCAA GGCCGCTTAC
GAGCGCGCGA TTGCGCTGGG CGCCTGGGGT TATGCCCACA CCGCCGGCCC GGGCGAGCTG
AACATCCCGG CCATCAAGGG CATTGGCGAC TCCATCATCT ATTTCATTGA CCGCTGGCGC
GGCAAGAACG GCGCCCGGGA AGGCGATATC GGCAACATCG GTTTTTACGA TGTTGACTTC
GAGCCCCTGC CGGGCGTGAG CGGCGCCGAG GCGCTGAATC CCACGGGCCA TGGGCTGACC
TACATTGACC ACCTGACGCA CAACGTGCAC CGCGGCCGCA TGGACGAGTG GGCCGGCTTC
TACGAGCGCC TGTTCAACTT CCGCGAGATC CGCTACTTCG ACATCGAAGG CCTGGTGACC
GGCGTGAAAA GCAAGGCCAT GACCAGCCCC TGCGGCAAGA TCCGCATCCC GATCAACGAG
GAAGGCAATG AGAAAGCCGG CCAGATCCAG GAGTACCTGG ACCGTTACCA GGGCGAGGGC
ATCCAGCACA TCGCCATGGG CAGCGGCAAT TTGCCAGCCA CCGTGGACAA GCTACGCGCC
AGCGGCATCA AGCTGCTGGA CACGGTAGAC ACCTACTACG AACTGATCGA CAAGCGCATT
CCAGGCCATG GCGAAAATGT GGCGGAACTG CACAAGCGAA AAATTTTGGT GGACGGCAAG
AAAGGCGCGC TTCTGCTGCA GATCTTCAGT GAAAACCAGC TCGGCCCGAT CTTCTTTGAA
TTCATCCAGC GCAAGGGCGA CGAGGGTTTT GGCGAAGGCA ACTTCAAGGC ACTGTTCGAA
AGCATCGAGC TGGACCAGAT GCGCCGAGGG GTTTTGGCGA GTGCACAATA A
 
Protein sequence
MNAPLSFETG AAWDNPMGTD GFEFIEYAAP DPQAMGALFE RMGFKPIARH RHKDVTLYRQ 
GGINFILNAE PDSFAQRFAR QHGPSVCAIA FRVRDAKAAY ERAIALGAWG YAHTAGPGEL
NIPAIKGIGD SIIYFIDRWR GKNGAREGDI GNIGFYDVDF EPLPGVSGAE ALNPTGHGLT
YIDHLTHNVH RGRMDEWAGF YERLFNFREI RYFDIEGLVT GVKSKAMTSP CGKIRIPINE
EGNEKAGQIQ EYLDRYQGEG IQHIAMGSGN LPATVDKLRA SGIKLLDTVD TYYELIDKRI
PGHGENVAEL HKRKILVDGK KGALLLQIFS ENQLGPIFFE FIQRKGDEGF GEGNFKALFE
SIELDQMRRG VLASAQ