Gene Bphy_3887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBphy_3887 
Symbol 
ID6245423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia phymatum STM815 
KingdomBacteria 
Replicon accessionNC_010623 
Strand
Start bp860666 
End bp861793 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content64% 
IMG OID642595655 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_001860062 
Protein GI186472720 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAGCG ACCTGCCGAC CCCCAACGAC GCGCTCAAGG CCGTCGCGGA CCCGGAGCAC 
AACCCGCTCG GCACAGCCGG CATCGAGTTC GTCGAATTCG CGGCGCGTCA CCCGCAGATG
CTGGCGGAGA CGTTCGTGAA GCTCGGCTTC AAGCCGATCG CGCGCCATAT CAGCAAGGAC
GTGACGCTGT TCCGTCAGGC CGACATGAAT TTCCTGCTCA ATGCCGAGCC GGATTCGTTC
GCCGAGCGTT ACGCGGAAGA ATATGGCGTG GGCATTTGCG CAATCGGCAT TCGCGTCGCC
GATGCACAAC GCGCTTACGA TCGCGCCATC GAACTCGGCG CGTGGTCGTT CGAAGGCGAG
CGCCTCGGCA AGGGCGAACT CGTGATCCCC GCCATTCAGG GCATCGGCGA CTCGCACATT
TACTTCATCG ACCGCTGGCG CGGGCGCGGC GGCCAGAAGG GCGGCCTGGG CGACATCTCG
ATCTTCGACA TCGACTTTCG TCCGATCGAA GTCAGCACCG CGCAAGCCGA TCTGAGCCAC
GCGGGCACGG GTCTCGTCGC CGTCGATCAT CTGACGCAAA CGGTCGGCGC AGGCCGCATG
CAGGAGTGGA TCGACTTCTA TCGCGAGCTG CTCAATTTCC GCGAGATTCA TGAACTGCAC
GCGAATTGGC ATGTGTCGGC GGAATCGCGT GTGATGGTGT CGCCGTGCGG CGCGATCCGC
ATTCCCCTCT ACGAGGAAGG CACGCGCCGC ACGAGCCTGA TGCACGAGTA TCTGCCCGAT
CATCCCGGCG AAGGCGTGCA GCACATCGCG CTCGCGACCG ACGACATCTT CGCGTGCACC
GATCAGCTGC TTGCCAACGG CATCGAACTG GTCGAGCCGC CGCCCGCGTA TTACGAGCAG
ATCGACGCTC GCTTGCCGGG ACATGGGCTC GATATCGAGC GCCTGAAGCG CGGACGGATT
CTCGTGGACG GCGAGATCGG CGCAGACGGC GTGCCGCTTC TGTTTTTCCA GACCTTTGTG
CTGCGACGAG AAGGGGACAT CTTCTTCGAG ATCGTGCAGC GCGAGGGTCA TCACGGTTTC
GGCGAGGGCA ATCTCAGCGC ACTGGCGCAG GCGCGCTCGG CCGCATAA
 
Protein sequence
MPSDLPTPND ALKAVADPEH NPLGTAGIEF VEFAARHPQM LAETFVKLGF KPIARHISKD 
VTLFRQADMN FLLNAEPDSF AERYAEEYGV GICAIGIRVA DAQRAYDRAI ELGAWSFEGE
RLGKGELVIP AIQGIGDSHI YFIDRWRGRG GQKGGLGDIS IFDIDFRPIE VSTAQADLSH
AGTGLVAVDH LTQTVGAGRM QEWIDFYREL LNFREIHELH ANWHVSAESR VMVSPCGAIR
IPLYEEGTRR TSLMHEYLPD HPGEGVQHIA LATDDIFACT DQLLANGIEL VEPPPAYYEQ
IDARLPGHGL DIERLKRGRI LVDGEIGADG VPLLFFQTFV LRREGDIFFE IVQREGHHGF
GEGNLSALAQ ARSAA