Gene Bphyt_3671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBphyt_3671 
Symbol 
ID6283539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia phytofirmans PsJN 
KingdomBacteria 
Replicon accessionNC_010681 
Strand
Start bp4112425 
End bp4113522 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content58% 
IMG OID642623260 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_001897285 
Protein GI187925643 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0292973 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000943483 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAGGTTT CAACTTGGGA AAATCCGCTC GGCACAGACG GCTTCGAGTT CATTGAATAC 
ACCGCGCCGG ATCCGAAAGC GCTCGGCAAG CTGTTCGAAC AGATGGGCTT CACCGCCGTG
GCCCGGCATC GTCATAAGGA CGTGACGCTG TACCGCCAGG GAGAAATCAA CTTCATCGTC
AACGGCGAGC CGGATTCGTT CGCGCAACGC TTCACGCGTT TGCACGGCCC TTCCATCTGC
GCGATCGCTT TCCGCGTTCA GGACGCCGCC AAGGCGTACA AAGAAGCGCT GGAAAAGGGC
GCCTGGGGCT TCGACAACAA AACCGGCCCG ATGGAATTGA ACATTCCGGC GATCAAGGGC
ATTGGCGACT CGCTGATCTA TTTCGTCGAT CGGTGGCGCG GCAAGAACGG CGCGGAGCCG
AACAGCATCG GCAACATCGA CATTTACGAT GTCGACTTCG AACCGATTGC CGGCGCGAAC
CCGAATCCGG TCGGCCACGG CCTGACCTAC ATCGACCACC TGACGCATAA CGTGCATCGC
GGCCGGATGC AGGAATGGGC GGAGTTCTAC GAGCGTCTGT TCAACTTCCG CGAAGTGCGT
TATTTCGACA TCGAAGGCAA GGTGACGGGC GTGAAGTCGA AGGCAATGAC CTCGCCGTGC
GGCAAGATTC GCATCCCGAT CAATGAAGAA GGTTCGGAAA CCGCCGGCCA GATTCAGGAA
TATCTCGACG CGTATCACGG CGAAGGCATT CAGCACATTG CCCTCGGCAG CAACGACATC
TACCGCACGG TGGACGGCTT GCGCGGATCG AATATCTCGC TGCTCGACAC GATCGACACG
TATTACGAGC TAGTCGATCG CCGCGTGCCG AATCACGGCG AGCCGCTCGA CGAACTGCGC
AAACGCAAGA TTCTGATCGA CGGCGCACCC GAAGATCTGC TGCTGCAGAT TTTCACCGAA
AACCAGATTG GCCCGATCTT CTTCGAGATC ATTCAGCGCA AGGGCAATCA GGGCTTCGGC
GAGGGCAACT TCAAGGCACT GTTCGAATCG ATCGAACTGG ACCAGATTCG CCGTGGCGTG
GTGCAAGACA AGGTCTGA
 
Protein sequence
MQVSTWENPL GTDGFEFIEY TAPDPKALGK LFEQMGFTAV ARHRHKDVTL YRQGEINFIV 
NGEPDSFAQR FTRLHGPSIC AIAFRVQDAA KAYKEALEKG AWGFDNKTGP MELNIPAIKG
IGDSLIYFVD RWRGKNGAEP NSIGNIDIYD VDFEPIAGAN PNPVGHGLTY IDHLTHNVHR
GRMQEWAEFY ERLFNFREVR YFDIEGKVTG VKSKAMTSPC GKIRIPINEE GSETAGQIQE
YLDAYHGEGI QHIALGSNDI YRTVDGLRGS NISLLDTIDT YYELVDRRVP NHGEPLDELR
KRKILIDGAP EDLLLQIFTE NQIGPIFFEI IQRKGNQGFG EGNFKALFES IELDQIRRGV
VQDKV