Gene Bcep18194_A3420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A3420 
Symbol 
ID3748588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp283139 
End bp284236 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content63% 
IMG OID637761685 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_367666 
Protein GI78064897 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGATCC CGAACTGGGA CAACCCCGTC GGCACCGACG GCTTCGAATT CATCGAATAC 
ACGGCACCGG ACCCGAAAGC GCTCGGACAA CTGTTCGAAC GGATGGGTTT CACCGCGATC
GCGCGCCACC GCCACAAGGA CGTGACGGTG TACCGCCAGG GCGACATCAA CTTCATCATC
AACGCCGAAC CCGATTCATT CGCACAACGC TTCGCGCGCC TGCACGGCCC GTCGATCTGC
GCGATCGCGT TCCGCGTGCA GGATGCCGCG AAGGCGTACC AGCACGCGCT CGACCTCGGC
GCCTGGGGCT TCGACAACAA GACCGGCCCG ATGGAGCTGA ACATCCCGGC GATCAAGGGC
ATTGGCGACT CGCTGATCTA CTTCGTCGAC CGCTGGCGCG GCAAGAACGG CGCGCAACCG
GGCGCCATCG GCAACATCAG CATCTATGAC GTCGATTTCG AGCCGATCGC CGGCGCGAAC
CCGAACCCGG TCGGCCACGG CCTCACCTAT ATCGACCACC TGACGCACAA CGTGCATCGC
GGCCGCATGC AGGAATGGGC CGAGTTCTAC GAGCGCCTGT TCAACTTCCG CGAAGTGCGC
TACTTCGACA TCGAAGGCAA GGTGACGGGC GTGAAGTCGA AGGCGATGAC GTCGCCGTGC
GGCAAGATCC GCATCCCGAT CAACGAGGAA GGCTCGGATA CGGCCGGCCA GATCCAGGAA
TACCTCGACG CGTACCACGG CGAAGGCATC CAGCACATCG CGCTCGGCGC CACCGACATC
TACCAGGCGG TCGATGGCCT GCGCAGCAAG GAAGTGAAGC TGCTCGACAC GATCGACACG
TATTACGAAC TGGTCGACCG TCGCGTGCCG AACCACGGCG AATCGCTGGA CGAACTGAAG
AAGCGCAAGA TCCTGATCGA CGGCGCGCGC GACGACCTGC TGCTGCAGAT ATTCACCGAG
AACCAGATCG GGCCGATCTT CTTCGAGATC ATCCAGCGCA AGGGCAACCA GGGCTTCGGC
GAAGGCAACT TCAAGGCGCT GTTCGAATCG ATCGAACTCG ACCAGATCCG CCGCGGCGTC
GTGCAGGACA AGGCCTGA
 
Protein sequence
MQIPNWDNPV GTDGFEFIEY TAPDPKALGQ LFERMGFTAI ARHRHKDVTV YRQGDINFII 
NAEPDSFAQR FARLHGPSIC AIAFRVQDAA KAYQHALDLG AWGFDNKTGP MELNIPAIKG
IGDSLIYFVD RWRGKNGAQP GAIGNISIYD VDFEPIAGAN PNPVGHGLTY IDHLTHNVHR
GRMQEWAEFY ERLFNFREVR YFDIEGKVTG VKSKAMTSPC GKIRIPINEE GSDTAGQIQE
YLDAYHGEGI QHIALGATDI YQAVDGLRSK EVKLLDTIDT YYELVDRRVP NHGESLDELK
KRKILIDGAR DDLLLQIFTE NQIGPIFFEI IQRKGNQGFG EGNFKALFES IELDQIRRGV
VQDKA