Gene BamMC406_5194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBamMC406_5194 
Symbol 
ID6179882 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia ambifaria MC40-6 
KingdomBacteria 
Replicon accessionNC_010552 
Strand
Start bp2391546 
End bp2393438 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content69% 
IMG OID641684950 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_001811855 
Protein GI172064204 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG1082] Sugar phosphate isomerases/epimerases
[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCGCT CGATCGCCAC CGTGTCACTG TCCGGCACCC TTGCCGAGAA GCTCGCCGCC 
GTCCAGGCCG CCGGCTTCGA CGGTGTCGAA ATCTTCGAGA ACGACCTCGT CTACTTCGAC
GGATCGCCGG CCGACGTGCG GCGCATGGCC GCCGATCTCG GCCTCGACAT CGTGCTGTTC
CAGCCGTTTC GCGACTTCGA GGGCGTGAGC CCGGCCCAGC TCGCCCGCAA TCTCGACCGC
ATCCGGCGCA AGTTCGATGT GATGCACGCG CTCGGCACCG ACCGCATCCT GGTGTGCAGC
AACGTGTCGC CCGACACGAT CGCGGACGAC GCGCTGCTGG TCGACCAGCT CGGCGCGCTC
GCCGAAGCCG CGCGGCAGGC CGGCGTGGTC GCCGCGTACG AGGCGCTCGC GTGGGGCCGC
GTCGTGAAGC GGTACGGCCA TGCGTGGCAG CTCGTGAATG CGGTGAACAG CCCGCACCTC
GGCCTCGCGC TCGACAGCTT CCATACGCTG TCGCTCGACG ATTCGCCGGA CGCGATCGCC
GACATCCCCG GCGACCGGAT CGCGTTCGTG CAGATCGCCG ATGCGCCGAA GCTCGCGATG
GACGTGCTCG AATGGAGCCG CCACTTCCGC TCGTTCCCCG GCCAGGGCGA CTTCGACCTC
GCACGCTTCA CGGCTCGCGT GATCGAGTCC GGCTATACGG GGCCGCTGTC GCTCGAGATC
TTCAACGACG GCTTCCGCGC CGCGCCGACC GCGATCACGG CCGCCGACGG CCACCGCTCG
CTGCTCTATC TGGAAGAACT GACGCATGCG CGGCTCGCAC AGGACGGCCG TGCGCCGGCT
GCCGACCAGC CGCTGTTCGC GCCGCCGGCG CCGCCCGCGC ACGTCGGGTT CCAGTTCATC
GAGTTCGCGG TCGACGCGCA GGCCGCAGCC ACCGTCGGCG CATGGCTCGG CAAGATGCGG
TTCCGGCTCG CGGGCCGCCA TCGGTCGAAG GACGTAACGC TGTTCCAGCA CGGCGCCGCG
TCGATCGTGC TGAACGCCGA GCGCGACTCG TTCGCCGATG CGTTCTTCCA GCAGCACGGG
TTGTCGCTCT GTGCGTCGGC GTTTCGCGTC GACGATGCGA AAGTGGCGTT CGAACGCGCG
GCCGGCTTCG GCTATGCGCC GTTTTCCGGG CGCGTCGGCC CGAACGAGCG CGTGCTGCCG
AGCGTGCAGG CGCCGGACGG CAGCCTCGAA TACTTCGTCG ACGAGACGCC GAACGCGCCG
ACGCTGTATG AATCGGATTT CGTGCTGACC GACATCGACG GCCCCACCGA GGTCGGCCCG
CTGACGGGCA TCGACCACGT GTGCCTCGCG CTGCCGGCCG ACGCGCTCGA TACGTGGATC
CTGTTCTTCA AGACCGCATT CGGCTTCGAG GCCGAACGCA GCTGGCTCGT GCCCGACCCG
TACGGCCTGA TGCGCAGCCG GGCGGTGCGC AGCGCGGACG GCTCGGTGCG GATCGCGCTG
AATGCGTCGG TCGACCGCCA TACGGCCGTC GCCGAATCGC TGGACCGCTA TCACGGCACG
GGGCTGAATC ACGTCGCGTT CCGGACCGAC GACATCGTGA AGACGGTCGC CGCGTTCGCA
GCCGACGGCG TGCCGTTCCT GCGGATCCCG CCGAACTACT ACGACGATCT CGCCGCGCGC
TACGCGCTGC CGGACGAATT GATCGACACG CTGAGCGCCC ACCATCTGCT GTACGACCGC
GACGAAAACG GCGGCGAATT CCTGCATGCG TACACGGAAC TGGTCGACAA CCGCTTCTCG
CTCGAGATCG TCGAGCGGCG CGGCGGCTAT GACGGCTACG GTGCGACAAA CGCGGCCGTG
CGGCTCGCCG CGCAGGCCCA GCGCAGGAAA TAA
 
Protein sequence
MQRSIATVSL SGTLAEKLAA VQAAGFDGVE IFENDLVYFD GSPADVRRMA ADLGLDIVLF 
QPFRDFEGVS PAQLARNLDR IRRKFDVMHA LGTDRILVCS NVSPDTIADD ALLVDQLGAL
AEAARQAGVV AAYEALAWGR VVKRYGHAWQ LVNAVNSPHL GLALDSFHTL SLDDSPDAIA
DIPGDRIAFV QIADAPKLAM DVLEWSRHFR SFPGQGDFDL ARFTARVIES GYTGPLSLEI
FNDGFRAAPT AITAADGHRS LLYLEELTHA RLAQDGRAPA ADQPLFAPPA PPAHVGFQFI
EFAVDAQAAA TVGAWLGKMR FRLAGRHRSK DVTLFQHGAA SIVLNAERDS FADAFFQQHG
LSLCASAFRV DDAKVAFERA AGFGYAPFSG RVGPNERVLP SVQAPDGSLE YFVDETPNAP
TLYESDFVLT DIDGPTEVGP LTGIDHVCLA LPADALDTWI LFFKTAFGFE AERSWLVPDP
YGLMRSRAVR SADGSVRIAL NASVDRHTAV AESLDRYHGT GLNHVAFRTD DIVKTVAAFA
ADGVPFLRIP PNYYDDLAAR YALPDELIDT LSAHHLLYDR DENGGEFLHA YTELVDNRFS
LEIVERRGGY DGYGATNAAV RLAAQAQRRK