Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_B0343 |
Symbol | |
ID | 3752102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007511 |
Strand | - |
Start bp | 366233 |
End bp | 368125 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637765188 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_371103 |
Protein GI | 78061195 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism [R] General function prediction only |
COG ID | [COG1082] Sugar phosphate isomerases/epimerases [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.880013 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCGCT CGATCGCCAC CGTGTCACTG TCCGGCACGC TCGCCGAGAA ACTCGCCGCC ATCCAGGCCG CAGGCTTCGA CGGCGTCGAA ATCTTCGAAA ACGACCTCGT CTACTTCGAT GGATCGCCGG CCGACGTCCG GCGCATGGCC GCGGATCTCG GCCTCGACAT CGTGCTGTTC CAGCCGTTTC GCGACTTCGA GGGCGTGAGC GCGACCCAGC TCGCCCGCAA CCTCGACCGG ATCAAGCGCA AGTTCGACGT GATGCATGCG CTCGGCACCG ACCGCATCCT GGTGTGCAGC AACGTGTCGG CCGACACGAT CGCGGACGAC GGGCTGCTCG TCGACCAGCT CGGTACGCTC GCTGAGGCCG CGCGGCAGGC CGGCGTGGTC GCCGCGTACG AAGCGCTCGC GTGGGGCCGT GTCGTGAAGC GGTACGGCCA TGCGTGGCAA CTCGTGAATA CCGTGAACAG CCCGCATCTC GGCCTGGCGC TCGACAGCTT CCACACGCTG TCGCTCGACG ATTCGCCGGA CGCAATCGCC GACATCCCCG GCGACCGGAT CGCGTTCGTG CAGATCGCCG ATGCGCCGAA GCTCGCGATG GACGTGCTCG AATGGAGCCG CCATTACCGC TCGTTCCCGG GCCAGGGCGA TTTCGACCTC GCGCACTTCA CTGCGCGCGT GATCGAGTCC GGCTATACGG GGCCGCTGTC GCTCGAGATC TTCAACGACG GCTTCCGCGC GGCGCCCACG GCGATCACGG CCGCCGACGG CCATCGCTCG CTGCTCTATC TGGAGGAACT GACGCGCGAG CGGCTCGCAC AGGACGGCCA TGCGCCGGCC GCCAGCCAGC CGCTGTTCGC GCCGCCGGCG CCGCCCGCGC ACGTCGGGTT CCAGTTCATC GAATTCGCGG TCGACGCGCA GGCGGCGGCG ACCGTCGGCG AATGGCTCGG CCGGATGCGC TTCCGGCTCG CGGGCCGCCA TCGGTCGAAG GACGTGACGC TGTTCCAGCA CGGCGCCGCG TCGATCGTGC TGAACGCGGA GCGCGATTCG TTCGCCGATG CGTTCTTCCA GCAGCATGGG CTGTCGCTGT GTGCATCGGC ATTCCGCGTC GACGACGCGA ACGTGGCGTT CGAGCGCGCA GCAGGCTTCG GCTATGCGCC GTTCTCCGGC CGCGTCGGGC CGAACGAGCG CGTGTTGCCG AGCGTGCAGG CGCCGGACGG CAGCCTCGAA TACTTCGTCG ACGAAGCGCC GAACGCGCCG ACGCTGTACG AATCGGATTT CGTGCTGACC GACATCGACG GCCCGAGCGA AGTCGGGCCG CTGACCGGTA TCGACCATGT GTGCCTCGCC CTGCCGGCCG ACGCGCTCGA TACGTGGATC CTGTTCTTCA AGACGGCATT CGGCTTCGAG GCCGAGCGCA GCTGGCTCGT GCCCGACCCG TACGGGCTAA TGCGCAGCCG CGCGGTGCGC AGCGCCGACG GCTCGGTGCG GATCGCGCTG AACGCGTCGG TCGACCGGCA CACGGCGGTG GCCGAATCGC TCGACCGCTA CCACGGCACA GGGCTGAACC ACGTCGCATT CCGCACGGAC GACATCGTGA AGACGATCGC CGCGTTCGCG GCCGACGGCA TTCCGTTCCT GCGCATTCCG CCGAACTACT ACGACGATCT CGCCGCACGC TACGCGTTGT CGGACGAACT GATCGACACG CTGAGCACGC ATCACCTGCT GTACGACCGC GACGAGCACG GCGGCGAATT CCTGCATGCG TATACAGAAC TGGTCGACAA CCGCTTCTCG CTCGAGATCG TCGAGCGGCG CGGCGGATAC GACGGCTATG GCGCGACAAA CGCGGCCGTG CGGCTCGCCG CACAGGCGCA GCGCAGGAAA TAA
|
Protein sequence | MQRSIATVSL SGTLAEKLAA IQAAGFDGVE IFENDLVYFD GSPADVRRMA ADLGLDIVLF QPFRDFEGVS ATQLARNLDR IKRKFDVMHA LGTDRILVCS NVSADTIADD GLLVDQLGTL AEAARQAGVV AAYEALAWGR VVKRYGHAWQ LVNTVNSPHL GLALDSFHTL SLDDSPDAIA DIPGDRIAFV QIADAPKLAM DVLEWSRHYR SFPGQGDFDL AHFTARVIES GYTGPLSLEI FNDGFRAAPT AITAADGHRS LLYLEELTRE RLAQDGHAPA ASQPLFAPPA PPAHVGFQFI EFAVDAQAAA TVGEWLGRMR FRLAGRHRSK DVTLFQHGAA SIVLNAERDS FADAFFQQHG LSLCASAFRV DDANVAFERA AGFGYAPFSG RVGPNERVLP SVQAPDGSLE YFVDEAPNAP TLYESDFVLT DIDGPSEVGP LTGIDHVCLA LPADALDTWI LFFKTAFGFE AERSWLVPDP YGLMRSRAVR SADGSVRIAL NASVDRHTAV AESLDRYHGT GLNHVAFRTD DIVKTIAAFA ADGIPFLRIP PNYYDDLAAR YALSDELIDT LSTHHLLYDR DEHGGEFLHA YTELVDNRFS LEIVERRGGY DGYGATNAAV RLAAQAQRRK
|
| |