Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMASAVP1_1752 |
Symbol | |
ID | 4677483 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei SAVP1 |
Kingdom | Bacteria |
Replicon accession | NC_008784 |
Strand | - |
Start bp | 1720889 |
End bp | 1722943 |
Gene Length | 2055 bp |
Protein Length | 684 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639844266 |
Product | putative 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_991344 |
Protein GI | 121597343 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism [R] General function prediction only |
COG ID | [COG1082] Sugar phosphate isomerases/epimerases [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.10433 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCGTT CGATCGCCAC CGTCTCGCTA TCGGGCACGC TCGTCGAGAA GCTCCGCGCG ATTCGCGCCG CCGGCTTCGA CGGCGTCGAG ATCTTCGAGA ACGACTTGCT CTACTTCGAC GGCTCGCCCG CCGACGTGCG CGCCATCGCC GCTGATCTCG GCCTCGCCAT CGTGCTGTTC CAGCCGTTTC GCGATTTCGA GGGCGTGCCG CCCGAGCGCC TCGCGCGCAA TCTCGAGCGC GCGAAGCGCA AGTTCGAGCT GATGCACGCG CTCGGCGCGA ACCGCATGCT CGTGTGCAGC AACATGTCGC CGGACGCGAT CGGCGACGAC GCACTGCTCA TCGACCAGTT GGGCGCGCTC GCGCGCGCCG CGCAGGCGGC GGGCGTCGTC GCCGCGTACG AGGCGCTGGC ATGGGGTCGG AACGTGAAGA CTTATGGCCA TGCGTGGCGG CTCGTCGACG CGGTGAACCA TCCGAGCCTC GGGCTTGCGC TCGACAGCTT CCATACGCTG TCGCTCGGCG ATTCGCCGGA CGGCATCGCG CGCATTCCCG GCGAGCGGAT CGCGTTCGTG CAGATCGCCG ACGCGCCGAA GCTCGCGATG GACGTGCTCG AATGGAGCCG GCATTACCGG TCGTTTCCGG GCCAGGGCGA TTTCGACCTC GCGGGATTCA CCGCGCGCGT GATCGAATCG GGCTACGCCG GGCCGCTGTC GCTCGAGATC TTCAACGACG GCTTTCGCGC TGCGCCGACC GCGCTGACGG CCGCGGACGG CTACCGGTCG CTGCTGTATC TCGAGGAGAC CACGCGCGAG CGGCTCGCGT GCGACGCGCG GCGCGCACGT CGGGCGGGCG GCGCGCCGGG AACGGGCGAA ACGCGCGGCG AGCGCGAAAG ACACGGCGCG CACGGCGAAC GTGGCGAACA CCGCGCACAC GATAAGGACG ACAAGCCAGA CGACAAGCGC GGCGGCCCCG ACGAACGCGA AAGCCGCGCG GCGCACCCGC GCCCCGCGCC GCCGCTCTTC GCGCCGCCGC CCGCGCCCGC GCACGTCGGC TTTCAGTTCA TCGAATTCGC GGTCGACGCG GCCGCCGCCG AGAACGTCGC CGGCTGGCTC GGCAAGCTGC GCTTTCGGCG CGCGGGCCGT CACCGCTCGA AGGACGTGAC GCTGTATCAG CACGGCGCGG CGTCGATCGT GTTGAACGCC GAGCGCGATT CGTTCGCCGA CGCGTTCTTC CAGGAGCACG GCCTGTCGCT GTGCGCGTCG GCGTTTCGCG TCGACGATGC GCGTCTCGCG TTCGAGCGCG CGGCGGGCTA CGGCTACGCG CCGTTCTCGG GCCGCGTCGG CCCGAACGAG CGCGTGCTGC CGAGCGTGCG CGCGCCCGAC GGCAGCCTGA ACTACTTCGT CGACGAGGCG CCCGGCGCGC CGACGCTGTA CGAATCGGAT TTCGTGCTGA CCGACATCGA CGGGCCAAGC GAAGTCGGCC CGCTCGCCGG CATCGATCAC GTGTGCCTCG CGCTGCCCGC CGACGCGCTC GATACGTGGG TGCTGTTCTT CAAGACCGCG TTCGGCTTCG AGGCCGAGCG CAACTGGCTC GTGCCGGACC CGTACGGGCT CGTGCGCAGC CGCGCGGTGC GCAGCCCGGA CGGCTCGGTG CGCATCGCGC TCAATGCGTC GGTGGACCGG CATACGGCCG TCGTCCGGTC GCTCGAGCGC TATCGCGGCA CGGGGCTCAA TCATGTCGCG TTCCGCGCGG ACGACATCGT CGCGGCGATC GCCGAATTCG CCGCGGACGG CGTGCCGTTC CTGCGGATTC CGCGCAATTA CTACGACGAT CTCGCCGCAC GCTACGCGCT GCCCGACGAG ACGATCGACA CGCTGCGCCG CCATCACCTG CTGTACGACC GCGACGACGC GGGCGGCGAA TTCCTGCATG CGTACACCGA GCTCGTCGAC GGCCGCTTCT CGTTCGAGAT CGTCGAGCGG CGCGGCGGCT ACGACGGATA CGGCGCGGCG AACGCAGCCG TGCGGCTCGC CGCGCAGGCG CAGCGCAGGG GGTAA
|
Protein sequence | MQRSIATVSL SGTLVEKLRA IRAAGFDGVE IFENDLLYFD GSPADVRAIA ADLGLAIVLF QPFRDFEGVP PERLARNLER AKRKFELMHA LGANRMLVCS NMSPDAIGDD ALLIDQLGAL ARAAQAAGVV AAYEALAWGR NVKTYGHAWR LVDAVNHPSL GLALDSFHTL SLGDSPDGIA RIPGERIAFV QIADAPKLAM DVLEWSRHYR SFPGQGDFDL AGFTARVIES GYAGPLSLEI FNDGFRAAPT ALTAADGYRS LLYLEETTRE RLACDARRAR RAGGAPGTGE TRGERERHGA HGERGEHRAH DKDDKPDDKR GGPDERESRA AHPRPAPPLF APPPAPAHVG FQFIEFAVDA AAAENVAGWL GKLRFRRAGR HRSKDVTLYQ HGAASIVLNA ERDSFADAFF QEHGLSLCAS AFRVDDARLA FERAAGYGYA PFSGRVGPNE RVLPSVRAPD GSLNYFVDEA PGAPTLYESD FVLTDIDGPS EVGPLAGIDH VCLALPADAL DTWVLFFKTA FGFEAERNWL VPDPYGLVRS RAVRSPDGSV RIALNASVDR HTAVVRSLER YRGTGLNHVA FRADDIVAAI AEFAADGVPF LRIPRNYYDD LAARYALPDE TIDTLRRHHL LYDRDDAGGE FLHAYTELVD GRFSFEIVER RGGYDGYGAA NAAVRLAAQA QRRG
|
| |