Gene BURPS1710b_A1894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A1894 
Symbol 
ID3694383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp2309219 
End bp2311273 
Gene Length2055 bp 
Protein Length684 aa 
Translation table11 
GC content71% 
IMG OID637732148 
Productputative amino acid dioxygenase 
Protein accessionYP_337051 
Protein GI76818027 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG1082] Sugar phosphate isomerases/epimerases
[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCGTT CGATCGCCAC CGTCTCGCTA TCGGGCACGC TCGTCGAGAA GCTCCGCGCG 
ATTCGCGCCG CCGGCTTCGA CGGCGTCGAG ATCTTCGAGA ACGACTTGCT CTACTTCGAC
GGCTCGCCCG CCGACGTGCG CGCCATCGCC GCCGATCTCG GCCTCGCCAT CGTGCTGTTC
CAGCCGTTTC GCGATTTCGA GGGCGTGCCG CCCGAGCGCC TCGCGCGCAA TCTCGAGCGC
GCGAAGCGCA AGTTCGAGCT GATGCACGCG CTCGGCGCGA ACCGCATGCT CGTGTGCAGC
AACATGTCGC CGGACGCGAT CGGCGACGAC GCACTGCTCA TCGACCAGTT GGGCGCGCTC
GCGCGCGCCG CGCAGGCGGC GGGCGTCGTC GCCGCGTACG AGGCGCTGGC ATGGGGTCGG
AACGTGAAGA CTTATGGCCA TGCGTGGCGG CTCGTCGACG CGGTGAATCA TCCGAGCCTC
GGGCTTGCGC TCGACAGCTT CCATACGCTG TCGCTCGGCG ATTCGCCGGA CGGCATCGCA
CGCATTCCCG GCGAGCGGAT CGCGTTCGTG CAGATCGCCG ACGCGCCGAA GCTCGCGATG
GACGTGCTCG AATGGAGCCG GCATTACCGG TCGTTTCCGG GCCAGGGCGA TTTCGACCTC
GCGGGATTCA CCGCGCGCGT GATCGAATCG GGCTACGCCG GGCCGCTGTC GCTCGAGATC
TTCAACGACG GCTTTCGCGC CGCGCCGACC GCGCTGACGG CCGCGGACGG CTACCGGTCG
CTGCTGTATC TCGAGGAGAC CACGCGCGAG CGGCTCGCGT GCGACGCGCG GCGCGCACGT
CGGGCGGGCG GCGCGCCGGG AGCGGGCGAA ACGCGCGGCG AGCGCGAAAG ACACGACGCG
CACGGCGAAC GTGGCGAACA CCGCGCGCAC GATAAGGACG ACAAGCCAGA CGACAAGCGC
GGCGGCCCCG ACGAACGCGA AAGCCGCGCG GCGCACCCGC GCCCCGCGCA GCCGCTCTTC
GCGCCGCCGC CCGCGCCCGC GCACGTCGGC TTTCAGTTCA TCGAATTCGC GGTCGACGCG
GCCGCCGCCG AGAACGTCGC CGGCTGGCTC GGCAAGCTGC GCTTTCGGCG CGCGGGCCGT
CACCGCTCGA AGGACGTGAC GCTGTATCAG CACGGCGCGG CGTCGATCGT GTTGAACGCC
GAGCGCGATT CGTTCGCCGA CGCGTTCTTT CAGGAGCACG GCCTGTCGCT GTGCGCGTCG
GCGTTTCGCG TCGACGATGC GCGTCTCGCG TTCGAGCGCG CGGCGGGCTA CGGCTACGCG
CCGTTCTCGG GCCGCGTCGG CCCGAACGAG CGCGTGCTGC CGAGCGTGCG CGCGCCCGAC
GGCAGCCTGA ACTACTTCGT CGACGAGGCG CCCGGCGCGC CGACGCTGTA CGAATCGGAT
TTCGTGCTGA CCGACATCGA CGGGCCAAGC GAAGTCGGCC CGCTCGCCGG CATCGATCAC
GTGTGCCTCG CGCTGCCCGC CGACGCGCTC GATACGTGGG TGCTGTTCTT CAAGACCGCG
TTCGGCTTCG AGGCCGAGCG CAACTGGCTC GTGCCGGACC CGTACGGGCT CGTGCGCAGC
CGCGCGGTGC GCAGCCCGGA CGGCTCGGTG CGCATCGCGC TCAATGCGTC GGTGGACCGG
CATACGGCCG TCGTCCGGTC GCTCGAGCGC TATCGCGGCA CGGGGCTCAA TCATGTCGCG
TTCCGCGCGG ACGACATCGT CGCGGCGATC GCCGAATTCG CCGCGGACGG CGTGCCGTTC
CTGCGGATTC CGCGCAATTA CTACGACGAT CTCGCCACAC GCTACGCGCT GCCCGACGAG
ACGATCGACA CGCTGCGCCG CCATCACCTG CTGTACGACC GCGACGACGC GGGCGGCGAA
TTCCTGCATG CGTACACCGA GCTCGTCGAC GGCCGCTTCT CGTTCGAGAT CGTCGAGCGG
CGCGGCGGCT ACGACGGATA CGGCGCGGCG AACGCAGCCG TGCGGCTCGC CGCGCAGGCG
CAGCGCAGGG GGTAA
 
Protein sequence
MQRSIATVSL SGTLVEKLRA IRAAGFDGVE IFENDLLYFD GSPADVRAIA ADLGLAIVLF 
QPFRDFEGVP PERLARNLER AKRKFELMHA LGANRMLVCS NMSPDAIGDD ALLIDQLGAL
ARAAQAAGVV AAYEALAWGR NVKTYGHAWR LVDAVNHPSL GLALDSFHTL SLGDSPDGIA
RIPGERIAFV QIADAPKLAM DVLEWSRHYR SFPGQGDFDL AGFTARVIES GYAGPLSLEI
FNDGFRAAPT ALTAADGYRS LLYLEETTRE RLACDARRAR RAGGAPGAGE TRGERERHDA
HGERGEHRAH DKDDKPDDKR GGPDERESRA AHPRPAQPLF APPPAPAHVG FQFIEFAVDA
AAAENVAGWL GKLRFRRAGR HRSKDVTLYQ HGAASIVLNA ERDSFADAFF QEHGLSLCAS
AFRVDDARLA FERAAGYGYA PFSGRVGPNE RVLPSVRAPD GSLNYFVDEA PGAPTLYESD
FVLTDIDGPS EVGPLAGIDH VCLALPADAL DTWVLFFKTA FGFEAERNWL VPDPYGLVRS
RAVRSPDGSV RIALNASVDR HTAVVRSLER YRGTGLNHVA FRADDIVAAI AEFAADGVPF
LRIPRNYYDD LATRYALPDE TIDTLRRHHL LYDRDDAGGE FLHAYTELVD GRFSFEIVER
RGGYDGYGAA NAAVRLAAQA QRRG