Gene BURPS1106A_A0481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0481 
SymbolhppD 
ID4905254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp470449 
End bp472503 
Gene Length2055 bp 
Protein Length684 aa 
Translation table11 
GC content71% 
IMG OID640143587 
ProductAP endonuclease 
Protein accessionYP_001074523 
Protein GI126457644 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG1082] Sugar phosphate isomerases/epimerases
[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCGTT CGATCGCCAC CGTCTCGCTA TCGGGCACGC TCGTCGAGAA GCTCCGCGCG 
ATTCGCGCCG CCGGCTTCGA CGGCGTCGAG ATCTTCGAGA ACGACTTGCT CTACTTCGAC
GGCTCGCCCG CCGACGTGCG CGCCATCGCC GCCGATCTCG GCCTCGCCAT CGTGCTGTTC
CAGCCGTTTC GCGATTTCGA GGGCGTGCCG CCCGAGCGCC TCGCGCGCAA TCTCGAGCGC
GCGAAGCGCA AGTTCGAGCT GATGCACGCG CTCGGCGCGA ACCGCATGCT CGTGTGCAGC
AACATGTCGC CGGACGCGAT CGGCGACGAC GCACTGCTCA TCGACCAGTT GGGCGCGCTC
GCGCGCGCCG CGCAGGCGGC GGGCGTCGTC GCCGCGTACG AGGCGCTGGC ATGGGGTCGG
AACGTGAAGA CTTATGGCCA TGCGTGGCGG CTCGTCGACG CGGTGAACCA TCCGAGCCTC
GGGCTTGCGC TCGACAGCTT CCATACGCTG TCGCTCGGCG ATTCGCCGGA CGGCATCGCG
CGCATTCCCG GCGAGCGGAT CGCGTTCGTG CAGATCGCCG ACGCGCCGAA GCTCGCGATG
GACGTGCTCG AATGGAGCCG GCATTACCGG TCGTTTCCGG GCCAGGGCGA TTTCGACCTC
GCGGGATTCA CCGCGCGCGT GATCGAATCG GGCTACGCCG GGCCGCTGTC GCTCGAGATC
TTCAACGACG GCTTTCGCGC CGCGCCGACC GCGCTGACGG CCGCGGACGG CTACCGGTCG
CTGCTGTATC TCGAGGAGAC CACGCGCGAG CGGCTCGCGT GCGACGCGCG GCGCGCACGT
CGGGCGGGCG GCGCGCCGGG AGCGGGCGAA ACGCGCGGCG AGCGCGAAAG ACACGACGCG
CACGGCGAAC GTGGCGAACA CCGCGCGCAC GATAAAGACG ACAAGCCAGA CGACAAGCGC
GGCGGCCCCG ACGAACGCGA AAGCCGCGCG GCGCACCCGC GCCCCGCGCA GCCGCTCTTC
GCGCCGCCGC CCGCGCCCGC GCACGTCGGC TTTCAGTTCA TCGAATTCGC GGTCGACGCG
GCCGCCGCCG AGAACGTCGC CGGCTGGCTC GGCAAGCTGC GCTTTCGGCG CGCGGGCCGT
CACCGCTCGA AGGACGTGAC GCTGTATCAG CACGGCGCGG CGTCGATCGT GTTGAACGCC
GAGCGCGATT CGTTCGCCGA CGCGTTCTTT CAGGAGCACG GCCTATCGCT GTGCGCGTCG
GCGTTTCGCG TCGACGATGC GCGTCTCGCG TTCGAGCGCG CGGCGGGCTA CGGCTACGCG
CCGTTCTCGG GCCGCGTCGG CCCGAACGAG CGCGTGCTGC CGAGCGTGCG CGCGCCCGAC
GGCAGCCTGA ACTACTTCGT CGACGAGGCG CCCGGCGCGC CGACGCTGTA CGAATCGGAT
TTCGTGCTGA CCGACATCGA CGGGCCAAGC GAAGTCGGCC CGCTCGCCGG CATCGATCAC
GTGTGCCTCG CGCTGCCCGC CGACGCGCTC GATACGTGGG TGCTGTTCTT CAAGACCGCG
TTCGGCTTCG AGGCCGAGCG CAACTGGCTC GTGCCGGACC CGTACGGGCT CGTGCGCAGC
CGCGCGGTGC GCAGCCCGGA CGGCTCGGTG CGCATCGCGC TCAATGCGTC GGTGGACCGG
CATACGGCCG TCGTCCGGTC GCTCGAGCGC TATCGCGGCA CGGGGCTCAA TCATGTCGCG
TTCCGCGCGG ACGACATCGT CGCGGCGATC GCCGAATTCG CCGCCGACGG CGTGCCGTTC
CTGCGGATTC CGCGCAATTA CTACGACGAT CTCGCCGCAC GCTACGCGCT GCCCGACGAG
ACGATCGACA CGCTGCGCCG CCATCACCTG CTGTACGACC GCGACGACGC GGGCGGCGAA
TTCCTGCATG CGTACACCGA GCTCGTCGAC GGCCGCTTCT CGTTCGAGAT CGTCGAGCGG
CGCGGCGGCT ACGACGGATA CGGCGCGGCG AACGCGGCCG TGCGGCTCGC CGCGCAGGCG
CAGCGCAGGG GGTAA
 
Protein sequence
MQRSIATVSL SGTLVEKLRA IRAAGFDGVE IFENDLLYFD GSPADVRAIA ADLGLAIVLF 
QPFRDFEGVP PERLARNLER AKRKFELMHA LGANRMLVCS NMSPDAIGDD ALLIDQLGAL
ARAAQAAGVV AAYEALAWGR NVKTYGHAWR LVDAVNHPSL GLALDSFHTL SLGDSPDGIA
RIPGERIAFV QIADAPKLAM DVLEWSRHYR SFPGQGDFDL AGFTARVIES GYAGPLSLEI
FNDGFRAAPT ALTAADGYRS LLYLEETTRE RLACDARRAR RAGGAPGAGE TRGERERHDA
HGERGEHRAH DKDDKPDDKR GGPDERESRA AHPRPAQPLF APPPAPAHVG FQFIEFAVDA
AAAENVAGWL GKLRFRRAGR HRSKDVTLYQ HGAASIVLNA ERDSFADAFF QEHGLSLCAS
AFRVDDARLA FERAAGYGYA PFSGRVGPNE RVLPSVRAPD GSLNYFVDEA PGAPTLYESD
FVLTDIDGPS EVGPLAGIDH VCLALPADAL DTWVLFFKTA FGFEAERNWL VPDPYGLVRS
RAVRSPDGSV RIALNASVDR HTAVVRSLER YRGTGLNHVA FRADDIVAAI AEFAADGVPF
LRIPRNYYDD LAARYALPDE TIDTLRRHHL LYDRDDAGGE FLHAYTELVD GRFSFEIVER
RGGYDGYGAA NAAVRLAAQA QRRG