Gene BURPS1106A_A2536 details


Gene Information

Locus tag: BURPS1106A_A2536
Symbol: dehII
ID: 4904916
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 2493464
End bp: 2494498
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 71%
IMG OID: 640145639
Product: haloacid dehalogenase, type II
Protein accession: YP_001076566
Protein GI: 126456413
COG category: [R] General function prediction only
COG ID: [COG1011] Predicted hydrolase (HAD superfamily)
TIGRFAM ID: [TIGR01428] 2-haloalkanoic acid dehalogenase, type II
            [TIGR01493] Haloacid dehalogenase superfamily, subfamily IA, variant 2 with 3rd motif like haloacid dehalogenase
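
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2493464
end_bp = 2494498
gene_length_bp = 1035
protein_length_aa = 344

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: an unspliced CDS has one codon per residue plus one stop codon.
codons = span // 3
expected_protein_length = codons - 1

coordinates_consistent = span == gene_length_bp and span % 3 == 0
protein_consistent = expected_protein_length == protein_length_aa

print(span, codons, expected_protein_length)  # 1035 345 344
print(coordinates_consistent, protein_consistent)  # True True
```

Under those assumptions the reported gene length (1035 bp) and protein length (344 aa) agree with the coordinates.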


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.762748
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTGCTTGC GTGCCCGTGC GCTCGTGCGC GATGAAGCCG GCCTCTATTT CTCGCCGGGC 
ATGCAGGCAG GCGGTTCGCC GATAATGTCG GACGGCAGAG GTGCGTTTCT TTCGGACGTG
CAGGCGCGGC GGTCCGCGCG TGGCGGATCG TACGGGGCGG TCCTGCGTGC GTCGCCCAGG
CGCGCGCCGC CGGCCGGCGC GCGGCTCGCC GCGGCGTTCG GCCGGGCGCG CGCGTTCGGC
GTCGGGCCGG TGCGAGCCGC CGACGGCGGC CGCTGCATCC TGAAACGTCA TCCCACGACT
CAGGAGAACA TCATGCAGAC GCTTGGCGTG AAGGCATTGG TATTCGACGT GTTCGGCACC
GTGGTCGACT GGCGTTCCGG CGTCATTCGC GACGCGACGC CGTTCCTCGC GAAGTACGGC
GGCGCGGGAG CCGATCCGGC CGCGTTCGCG GATGCGTGGC GCGCGGGCTA TTCGCCCGCG
ATGGAGGAGG TGCGCAGCGG CCGCCGGCCG TTCACGCGGC TCGACGTGCT GCACCGGGAG
AATCTCGACG CGCTGCTGCC CGCGTTCGGC ATCGATCGCG CGAGCGTGGC CGACGCCGAT
CTCGACGCGC TGAACCTCGC ATGGCACCGG CTCGATCCGT GGCCCGATTC GGTCGCGGGG
CTCACGCGGC TGAAGGCGCA TTACATCATC GCGCCGCTGT CGAACGGCAA CGTGATCCTG
ATGATCGACA TGGCCAAGCG CGCGGGGCTG CCGTGGGACG CGATCCTCGG CGCCGAAGTG
GCGCAGGCGT ACAAGCCGAC GCCCGAAGCG TACCTGCGCA CGGCCGATAT CCTCGCGCTG
CGTCCGGATG AGGTGTGCCT CGTCGCCGCG CACAACGGCG ACCTCGCGGC CGCGCGGCGC
TGCGGCTATC GCACCGCGTT CGTCGCGCGA GCGCGCGAGC ATGGTCCCGC GCAGACCACC
GATCTGCGCG CGGAGCAGGA TTGGGACGTC GTCGCGGCCG ATTTCATCGA GCTCGCGCAG
CGCTTCGGCG CGTGA
 
Protein sequence
MCLRARALVR DEAGLYFSPG MQAGGSPIMS DGRGAFLSDV QARRSARGGS YGAVLRASPR 
RAPPAGARLA AAFGRARAFG VGPVRAADGG RCILKRHPTT QENIMQTLGV KALVFDVFGT
VVDWRSGVIR DATPFLAKYG GAGADPAAFA DAWRAGYSPA MEEVRSGRRP FTRLDVLHRE
NLDALLPAFG IDRASVADAD LDALNLAWHR LDPWPDSVAG LTRLKAHYII APLSNGNVIL
MIDMAKRAGL PWDAILGAEV AQAYKPTPEA YLRTADILAL RPDEVCLVAA HNGDLAAARR
CGYRTAFVAR AREHGPAQTT DLRAEQDWDV VAADFIELAQ RFGA
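
The gene sequence begins with GTG while the protein begins with M, which fits translation table 11, where GTG can act as an initiation codon and is read as methionine. The following is a minimal sketch of that translation, not the database's own method. The helper `translate_cds` and the `START_CODONS` set are hypothetical names, and the set lists only three of the table 11 start codons. The codon-to-amino-acid assignments used are those of the standard code, which table 11 shares.

```python
from itertools import product

# Codon table in TCAG order; table 11 uses the same amino acid
# assignments as the standard code and differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the table 11 initiation codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        # An initiation codon in the first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate_cds("GTGTGCTTGC GTGCCCGTGC GCTCGTGCGC"))  # MCLRARALVR
print(translate_cds("CGCTTCGGCG CGTGA"))  # RFGA
```

Both fragments match the corresponding ends of the protein sequence listed above.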