Gene BURPS1710b_A0962 details


Gene Information

Locus tag: BURPS1710b_A0962
Symbol: dehII
ID: 3694456
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007435
Strand:
Start bp: 1223109
End bp: 1224059
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 72%
IMG OID: 637731216
Product: haloacid dehalogenase, type II
Protein accession: YP_336120
Protein GI: 76818958
COG category: [R] General function prediction only
COG ID: [COG1011] Predicted hydrolase (HAD superfamily)
TIGRFAM ID: [TIGR01428] 2-haloalkanoic acid dehalogenase, type II
            [TIGR01493] Haloacid dehalogenase superfamily, subfamily IA, variant 2 with 3rd motif like haloacid dehalogenase
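
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive (an assumption that matches the stated 951 bp) and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1223109, 1224059)
print(length, protein_length_aa(length))  # prints "951 316"
```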


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGGACG GCAGAGGTGC GTTTCTTTCG GACGTGCAGG CGCGGCGGTT CGCGCGTGGC 
GGATCGTACG GGGCGGTCCT GCGTGCGTCG CCCAGGCGCG CGCCGCCGGC CGGCGCGCGG
CTCGCCGCGG CGTTCGGCCG GGCGCGCGCG TTCGGCGTCG GGCCGGTGCG AGCCGCCGAC
GGCGGCCGCT GCATCCTGAA ACGTCATCCC ACGACTCAGG AGAACATCAT GCAGACGCTT
GGCGTGAAGG CATTGGTATT CGACGTGTTC GGCACCGTGG TCGACTGGCG TTCCGGCGTC
ATTCGCGACG CGACGCCGTT CCTCGCGAAG TACGGCGGCG CGGGAGCCGA TCCGGCCGCG
TTCGCGGATG CGTGGCGCGC GGGCTATTCG CCCGCGATGG AGGAGGTGCG CAGCGGCCGC
CGGCCGTTCA CGCGGCTCGA CGTGCTGCAC CGGGAGAATC TCGACGCGCT GCTGCCCGCG
TTCGGCATCG ATCGCGCGAG CGTGGCCGAC GCCGATCTCG ACGCGCTGAA CCTCGCATGG
CACCGGCTCG ATCCGTGGCC CGATTCGGTC GCGGGGCTCA CGCGGCTGAA GGCGCATTAC
ATCATCGCGC CGCTGTCGAA CGGCAACGTG ATCCTGATGA TCGACATGGC CAAGCGCGCG
GGGCTGCCGT GGGACGCGAT CCTCGGCGCC GAAGTGGCGC AGGCGTACAA GCCGACGCCC
GAAGCGTACC TGCGCACGGC CGATATCCTC GCGCTGCGTC CGGATGAGGT GTGCCTCGTC
GCCGCGCACA ACGGCGACCT CGCGGCCGCG CGGCGCTGCG GCTATCGCAC CGCGTTCGTC
GCGCGAGCGC GCGAGCATGG TCCCGCGCAG ACCACCGATC TGCGCGCGGA GCAGGATTGG
GACGTCGTCG CGGCCGATTT CATCGAGCTC GCGCAGCGCT TCGGCGCGTG A
 
Protein sequence
MSDGRGAFLS DVQARRFARG GSYGAVLRAS PRRAPPAGAR LAAAFGRARA FGVGPVRAAD 
GGRCILKRHP TTQENIMQTL GVKALVFDVF GTVVDWRSGV IRDATPFLAK YGGAGADPAA
FADAWRAGYS PAMEEVRSGR RPFTRLDVLH RENLDALLPA FGIDRASVAD ADLDALNLAW
HRLDPWPDSV AGLTRLKAHY IIAPLSNGNV ILMIDMAKRA GLPWDAILGA EVAQAYKPTP
EAYLRTADIL ALRPDEVCLV AAHNGDLAAA RRCGYRTAFV ARAREHGPAQ TTDLRAEQDW
DVVAADFIEL AQRFGA
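
The gene and protein sequences above are printed in space-separated blocks of ten, so they need to be cleaned before any comparison. The sketch below strips the whitespace and translates DNA to protein; it is an illustration, not the database's own procedure. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), and it uses only the first and last lines of the gene sequence as sample input. The names `clean`, `translate`, `first_line` and `last_line` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above. The last line
# starts at position 901, so it is still in frame.
first_line = "ATGTCGGACG GCAGAGGTGC GTTTCTTTCG GACGTGCAGG CGCGGCGGTT CGCGCGTGGC"
last_line = "GACGTCGTCG CGGCCGATTT CATCGAGCTC GCGCAGCGCT TCGGCGCGTG A"

print(translate(first_line))  # MSDGRGAFLSDVQARRFARG
print(translate(last_line))   # DVVAADFIELAQRFGA
```

Both outputs agree with the start and end of the protein sequence listed above. Passing the full gene sequence to `translate` would allow the same comparison against all 316 residues.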