Gene BURPS668_2039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2039 
Symbol 
ID4881960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2031957 
End bp2033045 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content67% 
IMG OID640127967 
ProductPHB depolymerase family esterase 
Protein accessionYP_001059074 
Protein GI126441643 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3509] Poly(3-hydroxybutyrate) depolymerase 
TIGRFAM ID[TIGR01840] esterase, PHB depolymerase family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.249198 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCAGAA AAAAGTCCAG CATCTGGCTA CCGTACTTCG ACCTGCTCTC CCTGACGCTG 
CGCGCGCGCC CTCGGCAAAA GGTCGCCCAA CCCAAGCCGT CCGTCACGCG CCCCACAAGC
GCCGCGAAGA CTCGGAGCAC GAAACGCACG CGCGCCGCGC ATGCCGGGCG AACGTCGGCG
GACAACGCGC GCGGCACCTG GCTGCGCTCG TTTCACTCCG CCCGCCCCGC GGCGGGACGC
TTGATCAATC ATCTCGCGTA TGCGCTGTAT CTGCCCGCCG CACCGGCCGC GGCGGCAAGC
ATGCCCGCCG TCGTCATGTT GCATGGATGC AAACAGACCG CGGAGTCGTT CGCCACCGGC
ACGCGCATTT GCGATCTCGC GCAGCGGGCG GGGTTTGCCG TGTTGCTTCC CGAGCAGGCC
AAGACTTCGC ATTCTCACCG GTGCTGGAAC TGGCACGGCG ATTCATCGCA GTCCGAAGCG
CCGGCCGTCG CCTCGCTCGT CGACGCGATC GTTCGGCAGT ACGGTTTCGA CCGCGAGCGA
ATCTATCTGG CGGGCCTCTC CGCGGGAGCC GGCCTGGCGG CGGGACTCGC GATGCGCTAT
CCCGAGCTCT TCGCGGCCGT CGGCCTGCAC TCCGGCCCGG TCTTCGGCGC GCCCTCGTCC
ACCCTCGCGG CGATGAGCCT GATGCGCGGC GGCAGCCGGG AAGATCCGCT ACGCGTCATC
GAAAACTGCG TCGACGTTTC GGATCATCCC GGCATGCCCG CACTCATCGT CCACGGTGAA
CACGATACGG TGGTGGCGAA GCAGAACGCG ATGCAACTGG GTCTCGAGTT CGCGCGAATC
AATCGGCTCA TCGACGGGCA GGGCACACTG CGCGTGGGCG AGCAACACGT CTACAGCCGC
AAGGGCGTCG ACTATACCGA CTATCTCAAG TCCGGGCGGC TCGTCGTCAG GGTGTGCATC
GTTCACGGGC TGCGGCACGC ATGGAGTGGC GGCGATCCGC GCGAAGCATT CCATTCCGCC
ACCGGGCCGG ATGCCACCGC GATGTTCTGG CATTTCTTTC GGCCGCGGCG TCGCAAGCGG
GCACAGTGA
 
Protein sequence
MRRKKSSIWL PYFDLLSLTL RARPRQKVAQ PKPSVTRPTS AAKTRSTKRT RAAHAGRTSA 
DNARGTWLRS FHSARPAAGR LINHLAYALY LPAAPAAAAS MPAVVMLHGC KQTAESFATG
TRICDLAQRA GFAVLLPEQA KTSHSHRCWN WHGDSSQSEA PAVASLVDAI VRQYGFDRER
IYLAGLSAGA GLAAGLAMRY PELFAAVGLH SGPVFGAPSS TLAAMSLMRG GSREDPLRVI
ENCVDVSDHP GMPALIVHGE HDTVVAKQNA MQLGLEFARI NRLIDGQGTL RVGEQHVYSR
KGVDYTDYLK SGRLVVRVCI VHGLRHAWSG GDPREAFHSA TGPDATAMFW HFFRPRRRKR
AQ