Gene BURPS1106A_2628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2628 
Symbolhsp33 
ID4902571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2589108 
End bp2590058 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content67% 
IMG OID640135855 
Productchaperonin, 33 kDa 
Protein accessionYP_001066881 
Protein GI126454324 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1281] Disulfide bond chaperones of the HSP33 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.202437 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGACC AGTTACAGAA ATTCATGTTC AACGCAGCCC CGGTGCGCGG CGAGATCGTC 
TCGCTGCGCA GCACGTGGCA AGAGGTGCTC ACGCGCCGCG ACTACCCGAC GCCCGTGCGC
AACGTGCTCG GCGAGATGAT GGCGGCGTGC GCGCTGCTGT CGGCGAACCT GAAGTTCGAC
GGCACGCTCA TCATGCAGAT CTTCGGCGAC GGGCCGGTGA AGATGCTCGT CGTCCAGTGC
AGCTCGGATC TCGCGATGCG CGCGACCGCG AAATTCTCGG GCGACGCCGC GCGAACCGTC
GGCGACGGCA CTTCGTTCGC CGAACTGATC AATGCGAGCG GCCACGGCCG TTGCGTGATC
ACGCTCGATC CGGCCGACAA GCGTCCCGGC CAGCAGCCCT ATCAGGGCAT CGTGCCGCTG
AACGGCGAAG ACGGCCCGCT CGCGTCGATC GCCGACGTGC TCGAGCACTA CATGCGCCAT
TCCGAGCAGC TCGACACGCG CCTCTGGCTC GCCGCCGACC ACGATCGCGC GGTCGGCGTG
CTGCTGCAGA AGCTGCCGGG CGACGGCGGC ATCGTGCCGC GCGTCGAGCA AACCGATACG
GATACATGGG AGCGCGTGTG CACGCTCGGC GGCACGCTGT CGTCGAAAGA GCTGCTCGAA
GTGGAACCCG AGACCGTGTT TCGGCGTCTG TTCTGGCAGG AGAATGTGCA GCACTTCGAA
CCGACGTCCA CGCGCTTCCA GTGCACGTGC TCGCGCGAGA AAGTCGGCGG GATGCTGCGC
ATGCTCGGGC GCGTCGAGAT CGACGGCGTG ATCGAAGAGC GCGGCCACGT CGAGATCCAC
TGCGAATTCT GCAATCAGCG CTACGAATTC GATCCGGTCG ACGTCGCCCA GCTGTTCTCG
ACGCCCGAGC TCGGCACCGG GGTCGCGCCC GCCGCCGCGC AACGGCACTG A
 
Protein sequence
MSDQLQKFMF NAAPVRGEIV SLRSTWQEVL TRRDYPTPVR NVLGEMMAAC ALLSANLKFD 
GTLIMQIFGD GPVKMLVVQC SSDLAMRATA KFSGDAARTV GDGTSFAELI NASGHGRCVI
TLDPADKRPG QQPYQGIVPL NGEDGPLASI ADVLEHYMRH SEQLDTRLWL AADHDRAVGV
LLQKLPGDGG IVPRVEQTDT DTWERVCTLG GTLSSKELLE VEPETVFRRL FWQENVQHFE
PTSTRFQCTC SREKVGGMLR MLGRVEIDGV IEERGHVEIH CEFCNQRYEF DPVDVAQLFS
TPELGTGVAP AAAQRH