Gene BURPS1106A_2375 details


Gene Information

Locus tag: BURPS1106A_2375
Symbol:
ID: 4901099
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 2347068
End bp: 2348177
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 68%
IMG OID: 640135604
Product: histone deacetylase family protein putative
Protein accession: YP_001066637
Protein GI: 126454301
COG category: [B] Chromatin structure and dynamics; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein
TIGRFAM ID:
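
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that Start bp and End bp are 1-based inclusive coordinates, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2347068
end_bp = 2348177
reported_gene_length_bp = 1110
reported_protein_length_aa = 369

# Assumption: Start/End are 1-based inclusive coordinates,
# so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not
# counted as a residue, so protein length = codons - 1.
codons, leftover_bases = divmod(span_bp, 3)
expected_protein_length_aa = codons - 1

print("span matches gene length:", span_bp == reported_gene_length_bp)
print("span is a whole number of codons:", leftover_bases == 0)
print("protein length matches:", expected_protein_length_aa == reported_protein_length_aa)
```

Under those assumptions all three checks agree with the reported values (1110 bp, 370 codons, 369 residues).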


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAAAAA CCGCTTTCTT CACCGACGAA CGCACTTTCT GGCACACGGG CGGCGCGCAT 
GCGCTGTTCT TTCCGGTCGG CGGCTGGGTG CAGCCGCCGT CGAGCGCGGG CTATGCGGAA
TCGCCCGATT CGAAGCGGCG CCTGCTGTCG CTCGTGCACG CGTCCGGGCT CGCGGCGAAA
CTCGACATGT CGAGCGCGCC CGCCGCGACC GACGACGATC TGCGGCGCAT CCACCCCGCG
CACTACCTCG ACGCGTTCAA GCGCGCGAGC GACGCGGGTG GCGGCGATCT CGGCGAACTC
GCGCCGTTCG GCCGTGGCAG CTACGAGATC GCGGCGCTAT CCGCGGGGCT CGCGCTCGCC
GCCGTCGACG CGGTGCTCGC CGAGCGCACG GCCAACGCGT TCTCGCTGTC GCGCCCGCCC
GGCCATCACT GCCTGCGTGA CAAGCCGATG GGTTTTTGCC TGCTCGCGAA CATTCCGATC
GCGATCGAGG CCGCGCGCGC GAAACATCGC GTCGAGCGCG TCGCGGTGAT CGACTGGGAC
GTGCATCACG GCAACGGCAC GCAGTCGATC TACTACGACG ATCCGAACAC GCTGACGATC
TCGCTGCATC AGGACCGCTG CTTTCCGCCC GGCTACAGCG GCGCCGACGA ACGCGGCGCG
GGCGCGGGTG CGGGCTCGAA CGTCAACGTC CCGCTCCTCG CGGGCGCCGG CGACGACGCG
TATCGATACG CATTCGAGCG AATCGTGCTG CCCGCGCTCG ATGCGTTCCG GCCGGAGCTC
GTCATCGTCG CGAGCGGGCT CGACGCGAAT GCGGTCGACC CGCTCGCGCG GATGCAACTG
CACAGCGACA GCTACCGGTA CATGACGCAT GCGCTGAAGC AGGCCGCGCA GCGGCACTGC
GGGGGACGGC TCGTCATCGT GCACGAGGGC GGTTATTCGG AGGCCTACGT ACCGTTTTGC
GGGCATGCGA TCGTCGAGGC ACTGGCGGGC ATGCGCACCG ACGTCGCCGA TCCGATGCTC
GAGCTCGCGA TCGCGCAACA GCCCGGCGAG CGTTTCAACG CATTCCAGCG GCAACTGATC
GACGAAATGG CGACGAGCTT CGGTTACTGA
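
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. The following is a minimal sketch of that step together with a GC-fraction calculation; the helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input. Running the same functions over the full sequence should give a figure comparable to the GC content field, though that is not verified here.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace so the 10-base blocks form one contiguous string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record (sample input only).
first_line = "ATGACAAAAA CCGCTTTCTT CACCGACGAA CGCACTTTCT GGCACACGGG CGGCGCGCAT"

seq = join_blocks(first_line)
print(len(seq), "bases, GC fraction", round(gc_fraction(seq), 3))
```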
 
Protein sequence
MTKTAFFTDE RTFWHTGGAH ALFFPVGGWV QPPSSAGYAE SPDSKRRLLS LVHASGLAAK 
LDMSSAPAAT DDDLRRIHPA HYLDAFKRAS DAGGGDLGEL APFGRGSYEI AALSAGLALA
AVDAVLAERT ANAFSLSRPP GHHCLRDKPM GFCLLANIPI AIEAARAKHR VERVAVIDWD
VHHGNGTQSI YYDDPNTLTI SLHQDRCFPP GYSGADERGA GAGAGSNVNV PLLAGAGDDA
YRYAFERIVL PALDAFRPEL VIVASGLDAN AVDPLARMQL HSDSYRYMTH ALKQAAQRHC
GGRLVIVHEG GYSEAYVPFC GHAIVEALAG MRTDVADPML ELAIAQQPGE RFNAFQRQLI
DEMATSFGY
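
The correspondence between the gene sequence and the protein sequence can be spot-checked by translating the first line of each. The sketch below builds a codon table from the standard amino acid assignments, which translation table 11 is understood to share with the standard code (the two differ in permitted start codons, which does not matter here because the gene begins with ATG). The function name `translate` and the variable names are hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, codons ordered TTT, TTC, TTA, TTG, TCT, ...
# Assumption: table 11 uses these same assignments for elongation codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence and first two blocks of the protein sequence.
gene_start = "ATGACAAAAA CCGCTTTCTT CACCGACGAA CGCACTTTCT GGCACACGGG CGGCGCGCAT"
protein_start = "MTKTAFFTDE RTFWHTGGAH"

print(translate(gene_start) == "".join(protein_start.split()))
```

The 60 bases translate to 20 residues that match the start of the listed protein sequence.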