Gene BURPS668_1148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1148 
Symbol 
ID4882040 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1125940 
End bp1126965 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content69% 
IMG OID640127076 
Productdeacetylases 
Protein accessionYP_001058197 
Protein GI126438867 
COG category[B] Chromatin structure and dynamics
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGACGT ACTTCCACCC CGATCAATCA CTGCATCATC CGCGCACGTA CTTCTCGCGC 
GGCCGGATGC GCATGCCGCA GGAGGTGCCC GAGCGCGCGG CGCGGCTCGT CGCGGCGGCG
TTCGCGATGG GTTTTCCGGT GCGCGAGCCG GACGATTTCG GCATCGCGCC GATCGCGGCC
GTGCACGACA CGCACTACCT GCGCTTTCTC GAGACCGTGC ATCGCGAATG GAAGGCGATG
CCGGAGGACT GGGGCGACGA AGCGATGTCG AATATTTTCG TGCGCGAGCC GAACGCGTTG
CGCGGCGTGC TCGCACAGGC CGCCCGTCAT CTCGCGGACG GCAGTTGCCC GGTCGGCGAG
CACACGTGGC GCGCGGCGTA CTGGTCCGCG CAGAGCGCGC TCGCGGCGGC GGCGGCGGTG
CGCGACGGCG CGCCCGCAGC GTATGCGCTG TGCCGGCCGC CGGGCCATCA TGCGCGCGTC
GACGCCGCGG GCGGCTTCTG TTATCTGAAC AACGCGGCGA TCGCCGCGCA GGCGCTGCGC
GCGCACCATG CGCGCGTCGC CGTCCTCGAC ACCGACATGC ATCACGGGCA AGGCATACAG
GAAATCTTCT ACGCGCGGCG CGACGTGCTG TACGTATCGA TTCACGGCGA TCCGACGAAC
TTCTACCCGG CCGTCGCGGG CTTCGACGAC GAGCGCGGCG CGGGCGAAGG CCTCGGCTAC
AACGTGAATC TGCCGATGCC GCACGGCTCG AGCGAAGCGG CGTTCTTCGA GCGCGTCGAC
GATGCGCTGC GCGAGTTGCG GCGCTTCGCG CCCGATGCGC TCGTGCTGTC GCTTGGGTTC
GACGTCTATC GCGACGACCC GCAATCGCAG GTGGCGGTGA CGACGGACGG TTTCGGTCGG
TTGGGACACC TGATCGGCGC GCTGCGGCTG CCGACCGTCA TCGTGCAGGA AGGCGGCTAT
CACATCGAGA GCCTCGAGGC GAATGCGCGG TCGTTCTTCG GCGGATTCGG CGCGCTGCGC
GGTTGA
 
Protein sequence
MLTYFHPDQS LHHPRTYFSR GRMRMPQEVP ERAARLVAAA FAMGFPVREP DDFGIAPIAA 
VHDTHYLRFL ETVHREWKAM PEDWGDEAMS NIFVREPNAL RGVLAQAARH LADGSCPVGE
HTWRAAYWSA QSALAAAAAV RDGAPAAYAL CRPPGHHARV DAAGGFCYLN NAAIAAQALR
AHHARVAVLD TDMHHGQGIQ EIFYARRDVL YVSIHGDPTN FYPAVAGFDD ERGAGEGLGY
NVNLPMPHGS SEAAFFERVD DALRELRRFA PDALVLSLGF DVYRDDPQSQ VAVTTDGFGR
LGHLIGALRL PTVIVQEGGY HIESLEANAR SFFGGFGALR G