Gene Rsph17029_1054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1054 
Symbol 
ID4895117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1085423 
End bp1086532 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content75% 
IMG OID640111641 
Producthistone deacetylase superfamily protein 
Protein accessionYP_001042937 
Protein GI126461823 
COG category[B] Chromatin structure and dynamics
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.836355 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.241062 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCTTTC CGGCCGCGCC TCTCTACATC GGCTCCGAGA TCTATCGCCG GTCGAGCTAC 
GGTGGCAGGC ATCCGCTGCG CGTGCCGCGG GTCTCGACCG TGACGGATCT CGGCCGCGCG
CTCGGCTGGC TGCCGGCTGC CAGCTTCCGC ACCGCGCCCC GGGCCAAGCC CGCCGCGCTC
ACCCTCTGGC ACGATCCGGC CTATGTGGCG GCGCTGGCGG CGGCCGAGGC GGGGCTTGCC
GCGCCCGAAC TGCTGGCCGC CCACGGGCTC GGCACGCTCT CGAACCCGGT CTTTCCCGAG
ATGTTCCGCC GGCCCGCGAC CGGCGCGGGG GGCGTCATGC TCGCAGCCGA GCTGCTGCGC
GGCGGCGGCG CGATCCATGT GCCGGGCGGA GGCACGCACC ACGGGATGCG CGACCGGGCG
AACGGCTTCT GCTATCTGAA CGATCCGGTG CTGGGGATGC TGGTCCTGCG CCGGATGGGG
CTCCGGCGCA TCGCCTATGT CGATATCGAC GCCCATCATC CCGACGGGGT GGAGGCGGCC
TTCGCGGGCG ATCCCGAGGC GCTCCTGATC TCGGTGCACG AGGAGGGGCG CTGGCCCTTC
ACCGGTGCGC TCGAGGACGA GGGCGGCGGC ACCTGCTTCA ACCTGCCGGT GCCGCGCGGG
TTCAACGACA CCGAGATGCG GGCGGTGCTC GAGGGGCTGA TCCTGCCGCG GCTCGCGGCC
TTCCGCCCCG AGGCTCTGGT GCTGCAATGC GGGGCCGATG CGCTGGAGGA GGATCCGCTG
TCGCGGCTCT CCCTCTCGAA CAACGTTCAT TTCGAGGTGG TGGCGGCGCT GCGTCCGGTG
GCGCCGCGCT TTCTCGTGCT CGGCGGCGGC GGCTACAACC CGTGGTCGGT CGGTCGCTGC
TGGGCGGGCG TCTGGGCCAC GCTGGCGGGC CACGAGATCC CCGACCGTCT GCCGGAGGCG
GCGCGCGCGG TGCTGGCGGG CCTCGACTGG GGCGGGGGCG GGCGTCCTCC GCCGGCGCCC
GCGCTGATCG AGACGCTGCG CGACGCGCCG CGGGAGGGGC CGGTGCGGCC GGAGATCCGG
GAGAGGCTCG CCCGCCTCGC GCGGCGATGA
 
Protein sequence
MAFPAAPLYI GSEIYRRSSY GGRHPLRVPR VSTVTDLGRA LGWLPAASFR TAPRAKPAAL 
TLWHDPAYVA ALAAAEAGLA APELLAAHGL GTLSNPVFPE MFRRPATGAG GVMLAAELLR
GGGAIHVPGG GTHHGMRDRA NGFCYLNDPV LGMLVLRRMG LRRIAYVDID AHHPDGVEAA
FAGDPEALLI SVHEEGRWPF TGALEDEGGG TCFNLPVPRG FNDTEMRAVL EGLILPRLAA
FRPEALVLQC GADALEEDPL SRLSLSNNVH FEVVAALRPV APRFLVLGGG GYNPWSVGRC
WAGVWATLAG HEIPDRLPEA ARAVLAGLDW GGGGRPPPAP ALIETLRDAP REGPVRPEIR
ERLARLARR