Gene RPB_1654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1654 
Symbol 
ID3909931 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1883709 
End bp1884734 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content67% 
IMG OID637883548 
Producthistone deacetylase superfamily protein 
Protein accessionYP_485273 
Protein GI86748777 
COG category[B] Chromatin structure and dynamics
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGGCCG TCTACAGCGA ACTGCATCGA AGCCACGACC CGCAATTCTT TCTGGTTCGC 
GGGATCGTCC AGCGCACTAC CGAACAACCC GAACGTGCCG ACCGGCTGCT GGCGGGACTG
AAGGCCGGCG GTCATTCGCT CGTCGAGCCG ACGGCGTTCG GACAGGGCCC GCGCGCCAGG
GTGCATAGTC CGGAGTATCT CGGCTTTCTC GCCGAGGCCT GGGACGCCTG GGCGGCGCTG
GGCAATTCCG GCCCCGAGAT GATCGCCAAC ATCCATCCCG TCCGCAACGA GGCGACGTAT
CCGACGCACA TCGTCGGCCG CCTCGGCTGG CACACGATCG ATACGTCCTG CCCGATCGGA
CCCGGCACCT GGGCCGCGGT CTGCGCCGCG ACCGATGTCG CGACCTCGGC AGCCCAACTC
GTGATGGACG GGGAAGACGC CGCCTACGCG CTGTGTCGTC CGCCGGGGCA CCACGCCTAT
CGCGATCTCG CCAGCGGCTT CTGCTTTCTC AACAACAGCG CGATCGCGGC CGCCCATCTG
CGGCTGAAGC ACGAGCGCGT CGCGATCCTC GACGTCGACG TTCATCATGG CAACGGCACG
CAGGGCATCT TCTACGAGCG GCCCGACGTG CTCACCGTTT CGATTCACGC CGACCCGACC
TTCTTCAACC CCTTTGTCTG GGGCTACGCG CACGAACGCG GCGCGGGTCC GGGGCTCGGC
GCCAATCTGA ACATCCCGCT GGCGAAGGGC ACCGACGATG ACGGCTATAT CGAGGCGCTC
GGTGTCGCGG AAAAGACGAT TCGCGCTTTT GCGCCCGGCG CTCTGGTCGT CGCGCTCGGC
CTCGATGCAT CCGAGCATGA CCCGCTTGCG GGGCTGGCCG TCACGACCGA TGGATTCCAT
CGCATCGGTG GCGCCATCGC GCGGTTGGGG CTGCCCACCG TGTTCGTTCA GGAAGGCGGA
TATCTGTCGG AGATTCTCGG ACCCAACCTG ACGTCGGCGC TCGCCGGCTT CGAGCAGGTT
CGCTAG
 
Protein sequence
MKAVYSELHR SHDPQFFLVR GIVQRTTEQP ERADRLLAGL KAGGHSLVEP TAFGQGPRAR 
VHSPEYLGFL AEAWDAWAAL GNSGPEMIAN IHPVRNEATY PTHIVGRLGW HTIDTSCPIG
PGTWAAVCAA TDVATSAAQL VMDGEDAAYA LCRPPGHHAY RDLASGFCFL NNSAIAAAHL
RLKHERVAIL DVDVHHGNGT QGIFYERPDV LTVSIHADPT FFNPFVWGYA HERGAGPGLG
ANLNIPLAKG TDDDGYIEAL GVAEKTIRAF APGALVVALG LDASEHDPLA GLAVTTDGFH
RIGGAIARLG LPTVFVQEGG YLSEILGPNL TSALAGFEQV R