Gene RoseRS_3190 details

Gene Information

Locus tag: RoseRS_3190
Symbol:
ID: 5210161
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4014235
End bp: 4015269
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 61%
IMG OID: 640596782
Product: histone deacetylase superfamily protein
Protein accession: YP_001277501
Protein GI: 148657296
COG category: [B] Chromatin structure and dynamics; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein
TIGRFAM ID:
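
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes 1-based, inclusive coordinates and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Values taken from the record above.
    length = gene_length(4014235, 4015269)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
```

With the values from this record, the results agree with the listed Gene Length (1035 bp) and Protein Length (344 aa).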


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAACAG CCATTGCCAT CAACCCGCGC CACGCCGCCC ACGACGAGCC GCAGCACGTT 
GAGCAGGCGG CGCGTCTGCA TGCGATTACT GCTGCACTGA ACGCCAGTGG TCTACGTTCA
GTGCTGCTCG AAGTTCCGGC GCGTCCGGCG ACTGAGGCGC AACTGCGCGC TGTTCACACC
GAACAGATGA TTGAAGTGGT GCGCTGGTCG GCGACACGCC CGCGATCGTG GATCGATCAC
GACACATACA CTACATCGGC GAGTTGGGAT GCGGCGCTCA TGGCTGCTGG CACGACCCTG
GCGGTCGTTG ACGCGGTCGT CAGCGGGTCG GCGCAGAATG GGTTTGCGCT GGTTCGTCCA
CCCGGTCATC ACGCGACCCG AGCCGAATCC ATGGGGTTTT GCCTGTTCAA CAACGTTGCC
ATTGCTGCGC GCCACGCTAT CGACCATCTG GGAGTCACAC GGGTGGCAAT CGTTGATTTT
GATGTGCATC ACGGAAACGG AACGCAGGAT ATCTTCTACG ATGATGATCG GGTCTTCTTC
TGTTCGACGC ACGCTTCGCC GCTCTACCCG GGCACCGGCG CCGAACGTGA GATCGGTTCG
GGCAGAGGAC GCGGCACGAC GATGAATCTT CCGCTCCCTC ACGGCGTCGG CGACGCCGGG
TTTGCCCGCC TGTTCGATGA TGTCGTCATT CCGGCGCTGC GGCGTTATCG TCCAGACCTG
ATCCTGGTGT CCGCCGGTTA TGATGCCCAT TGGGCTGATC CACTTGGACC ATTGACGCTG
TCGGTCGCCG GGTATGCTGC ACTGACGCGC CGCCTGAAGG AAACGGCTGA AGAGGTCTGT
AACGGACGTA TCGCGCTGGT GCTCGAGGGC GGCTACAACC TGAAAGCGCT GGCGGCAAGT
GTTCTGGCAT GCCTGGAAGT TCTGGCAAAC GATGATACTG TTGTTGACCC GTTCGGACCG
TCAAATGAGC CAGAGCCGGA TATTTCGGCG CTGATCGCGC GTATGCATCA GAATCACCCG
CTGCTTGCCG GATAA
 
Protein sequence
MRTAIAINPR HAAHDEPQHV EQAARLHAIT AALNASGLRS VLLEVPARPA TEAQLRAVHT 
EQMIEVVRWS ATRPRSWIDH DTYTTSASWD AALMAAGTTL AVVDAVVSGS AQNGFALVRP
PGHHATRAES MGFCLFNNVA IAARHAIDHL GVTRVAIVDF DVHHGNGTQD IFYDDDRVFF
CSTHASPLYP GTGAEREIGS GRGRGTTMNL PLPHGVGDAG FARLFDDVVI PALRRYRPDL
ILVSAGYDAH WADPLGPLTL SVAGYAALTR RLKETAEEVC NGRIALVLEG GYNLKALAAS
VLACLEVLAN DDTVVDPFGP SNEPEPDISA LIARMHQNHP LLAG
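
The gene and protein sequences above can be cross-checked with a short Python sketch. It strips the 10-character block spacing, computes the GC fraction, and translates the DNA codon by codon. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons). The function names `clean`, `gc_content`, and `translate` are hypothetical helpers, not part of any library.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 uses the same codon-to-amino-acid assignments.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean(seq: str) -> str:
    """Remove whitespace (block spacing, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate DNA to protein, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    start = "ATGAGAACAG CCATTGCCAT CAACCCGCGC"
    print(translate(start))
    print(round(gc_content(start), 2))
```

Pasting the full gene sequence into `translate` should reproduce the protein sequence listed above, and `gc_content` should come out close to the GC content reported in the record.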