Gene Sare_1422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1422 
Symbol 
ID5704811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1643893 
End bp1645074 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content74% 
IMG OID641270932 
Producthistone deacetylase superfamily protein 
Protein accessionYP_001536313 
Protein GI159037060 
COG category[B] Chromatin structure and dynamics
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000379462 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCCGGACG ACACAGTGGT GGTGTGGGAC GAGGCGTTGC TGTCCTACAA CCTCGGCGAC 
CACCCCCTTG ACCCGGTACG GGTCGAGCTG ACCGTCGCGC TCGCCCGGGA GCTGGGTGTG
CTCGCGCGTT CCGGGGTGCG GCTGGTCGAG CCGAAGCCCG CCGACGACGA CCTGCTGGCC
CAGGTGCACG ACCCGCGCTA CCTGGAAGCC GTGCGGAGTG CCCCGCAGGA CCCGCTCTTC
ACCGGGTTCG GGCTGGGCAC CCCGGACAAC CCGGTCTTCC CGAAGATGCA CGAGGCCAGC
GCGCTGATCG CCGGGGCCAC CGCCACGGCG GCCGAGGCAG TCTGGCGGGG CACGGCACGT
CGGGCGGTCA ACGTGGCCGG CGGTCTGCAC CACGCGATGC CCGACCGGGC CGCTGGCTTC
TGCGTCTACA ACGACCCCGC GGTCGGTATC GCCCGCCTGC TCGACCTGGG TGCACGTCGG
ATCGCGTACG TCGACGTGGA CGTCCACCAC GGCGACGGAG TGCAGCAGGT CTTCTGGGAC
GACCCGCGGG TGCTGACGGT CAGCCTGCAC GAGACGCCGC TGGCGCTCTT CCCCGGCACC
GGCTTCCCCG ACGAGACCGG CGGCGCGCAG GCCCAGGGAA GCGCGGTGAA CGTGGCGTTG
CCGCCGGGTG TCGACGACGC CGGCTGGCAG CGGGCGTTCC ACGCGATCGT GCCGTCGGTG
CTGCGTGCGT TCCAGCCGGA GATCCTGGTC ACCCAGTGCG GTGCGGACGC GCACCGGCTC
GACCCACTCG CCGACCTGCG CCTGTCGGTC GACGGGCAGC GCGCCACCTA CATCGCCCTG
CGGGCACTCG CCGACGAGCT GTGCGAGGGC CGCTGGGTCG CGACCGGCGG CGGGGGGTAC
GCGCTGGTCG AGGTGGTGCC CAGGGCGTGG ACCCACCTGC TCGCGGTGGC CACCGGCGAG
CCGCTCGAAC CGGCGACGCT GTCCCCGCCC GCCTGGCGCG AGCTGGCCCT GGCCCTCCGC
CCCGGGCAGG AGGTGCCGCT GCGGATGACC GACGACGTCA ACCCGTCGTA CGAGCCGTGG
CAGCCGTCCG GGGAGCCGAA CTCGGTGGAC CGGGCCATCG TGGCCGCCCG CAAAGCGGTG
TTCCCGCTGT TCGGGCTCGA CCCGCACGAC CCACGCGACT AG
 
Protein sequence
MPDDTVVVWD EALLSYNLGD HPLDPVRVEL TVALARELGV LARSGVRLVE PKPADDDLLA 
QVHDPRYLEA VRSAPQDPLF TGFGLGTPDN PVFPKMHEAS ALIAGATATA AEAVWRGTAR
RAVNVAGGLH HAMPDRAAGF CVYNDPAVGI ARLLDLGARR IAYVDVDVHH GDGVQQVFWD
DPRVLTVSLH ETPLALFPGT GFPDETGGAQ AQGSAVNVAL PPGVDDAGWQ RAFHAIVPSV
LRAFQPEILV TQCGADAHRL DPLADLRLSV DGQRATYIAL RALADELCEG RWVATGGGGY
ALVEVVPRAW THLLAVATGE PLEPATLSPP AWRELALALR PGQEVPLRMT DDVNPSYEPW
QPSGEPNSVD RAIVAARKAV FPLFGLDPHD PRD