Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1422 |
Symbol | |
ID | 5704811 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1643893 |
End bp | 1645074 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641270932 |
Product | histone deacetylase superfamily protein |
Protein accession | YP_001536313 |
Protein GI | 159037060 |
COG category | [B] Chromatin structure and dynamics [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000379462 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCCGGACG ACACAGTGGT GGTGTGGGAC GAGGCGTTGC TGTCCTACAA CCTCGGCGAC CACCCCCTTG ACCCGGTACG GGTCGAGCTG ACCGTCGCGC TCGCCCGGGA GCTGGGTGTG CTCGCGCGTT CCGGGGTGCG GCTGGTCGAG CCGAAGCCCG CCGACGACGA CCTGCTGGCC CAGGTGCACG ACCCGCGCTA CCTGGAAGCC GTGCGGAGTG CCCCGCAGGA CCCGCTCTTC ACCGGGTTCG GGCTGGGCAC CCCGGACAAC CCGGTCTTCC CGAAGATGCA CGAGGCCAGC GCGCTGATCG CCGGGGCCAC CGCCACGGCG GCCGAGGCAG TCTGGCGGGG CACGGCACGT CGGGCGGTCA ACGTGGCCGG CGGTCTGCAC CACGCGATGC CCGACCGGGC CGCTGGCTTC TGCGTCTACA ACGACCCCGC GGTCGGTATC GCCCGCCTGC TCGACCTGGG TGCACGTCGG ATCGCGTACG TCGACGTGGA CGTCCACCAC GGCGACGGAG TGCAGCAGGT CTTCTGGGAC GACCCGCGGG TGCTGACGGT CAGCCTGCAC GAGACGCCGC TGGCGCTCTT CCCCGGCACC GGCTTCCCCG ACGAGACCGG CGGCGCGCAG GCCCAGGGAA GCGCGGTGAA CGTGGCGTTG CCGCCGGGTG TCGACGACGC CGGCTGGCAG CGGGCGTTCC ACGCGATCGT GCCGTCGGTG CTGCGTGCGT TCCAGCCGGA GATCCTGGTC ACCCAGTGCG GTGCGGACGC GCACCGGCTC GACCCACTCG CCGACCTGCG CCTGTCGGTC GACGGGCAGC GCGCCACCTA CATCGCCCTG CGGGCACTCG CCGACGAGCT GTGCGAGGGC CGCTGGGTCG CGACCGGCGG CGGGGGGTAC GCGCTGGTCG AGGTGGTGCC CAGGGCGTGG ACCCACCTGC TCGCGGTGGC CACCGGCGAG CCGCTCGAAC CGGCGACGCT GTCCCCGCCC GCCTGGCGCG AGCTGGCCCT GGCCCTCCGC CCCGGGCAGG AGGTGCCGCT GCGGATGACC GACGACGTCA ACCCGTCGTA CGAGCCGTGG CAGCCGTCCG GGGAGCCGAA CTCGGTGGAC CGGGCCATCG TGGCCGCCCG CAAAGCGGTG TTCCCGCTGT TCGGGCTCGA CCCGCACGAC CCACGCGACT AG
|
Protein sequence | MPDDTVVVWD EALLSYNLGD HPLDPVRVEL TVALARELGV LARSGVRLVE PKPADDDLLA QVHDPRYLEA VRSAPQDPLF TGFGLGTPDN PVFPKMHEAS ALIAGATATA AEAVWRGTAR RAVNVAGGLH HAMPDRAAGF CVYNDPAVGI ARLLDLGARR IAYVDVDVHH GDGVQQVFWD DPRVLTVSLH ETPLALFPGT GFPDETGGAQ AQGSAVNVAL PPGVDDAGWQ RAFHAIVPSV LRAFQPEILV TQCGADAHRL DPLADLRLSV DGQRATYIAL RALADELCEG RWVATGGGGY ALVEVVPRAW THLLAVATGE PLEPATLSPP AWRELALALR PGQEVPLRMT DDVNPSYEPW QPSGEPNSVD RAIVAARKAV FPLFGLDPHD PRD
|
| |