Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1137 |
Symbol | |
ID | 8136459 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1329610 |
End bp | 1330767 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644868748 |
Product | Histone deacetylase |
Protein accession | YP_003020956 |
Protein GI | 253699767 |
COG category | [B] Chromatin structure and dynamics [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 141 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCCCGCA AAACCGCTCT CATCGCCTGT GCCGATCTGG CAGGGTACAG CTATGGTGAC CACCACCCGT TCAAGGTTCA GCGCTACCGG CTGGCCCACG ACCTAATGGA GGCTTACGGT CTGCTGGAGC TGCCGGGGAT GGAGTTGGTG CGGCCGCGAC CGGTCGGCGA GGCGGAACTG TTGACGGCGC ACAGCCGCGA GTACCTGGAC CGGCTGCGGG AGTTCAGCGC CTCGGCAGAA CCCAGGGCGG ATTTCCGCTT CGGCCTTGGC GACCTGGAGA ATCCTGTCTT CCCCGGCGTT TACGATTGGG CCTGCCTCGG GGTCTCCGGT ACCGTGGAGG CGGCGCGCCT GGTGACGGAA GAAGGATTCG CCGCCGCCTT CAACCCTTTT GGCGGCTATC ATCACGCCCA AAGAAGCCGA GCCTCCGGGT TTTCTTATCT GAACGACGCC GTGGTGGCCA TAAACCACCT GGTGCAGCAG GGGAAGCGGG TTGTTTACCT GGACCTGGAC GCCCACCACG GCGACGGGGT GCAAGACGCC TTCTACCAAA GCGATCGGGT CTTGAGCATC TCGCTGCACG AAAGCGGCGT CTATTTCTTT CCCGGCACCG GCTTCGAGCG GGAGTCAGGG GAAGGGGCGG GGCGGGGCTA CTCGGTGAAC CTGCCGCTTT TGGCCCACAC GGACGACGCG CTCTTCATGA AGGCGTTCGA CGAGGTGGCC TTTCCGCTGA TCGCTGCGTT CGACCCGGAC GTGCTCTTCA CCCAGCTTGG CGCCGACACC TTCAGGACCG ATCCGCTCAC CAGGCTCGAG GTCACCACCC ACGCCTATTC TTATATACTG CGCAAGTTGC GCGCCCTGCA GATCCCCTGG GTGGCGGTGG GGGGAGGGGG GTACGACATG ATGAACGTGG CCCGCGCCTG GACCATCGCC TGGGGGGTTA TGAACGACCG GGTGCTCTCG CCCAGGCTTC CAGCGTCCTT CGTCAAACTG ATCTCGGATC TTGGGTACCC GCACCGCATG CTCCTGGACG CCATGCATTG GGCCCAGGAG GACGACCGCA ACCGGGCGCT GGACGCGGTG GAGAAGAGCA TCGCTTACCT GCGGGCCCAG ATCTTCCCGG TTCTCATCGG CGACTATGGC ACAAAACGGA GCACATGA
|
Protein sequence | MSRKTALIAC ADLAGYSYGD HHPFKVQRYR LAHDLMEAYG LLELPGMELV RPRPVGEAEL LTAHSREYLD RLREFSASAE PRADFRFGLG DLENPVFPGV YDWACLGVSG TVEAARLVTE EGFAAAFNPF GGYHHAQRSR ASGFSYLNDA VVAINHLVQQ GKRVVYLDLD AHHGDGVQDA FYQSDRVLSI SLHESGVYFF PGTGFERESG EGAGRGYSVN LPLLAHTDDA LFMKAFDEVA FPLIAAFDPD VLFTQLGADT FRTDPLTRLE VTTHAYSYIL RKLRALQIPW VAVGGGGYDM MNVARAWTIA WGVMNDRVLS PRLPASFVKL ISDLGYPHRM LLDAMHWAQE DDRNRALDAV EKSIAYLRAQ IFPVLIGDYG TKRST
|
| |