Gene Information |
Locus tag | A2cp1_3536 |
Symbol | |
ID | 7299586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaeromyxobacter dehalogenans 2CP-1 |
Kingdom | Bacteria |
Replicon accession | NC_011891 |
Strand | + |
Start bp | 3951748 |
End bp | 3953532 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 643596349 |
Product | histone deacetylase superfamily |
Protein accession | YP_002493932 |
Protein GI | 220918628 |
COG category | [B] Chromatin structure and dynamics; [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
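The coordinates, gene length, and protein length in this record are arithmetically consistent. The following is a minimal sketch of that check. It assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; neither is stated in the record. The function names are hypothetical and chosen for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length_bp = gene_length(3951748, 3953532)
print(length_bp)                  # 1785
print(protein_length(length_bp))  # 594
```

Both results match the Gene Length (1785 bp) and Protein Length (594 aa) fields.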
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.380166 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCGCCCGC GCGCACTCCT CTACCGGCTC CGCTCCTGGT TCTACCGCCG GGACGTGACG CTCTGGTACG ACCCGCGCTA CCGCCTGCCG CTCTCCGGCC TCGAGTCCGG CGCCGGCATG GAGCCGCGCC GCGCCGACTT CGTGGCCTGG TGGCTCGCCG ACTCCGGCGC CGTGCCGCGC TCCCGGCGCC GCACGCCGCG GCGGATCTCG TTCGAGGACC TGGCGCGCGT GCACGACGCG GAGCTGCTCG AGTCGCTGGG CCACCCCGAC ACGCTGGCGC GGGTGTTCGC GGTGGACCCG TCCGACGTCC CGGTGGACGA GGTCATGACC ACCGTCCGGC TGGCCTGCGG CGGCACCCTG GGCGCGGCCC GCGAGACGCT CCGCACGCGC GCCCCCGCCA TCAACCTCCT GGGCGGCTTC CACCACGCGT TCCCCGGCGC CGCGGGCGGC TTCTGCCCGG TCAACGACGT CGCCGTCGCG ATCGCCGCGG TGCGCGCGGA GGGGTTCTCG GGGCGCGTGG CGGTCCTCGA TCTCGACGCG CACCCGCCCG ACGGCATCGC CGCCTGCCTG GCGCAGGATC CCGATCACTG GATCGGCTCC ATCTCCGGGT CGGACTGGGG GCCGCTCGAG GGCGTGGACG AGACGGTGCT CCCGCCGGGC AGCGGCGACG ACCCCTACCT CGAGGCGCTC GGCGCGCTGC TCTCGCGGAT GCCGCGGCCG CAGCTCGCGT TCGTGCTGGC CGGGGGCGAC GTGCTGGCCG GCGATCGCTT CGGCCAGCTC GGGCTGTCGC TCGACGGCGC GCGCGAGCGC GACCTGCTCG TCGCCGCCGA GCTGGACTTC GTGCCCACCG TGTGGCTCTC GGCGGGCGGC TACTCGCGGC GATCCTGGCG CGCGCTGGCC GGCACGGGCA TGGCGGTCGC GGCCGGCTCG CTCGCGCCCA TCCCCGACGA GTACGACCCG CTCTCCGCCC GCTTCGAGAT GGTCTCGCAG AAGCTCCTGC CCGGCGATCT CGGCGACACC GGCGACATCA CCGCCGAGGA TCTCGAGGAG GCGCTCGGCA TGCGGCCGCG GCGCCAGCGG CTGCTGCTCG GCTTCTACAC CGCCTCCGGC ATCGAGCACG CGCTGTTCCG CTACGGCGTG CTGGAGCAGC TCGAGCGCAT GGGCTACCGG CAGTTCCGGG TCGGGTTCGA CTCCGCCGGG CTGGGCGATC GGGTGCGGCT CCACGGCGAG GCGGAGGGGC AGGAGCACCT GCTCGTCGAG CTGATCCTGG AACGGCGACA CGTGCTCGGG GTCGAGGTGC TGTTCGTGCA CTGGCTCTCG CTGCGCAACC CGCGGGCGCA GTTCAGCGAC CGCCGCCCGC GCCTCCCGGG GCAGGAGGTG CCGGGGCTCG GCCTGGCGCA CGAGGCCGGG TCGATGCTGG CGCGCATGGC CATCCGCCTC GGGCTGGGCG GCGTCGCGTT CCGGCCGGCG CACTTCCACA CCGCCTACGC CGCCCGCCAC GCGTTCGCGT TCATCGACCC GGAGCGGCAG GGGCGCTTCG AGGCGCTGGT GCGGGACCTC GCGACGGTGC CGCTCCTCGA GGCGACGCGC GCGGTCTCCG AAGGCCGGGT GCTCCTCGAC GGGCGCCCCT ACGCGTGGGA GGCGGACGAG ATGGCGTACT GGCTGCGCGA GTCGCCGTCC GAGCCGGGGG AGGCGGAGCG CGAGCGCGAG CGGGTCCGCT TCACGCTGCT GCCGGAGGCG CCCCCGCCGG CGGTCAGGGC GCCGGCTCCG CCAGGAGCAG CCTGA
Protein sequence | MRPRALLYRL RSWFYRRDVT LWYDPRYRLP LSGLESGAGM EPRRADFVAW WLADSGAVPR SRRRTPRRIS FEDLARVHDA ELLESLGHPD TLARVFAVDP SDVPVDEVMT TVRLACGGTL GAARETLRTR APAINLLGGF HHAFPGAAGG FCPVNDVAVA IAAVRAEGFS GRVAVLDLDA HPPDGIAACL AQDPDHWIGS ISGSDWGPLE GVDETVLPPG SGDDPYLEAL GALLSRMPRP QLAFVLAGGD VLAGDRFGQL GLSLDGARER DLLVAAELDF VPTVWLSAGG YSRRSWRALA GTGMAVAAGS LAPIPDEYDP LSARFEMVSQ KLLPGDLGDT GDITAEDLEE ALGMRPRRQR LLLGFYTASG IEHALFRYGV LEQLERMGYR QFRVGFDSAG LGDRVRLHGE AEGQEHLLVE LILERRHVLG VEVLFVHWLS LRNPRAQFSD RRPRLPGQEV PGLGLAHEAG SMLARMAIRL GLGGVAFRPA HFHTAYAARH AFAFIDPERQ GRFEALVRDL ATVPLLEATR AVSEGRVLLD GRPYAWEADE MAYWLRESPS EPGEAERERE RVRFTLLPEA PPPAVRAPAP PGAA
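The sequences are stored in space-separated blocks of 10, so they need to be cleaned before any calculation. The following is a rough sketch, using only the Python standard library, of how the GC content could be recomputed and the gene sequence translated. It is not the database's own method, and the names `clean_sequence`, `gc_percent`, and `translate` are hypothetical. The sketch relies on translation table 11 sharing its amino-acid assignments with the standard genetic code; it ignores the alternative start codons that table 11 permits, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11
# uses the same assignments and differs only in its start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(blocked: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = clean_sequence(blocked)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGCGCCCGC GCGCACTCCT CTACCGGCTC"
print(translate(first_blocks))  # MRPRALLYRL
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would let the GC content and Protein sequence fields be checked in the same way.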