Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2789 |
Symbol | |
ID | 4072412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3300823 |
End bp | 3301719 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637984807 |
Product | histone deacetylase superfamily protein |
Protein accession | YP_591864 |
Protein GI | 94969816 |
COG category | [B] Chromatin structure and dynamics [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCCGCC GATTGTTTTA TACCGACCAT TACACCCTTC CCTTGCCTGA GGGACACCGG TTTCCGATTT CAAAATACAA ATTGCTGCGC GAGATGTTGG AGCGCGATTC GCTCTTTGAG TTTGTCCCGG CACCGCTGGC TAAGCCGGAG GTGATTGCAC TCGTGCATGA CGCTGCCTAC GTTGAACAAT TTGTTCAGGG GGAGTTGAGT GCGCAGGCGA TGCGGAGGAT TGGGTTTCCG TGGTCGCCGG AGTTGGTGAA GCGGACACTC GGATCGGTGG GCGGAACGTT GAGTGCGGGG ATGGATGCGT TGAGCTCAGG ATTCGGTGGG ACGCTGGCAG GTGGGACGCA TCACGCCTTC CGTAGCGAAG GCTCTGGCTA TTGCGTCTTC AACGATATCG CCATCGCCAT TCTCTACCTG CGAAGCAAAG GCCTCGCTCA GCGCGCTGCC GTGATCGATC TCGACGTGCA CCAGGGCGAT GGCACAGCGC AGATATTTCA GAACGATGCG TTGGTACTGA CGATCTCAGT CCATAGCCGA GCGAATTTTC CGTTTCGCAA GCAGGTGAGC AAGATCGACA TCGAGTTGGA AGACGCAACG CACGATGACG AGTATTTGAA CGTTGTTGAT GGGTTGTTGC CGCGGGTGGC GGATTTCAAA CCCGAGATTT TGTTCTATCA ATCAGGCGTG GATGGGTTGG CGACGGATTC GCTGGGGAGA TTGGCATTGA CCCACGCAGG TTTAAAAGAG CGCGACCGCC GCGTGTGCAC CTTCGCGCGC AGCTTCGGCG TGCCGCTGGT CATCACGCTC GGCGGCGGCT ACTCGCTCCC CATCGAGCAT ACCGTCACCG CCCACGCCAA CACTTTCCGC ACCGCAGCGG ATGTTTTTGT CGTGTGA
|
Protein sequence | MSRRLFYTDH YTLPLPEGHR FPISKYKLLR EMLERDSLFE FVPAPLAKPE VIALVHDAAY VEQFVQGELS AQAMRRIGFP WSPELVKRTL GSVGGTLSAG MDALSSGFGG TLAGGTHHAF RSEGSGYCVF NDIAIAILYL RSKGLAQRAA VIDLDVHQGD GTAQIFQNDA LVLTISVHSR ANFPFRKQVS KIDIELEDAT HDDEYLNVVD GLLPRVADFK PEILFYQSGV DGLATDSLGR LALTHAGLKE RDRRVCTFAR SFGVPLVITL GGGYSLPIEH TVTAHANTFR TAADVFVV
|
| |