Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3902 |
Symbol | |
ID | 4072239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4616603 |
End bp | 4617676 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637985928 |
Product | histone deacetylase superfamily protein |
Protein accession | YP_592976 |
Protein GI | 94970928 |
COG category | [B] Chromatin structure and dynamics [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000251706 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.03889 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTGCTT TGCCCGTCCG GTCGTATTGC GAACCTCGCG CTAATCAACG AAAATGCAGT GTTTCTATGT TGCCCTTCAA GCTCGTTTAT AGCGACCACT ACCGCTTACC TCTGGGCGAG CATGTGTTCC CCACGCAGAA ATACGAACTC GTAAAACAGG AGCTGCTGGA AGAAGGGGTC GCCTCGACGC AGGATTTCCT TACGCCGACG CCTGCTACAG AAGCCGATGT TCTGCTGGTG CACTCCCATT TCTACGTAGA TAAGCTGATC GAAGGAACAT TAACAGCGCG TGAGGAACTG GCCCTCGAGA TCCCCTACTC CCACGAAGCC GTGCAAGCGT TCTTGTGGCA CACCGGAGGC ACCATCCTGG CGGCGGAGCG CGCACTTTCC GATGGAGTGG CGTTCAACCT CGGCGGCGGA TTTCACCACG CGTATCCCGA CCACGGCGAA GGGTTCTGCA TGATTCACGA CGTGGCGGTG GCGATCAGGA AACTGCAAAA ACAAGGCAGA ATCCAGCGTG TGATGACGCT CGACTGCGAT GTTCATCAGG GAAATGGAAC TGCCGTAATT TTCGCAAAAC ACAGAGATGA GAATTCCGAG GCCCTGCCTT CGCGTTCTAC TTCGACGATC GGCAACAGGC TCAGCGGAAC GATGTTGGAG CGCGGTGCCG ACGATGTCTT CACGATCTCA TTGCATCAGG AGAACAACTA CCCACTGCAA AAGCCGCCGT CGTCCATAGA CGTCAATCTC CCCGACGGTA CAACAGATTC CGAATACATC GCGTGGCTCG ACAACGCGAT AAGTTCGGGG TTCCGGCAGT TCCAACCAGA TTTGCTTTGC TATATCGCCG GCGCGGACCC TTACAAGGAA GATCAACTCG GCGGCCTGAA CCTCACTATT GACGGTTTGA AACATCGTGA TGAGCTCGTA TTCCAAGCGG CACGCGCAAA GGGAATTCCC GTCATGGTGA CATTTGCCGG CGGCTATGCG CGCAAGATCC AGGACACCGT GCGGATACAC CGCAACACGG TCGGGGCAGC GAAGGAAGTT TTCTCAGGAG CAGGGAAGAG CTAA
|
Protein sequence | MAALPVRSYC EPRANQRKCS VSMLPFKLVY SDHYRLPLGE HVFPTQKYEL VKQELLEEGV ASTQDFLTPT PATEADVLLV HSHFYVDKLI EGTLTAREEL ALEIPYSHEA VQAFLWHTGG TILAAERALS DGVAFNLGGG FHHAYPDHGE GFCMIHDVAV AIRKLQKQGR IQRVMTLDCD VHQGNGTAVI FAKHRDENSE ALPSRSTSTI GNRLSGTMLE RGADDVFTIS LHQENNYPLQ KPPSSIDVNL PDGTTDSEYI AWLDNAISSG FRQFQPDLLC YIAGADPYKE DQLGGLNLTI DGLKHRDELV FQAARAKGIP VMVTFAGGYA RKIQDTVRIH RNTVGAAKEV FSGAGKS
|
| |