Gene Information |
Locus tag | Moth_0989 |
Symbol | |
ID | 3830865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1016102 |
End bp | 1017415 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637828918 |
Product | histone deacetylase superfamily protein |
Protein accession | YP_429847 |
Protein GI | 83589838 |
COG category | [B] Chromatin structure and dynamics; [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.595114 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATAAAG CCAAGCGCCG CCTGGGCCTG GTTTTCTTCC CTGCCTTTGA CTGGGCCATC AGCCCTACCC ACCCTGAGCG GGAAGAGCGC CTCCTCTATA CCCAGGACCA GGTGTTCGAA GAAGGCATCC TGGACATCGA AGGCATCAGG GAATATCGCC CTCGCCTGGC CAGTACCGGC GATGTAGAGC GGGTGCATAT CTGTGTGCCG GATGTCCAAT CCCGGACGAC GGAATCCCAC CTCATTGCCG CCGGCGGGGC CATTGTTGCC GCTGAAGCCG TTTTAAAGGG AGAAGTAGAT AAGGCCTTTG CCATTATCCG GCCGCCGGGG CATCACTCCT TCCGCATCGT CCATGGCGCC AGGGGTTTTT GTAATATAAA TATGGAGGCC ATTATGCTTG AGTATATTCG CCGGCATTAC GGCCCCAAAC GGATAGCCAT TGTCGATACC GACTGCCACC ATGGGGACGG CAGCCAGGAT ATCTACTGGC ACGACAAGGA TACCCTTTAT ATCTCCCTAC ACCAGGACGG CCGTACCCTT TTCCCGGGTA CCGGCTTCCT GAACGAATTC GGCGGCCCCA ATGCCTTCGG CTATACCCTT AACCTTCCCC TGCCCCCTGA CACCGGGGAA GAGGGGTTCC TTTATGCCCT GGAACATTTT ATCCTGCCGG TGCTAGCCGA ATTTAAGCCG GATCTGGTCA TCAACTCCGC CGGCCAGGAT AACCATTATA CCGACCCGAT AACCAACATG CGTTTTTCAG CCCAGGGTTA CGCCCGCCTT AACGACCGCC TGCAACCGGA TATCGCCGTC CTGGAGGGAG GTTATTCCAT CGAAGGCGCC CTGCCCTACG TCAACGCAGG CATTATCCTG GCCCTGGCCG GCCTCGATTA CTCAAGGGTC CGCGAACCTG ACTACACTCC GGAAAAGGTG GCTCAATCGC CCCGCGTCAG TGAGTATATC GCCCGCCTCT GCGACGAGTC CTACCAGGCC TGGCAGCAGA GGGATACCCT CCGGGAAGAG TATATCCGGG GTAAAAAAGA AATCAGCCGC CGGCGCCGTA TTTTCTACGA CACCGAGGGG CTGATGGAAA CCCAGCAAGA AACGACCCGC GTTTGCCATG ATTGCGGCGG CGTGACCTGG ATTGATTCCC GTACCGACCA GGGCCGCCAC ATTCTGGCCA TCTTTATACC TCGGGACGCC TGTCCCCGCT GCCAGGAAGA AGGCCATGCC CTCTTTGAAA GCAGCCAGAC CATTGATTAC CCGGACGGAG TCTTTCTCCA GGACCGTTTG GAAGATAAGT TGTTCAAGAA ATAA
|
Protein sequence | MYKAKRRLGL VFFPAFDWAI SPTHPEREER LLYTQDQVFE EGILDIEGIR EYRPRLASTG DVERVHICVP DVQSRTTESH LIAAGGAIVA AEAVLKGEVD KAFAIIRPPG HHSFRIVHGA RGFCNINMEA IMLEYIRRHY GPKRIAIVDT DCHHGDGSQD IYWHDKDTLY ISLHQDGRTL FPGTGFLNEF GGPNAFGYTL NLPLPPDTGE EGFLYALEHF ILPVLAEFKP DLVINSAGQD NHYTDPITNM RFSAQGYARL NDRLQPDIAV LEGGYSIEGA LPYVNAGIIL ALAGLDYSRV REPDYTPEKV AQSPRVSEYI ARLCDESYQA WQQRDTLREE YIRGKKEISR RRRIFYDTEG LMETQQETTR VCHDCGGVTW IDSRTDQGRH ILAIFIPRDA CPRCQEEGHA LFESSQTIDY PDGVFLQDRL EDKLFKK
|
| |
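The coordinate and length fields in the record can be cross-checked against each other. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `expected_protein_length` are hypothetical helpers introduced here, not part of any database API.

```python
# Consistency check for the record's coordinate and length fields.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the CDS carries exactly one terminal stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS of this length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Moth_0989.
START_BP = 1016102
END_BP = 1017415

length_bp = gene_length(START_BP, END_BP)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)  # 1314 437
```

Under these assumptions the computed values agree with the listed Gene Length (1314 bp) and Protein Length (437 aa). The strand field does not affect the arithmetic, since the length depends only on the span.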