Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0335 |
Symbol | |
ID | 4269898 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 379529 |
End bp | 380452 |
Gene Length | 924 bp |
Protein Length | 307 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638125066 |
Product | histone deacetylase superfamily protein |
Protein accession | YP_741180 |
Protein GI | 114319497 |
COG category | [B] Chromatin structure and dynamics [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACCT GGCTCATCAC CCACCCGGCC TGCCTGGAGC ACGATACCGG CCCAGGGCAC CCGGAAAGCA TCGCCCGACT GCAGACCATC CTGACAGCGC TGCAGTCGTC CACCTTCGAG TTCCTGATCC GTGAGCCGGC GCCGCGCGCC GAGGTGGAAC AGCTCAACCG GGTACACGAC CCGGACTACG TGCGCGACAT CCTCGCCCGG ATCCCCGAAC AGGGCCAGGC CTTCATCGAC AGCGACACCG TGGTCTGCCC GGCCACCGGC GAGGCGGCGC TGCGGGCGGC CGGGGCCGTC TGCCATGCGG TGGATGCGGT GGTCGGCGGC CAGGCGCAGA ACGCCTTTTG CGCGGTGCGC CCGCCCGGCC ACCACGCCGA ACCCAACCAG GCGATGGGCT TTTGTTTCTT CAATAACATC GCCGTGGGCG CGGCCCACGC TATCGCGGCG CACGGCCTGG AGCGGGTGGC GATCATGGAC TTCGATGTCC ACCATGGCAA TGGCACCCAG ACCATAGCTG AGCGCAACCC GAAGATGTAC TACCTCTCCA CCCACCAGGC ACCGCTGTTC CCGGGTACCG GCGATCCCTC CGAAACCGGT CAGGGCAACA TCCGCAACGC GACCCTGGAG GATGGTGACG GCTCCGAGAT GTTCCGCTTC CAGTTCGAGG AACGCATCCT GCCGGAGCTG CACCGGTACA AGCCGCAGCT GGTGATGATC TCCGCCGGCT TCGACGCCCA CCGCTCCGAC CCCCTGGCCA CCCTGCGGCT GGACGAAACC GACTTTGCCT GGGCCACAAG GGAGCTGGTG GCCATCGCCC GCAAGTACTG TGACGGCAAG GTGGTCTCGG CGCTGGAGGG GGGGTACAGC CTGAACGCGG TGGGCCGTTG CGCCCGCGCC CATGTGGAAG CACTGATGAC CTGA
|
Protein sequence | MTTWLITHPA CLEHDTGPGH PESIARLQTI LTALQSSTFE FLIREPAPRA EVEQLNRVHD PDYVRDILAR IPEQGQAFID SDTVVCPATG EAALRAAGAV CHAVDAVVGG QAQNAFCAVR PPGHHAEPNQ AMGFCFFNNI AVGAAHAIAA HGLERVAIMD FDVHHGNGTQ TIAERNPKMY YLSTHQAPLF PGTGDPSETG QGNIRNATLE DGDGSEMFRF QFEERILPEL HRYKPQLVMI SAGFDAHRSD PLATLRLDET DFAWATRELV AIARKYCDGK VVSALEGGYS LNAVGRCARA HVEALMT
|
| |