Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2048 |
Symbol | |
ID | 4270182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2320634 |
End bp | 2321731 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638126804 |
Product | histone deacetylase superfamily protein |
Protein accession | YP_742880 |
Protein GI | 114321197 |
COG category | [B] Chromatin structure and dynamics [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.309662 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.408539 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATTGGGG ACCGCCGCGT CATCGCCTTC CACGATGCCC GAATGCTGGA GCACCGGCCG GACGTGCAGG ATGCGTATCA GCCGGGCCGC CTGGCCACCC GGGTCAAGCG CATGCTGGAC GGGCTCACCA TCCAGTGGAA CTACCCGGAG CATCCAGGGC GCCTTACCGC CATCATGGAC CTGCTGGTCC GCGAGCCGGT CCCCGGGGTG ACCTTCCGCA CTGGCCGGGC CGCCACCCCG GCAGAACTGG GCCGGGTGCA TACCCTCTCC TATCTGGAGA CCATCTACGC CCTGCGCGGC AAGCACGCCT GGCTGGATGT GGACACCACT GCGGTCTGCC CGGGCAGTGT GGACGCCGCC GAGGTGGCGG CCGGCACCGC CATTGCCGCG GTGGAGGCGG TGGTACAGGG TGACGCTGAG GCCGCCTTCG CCCTGGTGCG CCCCCCGGGG CACCACGCCG AGGCGGTGCG CGCCCGCGGT TTTTGCCTGT TCAACAATGT CGCCGTGGCG GCCGCCCACG CCCAAGCGGC ACTGGGCTGC CAGCGGGTGC TGATCGTGGA CTGGGACGTC CACCACGGCA ACGGCACCCA GGACATCTTC CGCGCCGACC CGGATGTGCT CTTTTTCGAC ACCCACCGGG CCTCGCCCTT CTACCCGGGC TCGGGCCGAC TGGAGGAGGT CGGTCACGGC CTGGGGGAAG GCACCACGGT CAACGTCCCG TTACCGCCCG GGGCCGGTGA TGCCGCGCTC CTGCGGGCCT TCCACGAAAT CCTCGTCCCC GCCGCTGACT GGTTCCAGCC CGACCTGGTG CTGGTCTCGG CCGGCTTCGA CCCCCACCGG CTGGACCAGG CCCTGAATAT GAGTTACGAG GGCTTCGCCG CCTTGACCGC AGTGCTGCAG GAGATCGCCA CAAGGCACGC CCAGGGGCGG CTGGCCTTCG TGCTGGAAGG GGGCTACAAC CTGGAGGCGC TGTCCCGGGG GGTACGGACC GTGCTGGAGG TGCTGGCCGG CGCCGAACTC GAACCCCTGC AGGCGGCCGG AATGGAAGAG CTGGAACAGG CCATCGCCTT CCACCGGGAT GCCTTCCAGG CGCCCTGA
|
Protein sequence | MIGDRRVIAF HDARMLEHRP DVQDAYQPGR LATRVKRMLD GLTIQWNYPE HPGRLTAIMD LLVREPVPGV TFRTGRAATP AELGRVHTLS YLETIYALRG KHAWLDVDTT AVCPGSVDAA EVAAGTAIAA VEAVVQGDAE AAFALVRPPG HHAEAVRARG FCLFNNVAVA AAHAQAALGC QRVLIVDWDV HHGNGTQDIF RADPDVLFFD THRASPFYPG SGRLEEVGHG LGEGTTVNVP LPPGAGDAAL LRAFHEILVP AADWFQPDLV LVSAGFDPHR LDQALNMSYE GFAALTAVLQ EIATRHAQGR LAFVLEGGYN LEALSRGVRT VLEVLAGAEL EPLQAAGMEE LEQAIAFHRD AFQAP
|
| |