Gene Mlg_2048 details


Gene Information

Locus tag: Mlg_2048
Symbol:
ID: 4270182
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2320634
End bp: 2321731
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 70%
IMG OID: 638126804
Product: histone deacetylase superfamily protein
Protein accession: YP_742880
Protein GI: 114321197
COG category: [B] Chromatin structure and dynamics; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein
TIGRFAM ID:
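
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2320634
end_bp = 2321731

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is the stop codon,
# which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values match the Gene Length and Protein Length fields of the record.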


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.309662
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.408539
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATTGGGG ACCGCCGCGT CATCGCCTTC CACGATGCCC GAATGCTGGA GCACCGGCCG 
GACGTGCAGG ATGCGTATCA GCCGGGCCGC CTGGCCACCC GGGTCAAGCG CATGCTGGAC
GGGCTCACCA TCCAGTGGAA CTACCCGGAG CATCCAGGGC GCCTTACCGC CATCATGGAC
CTGCTGGTCC GCGAGCCGGT CCCCGGGGTG ACCTTCCGCA CTGGCCGGGC CGCCACCCCG
GCAGAACTGG GCCGGGTGCA TACCCTCTCC TATCTGGAGA CCATCTACGC CCTGCGCGGC
AAGCACGCCT GGCTGGATGT GGACACCACT GCGGTCTGCC CGGGCAGTGT GGACGCCGCC
GAGGTGGCGG CCGGCACCGC CATTGCCGCG GTGGAGGCGG TGGTACAGGG TGACGCTGAG
GCCGCCTTCG CCCTGGTGCG CCCCCCGGGG CACCACGCCG AGGCGGTGCG CGCCCGCGGT
TTTTGCCTGT TCAACAATGT CGCCGTGGCG GCCGCCCACG CCCAAGCGGC ACTGGGCTGC
CAGCGGGTGC TGATCGTGGA CTGGGACGTC CACCACGGCA ACGGCACCCA GGACATCTTC
CGCGCCGACC CGGATGTGCT CTTTTTCGAC ACCCACCGGG CCTCGCCCTT CTACCCGGGC
TCGGGCCGAC TGGAGGAGGT CGGTCACGGC CTGGGGGAAG GCACCACGGT CAACGTCCCG
TTACCGCCCG GGGCCGGTGA TGCCGCGCTC CTGCGGGCCT TCCACGAAAT CCTCGTCCCC
GCCGCTGACT GGTTCCAGCC CGACCTGGTG CTGGTCTCGG CCGGCTTCGA CCCCCACCGG
CTGGACCAGG CCCTGAATAT GAGTTACGAG GGCTTCGCCG CCTTGACCGC AGTGCTGCAG
GAGATCGCCA CAAGGCACGC CCAGGGGCGG CTGGCCTTCG TGCTGGAAGG GGGCTACAAC
CTGGAGGCGC TGTCCCGGGG GGTACGGACC GTGCTGGAGG TGCTGGCCGG CGCCGAACTC
GAACCCCTGC AGGCGGCCGG AATGGAAGAG CTGGAACAGG CCATCGCCTT CCACCGGGAT
GCCTTCCAGG CGCCCTGA
 
Protein sequence
MIGDRRVIAF HDARMLEHRP DVQDAYQPGR LATRVKRMLD GLTIQWNYPE HPGRLTAIMD 
LLVREPVPGV TFRTGRAATP AELGRVHTLS YLETIYALRG KHAWLDVDTT AVCPGSVDAA
EVAAGTAIAA VEAVVQGDAE AAFALVRPPG HHAEAVRARG FCLFNNVAVA AAHAQAALGC
QRVLIVDWDV HHGNGTQDIF RADPDVLFFD THRASPFYPG SGRLEEVGHG LGEGTTVNVP
LPPGAGDAAL LRAFHEILVP AADWFQPDLV LVSAGFDPHR LDQALNMSYE GFAALTAVLQ
EIATRHAQGR LAFVLEGGYN LEALSRGVRT VLEVLAGAEL EPLQAAGMEE LEQAIAFHRD
AFQAP
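
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of that cleanup, together with a GC-fraction calculation of the kind that could underlie the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first three blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence, as printed in the record.
example = clean_sequence("GTGATTGGGG ACCGCCGCGT\nCATCGCCTTC")
print(len(example), round(gc_fraction(example), 2))
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` would give a value to compare against the reported GC content.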