Gene Mlg_1020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1020 
Symbol 
ID4270050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1160528 
End bp1161634 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content67% 
IMG OID638125772 
Producthistone deacetylase superfamily protein 
Protein accessionYP_741863 
Protein GI114320180 
COG category[B] Chromatin structure and dynamics
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.283226 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAATTAG CGGTTAGTAT GCGGGTAGGG CTGGTCGGGC GCCGGCCGGC GATGACACCA 
CGGGCCGTTG ATGGGGGATC CGACGTGAAG GCATTTTTCC ACCCGAGCCA GGACAAGCAC
ATCCCCAGGA GCTACCTCTC CCGCGGTCAG ATGCGGGCGC CGCTGGAACT CCCCGAGCGC
ACCGGGCATA TCCTGGAGGG GCTGCGGACG CTGGATATCT CGGTGGAGAC ACCCTCTGAT
CACGGGATGC AGGCCATTGC CCGGGTCCAC GACATGGGCT ACCTGCGGTT CCTGGAGTCG
GCGCATCGGC GCTGGAAGGC CATCCCCGAT GACTGGGGTG ATGAGGTGAT GTCCAATGTC
TTCGTGCGCT CGCCCAACCC CATGAAGGGC ATCCTGGCCG AGGCCGCCCG CTACCTGGCC
GACGGCAGTT GCCCCATTGG CGAACACACC TTCGAGGCCG CCTACTGGTC GGCCCAGACC
GCGTTATCGG CCAGCGACGA GCTGCTGCGG GGGGCGAAGC GCGCCTATGC GGTCTGCCGC
CCGCCGGGAC ACCACGCCCG CCGCGACGCC GCGGGTGGCT TCTGCTATCT GAACAATGCC
GCCATCGCCG CCGAGGCCCT CAAGGCCCAG TACCCCCGGA TCGCCATCCT CGACCCGGAC
ATGCACCATG GCCAGGGCAT CCAGGAGATC TTCTACGACC GGGACGATGT GCTCTATATC
TCCATCCACG GCGACCCCAC CAACTTCTAC CCGGTGGTGA GCGGCCACGA GGAGGAGCGC
GGGGCCGGGG CCGGCGAGGG CTATAATATC AACCTGCCCA TGCCCCACGG CTCACCGGAG
GCCACCTACT TCCAGCGCTT GGAGGAGGCG GCGCACGCCA TCGAGCTCTA CGCCCCCGAC
GCGCTCATCG TCACCCTGGG CTTCGATATC TATAAGGATG ACCCGCAGAA CAAGGCGGCG
GTGAGCTCAC CCGGCTTCAA CCGCATGGGT CGCACCCTGG CCGAGCTCGC TCTGCCGACG
CTGATCATCC AGGAGGGGGG CTATCACATG GCGACGCTGG CACAGAACAC CCGCGAGTTC
TTCACCGGCT TGGGCGACCC GCGCTGA
 
Protein sequence
MELAVSMRVG LVGRRPAMTP RAVDGGSDVK AFFHPSQDKH IPRSYLSRGQ MRAPLELPER 
TGHILEGLRT LDISVETPSD HGMQAIARVH DMGYLRFLES AHRRWKAIPD DWGDEVMSNV
FVRSPNPMKG ILAEAARYLA DGSCPIGEHT FEAAYWSAQT ALSASDELLR GAKRAYAVCR
PPGHHARRDA AGGFCYLNNA AIAAEALKAQ YPRIAILDPD MHHGQGIQEI FYDRDDVLYI
SIHGDPTNFY PVVSGHEEER GAGAGEGYNI NLPMPHGSPE ATYFQRLEEA AHAIELYAPD
ALIVTLGFDI YKDDPQNKAA VSSPGFNRMG RTLAELALPT LIIQEGGYHM ATLAQNTREF
FTGLGDPR