Gene Mlg_2059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2059 
Symbol 
ID4270445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2333533 
End bp2334633 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content67% 
IMG OID638126815 
Productalcohol dehydrogenase 
Protein accessionYP_742891 
Protein GI114321208 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.286272 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAGA AGACAATGCA CGCCGTGCAA CTGACCCGCC ACGGTGATTT GGATGCCCTG 
GTCTATCGCG ACGATGTGCC GCGCCCGGAA CCGGCGAGGG GCGAGGTGCT GATCGAGGTC
AGTGCCTGCG GCATGAACAA CACCGACGTC TGGGTGCGGC AGGGGGCCTA CGGCACCGAG
ACCGATCCGG ACAGTGTCTC CACCTGGCGC CGGGGCCGCT CGACCCTGAC CTTTCCGCGC
ATCCAGGGCA CCGATATCGT CGGCACCGTC GTGGCCGTAG GCGAGGGCGT GCCCGAGGCC
CGCATCGGTG AGCGGGTCAT GGTGGACTTC AGCCTCTATA ACCGGGCGGA TGACAGCCTC
GCCGATATCG ACTACATCGG CCACGGCCGT GACGGGGGCT ATGCCGAGTA CACTGCGGTG
CCCTCGGAGA ACGCCCACGT GGTGGATACC GATATGAGCG ACGCCGAACT GGCGACCTTC
TGCTGTGCCT ATCTGACCGG CGAGCACATG CTGGAACGGG CCCGGGTGCA GGCGGGGGAG
CGGGTGCTGG TGACCGGTGC CTCCGGCGGC GTGGGCTCCG GCATCATACA GCTGTGCCGG
GCGCGGGGCG CCATCCCCTA CGCCGTGACC AGCCGGGACA AGGCAGAGGC GGTGCGCGGG
ATTGGTGCCG AAGCGGTCAT CCCCCGTGAG AGTGGCGATC TGGTGACGGC GGTGGACCAG
GCCACCGAAG GCCGGCCCAT CGACGTGGTG GCCGATCTGG TGGCCGGCCC GCTGTTCAAC
GACCTGCTGC GGGTGCTGCG TCCGGAGGGC CGGTATACCA CGGCCGGCGC CATCGCGGGG
CCCGTGGTGC AGTTGGATCT GCGGACGCTC TATCTCAAGC ATCTGCAACT GCACGGCTCC
TCCCAGGGCA CCCGCGGGGA TTTTCGGCGC CTGGTCGGCT ATATCGAGAG GGGGCAGGTG
CGGGCGCTGC TGTACAACAC CTACCGGCTC TCCGATTTCC ATCGTGCGCA GCGGGATTTC
ATGGAAAAGT CCTATATCGG CAAGCTGGTG GTGGTGCCTG ATCGAAAATG GGACGAGGTG
GGCCGCCCCC ATGCGCGCTA A
 
Protein sequence
MAKKTMHAVQ LTRHGDLDAL VYRDDVPRPE PARGEVLIEV SACGMNNTDV WVRQGAYGTE 
TDPDSVSTWR RGRSTLTFPR IQGTDIVGTV VAVGEGVPEA RIGERVMVDF SLYNRADDSL
ADIDYIGHGR DGGYAEYTAV PSENAHVVDT DMSDAELATF CCAYLTGEHM LERARVQAGE
RVLVTGASGG VGSGIIQLCR ARGAIPYAVT SRDKAEAVRG IGAEAVIPRE SGDLVTAVDQ
ATEGRPIDVV ADLVAGPLFN DLLRVLRPEG RYTTAGAIAG PVVQLDLRTL YLKHLQLHGS
SQGTRGDFRR LVGYIERGQV RALLYNTYRL SDFHRAQRDF MEKSYIGKLV VVPDRKWDEV
GRPHAR