Gene Mlg_1412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1412 
Symbol 
ID4270410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1618506 
End bp1619522 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content71% 
IMG OID638126168 
Productalcohol dehydrogenase 
Protein accessionYP_742251 
Protein GI114320568 
COG category[R] General function prediction only 
COG ID[COG1064] Zn-dependent alcohol dehydrogenases 
TIGRFAM ID[TIGR02822] zinc-binding alcohol dehydrogenase family protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.498942 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGCAG AGGATACGGA ACGCACCACC ATGCGGGTGA TGGCGCTGGA GGCCCCGGGG 
CAGGCCCTGC AGGCCCAATC CTGGCCCGTC CCGGAGCCCG GCATCGGGCA ACTCCGCCTC
CGGGTGCGCG CCTGCGCCGT CTGCCGCACG GATCTGCACG TGGTGGACGG CGAGTTGCCG
GACCCCGTAC TGCCCATCAT CCCGGGCCAT GAGATCGTCG GTGTGGTGGA CCGCGTGGGC
GAGGGCTGCC AAAGGTACCG CCCGGGCGAC CGGGTGGGGG TGCCCTGGCT CGGCCATACC
TGCGGGACCT GCGATCATTG CCGCGCCGGC CGGGAAAACC TCTGCGACCA GGCACGGTTC
ACCGGCTACC AGTTGCAGGG CGGTTACGCT GAATACGCCA TCGCCGACGA GCGGTTCTGC
TTCCCGATCC CCGCCGCGTA CACCGACGCC GGCGCCGCGC CACTGTTGTG CGCCGGGCTC
ATCGGTCACC GCTCACTGAG CATGGCCGGC GACGACGCCC GCCGTCTGGG GATCTACGGT
TTCGGTGCGG CGGCCCATAT CGTGGCCCAG GTGGCGCGCC ACCAGGAGCG CGACCTCTAC
GCCTTCACCC GTCCGGGGGA TCAAAAGGCC CAGGCGTTTG CCCGCGCCCT CGGCGCCTGC
TGGGCCGGGC CCTCGGACCG GCTGCCACCC AAGCCATTGG ACGCGGCCAT CATCTTTGCC
CCCGTCGGCG ACCTGGTCCC CCAGGCCCTG CGTGCGGTGC GCAAGGGCGG CCGGGTGGTC
TGTGGCGGCA TCCACATGAG CGATATCCCC TCGTTCCCCT ACGCCTGGCT CTGGGGCGAA
CGCAGCCTCT GCTCGGTCGC CAACCTGACC CGGGCCGATG GTGAGGCCTT CATGGCCCTT
GCGCCCGAGG TGCCGGTACG CACGGAAGTG GTGGAGTACC CCCTGGACCA GGCCAATCAG
GCCCTGGATG ACCTGCGCGG CGGTCGCCTG CAGGGCGCCG CCGTGCTGAT CCCTTAG
 
Protein sequence
MEAEDTERTT MRVMALEAPG QALQAQSWPV PEPGIGQLRL RVRACAVCRT DLHVVDGELP 
DPVLPIIPGH EIVGVVDRVG EGCQRYRPGD RVGVPWLGHT CGTCDHCRAG RENLCDQARF
TGYQLQGGYA EYAIADERFC FPIPAAYTDA GAAPLLCAGL IGHRSLSMAG DDARRLGIYG
FGAAAHIVAQ VARHQERDLY AFTRPGDQKA QAFARALGAC WAGPSDRLPP KPLDAAIIFA
PVGDLVPQAL RAVRKGGRVV CGGIHMSDIP SFPYAWLWGE RSLCSVANLT RADGEAFMAL
APEVPVRTEV VEYPLDQANQ ALDDLRGGRL QGAAVLIP