Gene Mlg_1286 details


Gene Information

Locus tag: Mlg_1286
Symbol:
ID: 4269052
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1488097
End bp: 1490985
Gene Length: 2889 bp
Protein Length: 962 aa
Translation table: 11
GC content: 66%
IMG OID: 638126036
Product: formate dehydrogenase alpha subunit
Protein accession: YP_742125
Protein GI: 114320442
COG category: [C] Energy production and conversion; [R] General function prediction only
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing; [COG3383] Uncharacterized anaerobic dehydrogenase
TIGRFAM ID:
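
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative and not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The helper names span_length and expected_protein_length are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1488097
end_bp = 1490985
gene_length_bp = 2889
protein_length_aa = 962


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions the fields agree: the span covers 2889 bp, which is 963 codons, or 962 residues plus one stop codon.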


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.15303
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
ATGAAACTCG AGAAAAGAAA GGGCAGCTCG GCAGCCACTG CCGTGGGCGG CTTTCTCGGA 
CGGGCCGTGG ACCGGCGCAC CTTCTTGAGG GGTTCCGGCC TCGCTGTCGG GGGCACGGCG
GCGGCCGCCG CCATGCTCGA GCCGCGGATG ATGCGCAAGG CGAAGGCGTC GGAGCGCGCC
CGGTTTGACC CCAATGCCGA CATCAAGGAG GTGAAGACGG TCTGCACCCA CTGCTCCGTG
GGGTGCGGTG TGATCGCCGA AGTGCAGAAC GGTGTCTGGG TGGGGCAGGA GCCCAATTTC
GACAGCCCGA TCAACCTCGG GGCGCACTGT GCCAAGGGCG CCTCGTTACG TAACCACGGC
CACTCGGGGC GGCGCACCAA GTACCCGATG AAGCTGGTCA ATGGCCGCTG GGAGCGTATC
GGCTGGGAGC AGGCCATCGA GGAGATCGGC GACAAGCTGC TGCAGATCCG CGAGGAGTCG
GGTCCGGACG CACTCTGGTT CGCCGGCTCG TCCAAGGCCA GCAACGAGGG GGCCTACCTG
CAGCGCAAGT TCGCGGCCTT CTGGGGCTCG AACAACTGTG ACCACCAGGC GCGTATCTGC
CACTCCACCA CGGTGGCGGG GGTGGCGAAC ACCTGGGGTT ATGGTGCCAT GACCAACTCC
TACAACGACA TCACGAAGTC CAACTGCATC GTGATGTGCG GCTCCAATGC GGCCGAGGCC
CACCCGGTCG CCATGCAGAT GATCCTGCGG GGAAAGGAGA ATGGCGCCAA ACTGATTGTT
ATGGATCCGC GTTTCACCCG CACCGCCGCC CACTCCGATA TCTTCGTGCG GCCCCGGTCG
GGCACCGACG TGGCGCTGAT CTACGGCATC CTGTGGCATA TCTTCGAGAA CGGCTGGGAG
GACAGCACCT ACATCCGCCA GCGGGTCTGG GGCATGGATC GGGTGCGCGA GGAGGTGGCC
CACTGGACCC CCGAAGAGGT GGAGCGGGTG GCCGGCGTCG GCGAGGAGGC GCTGCGCGAT
GTGGCCAAGA CCATCGTCGA CAACAGCCCG ATGACCTTTA TTTGGTGTAT GGGCGGCACC
CAGCACACGA TCGGTAACAA CTACGTGCGC GCCTACAATC ACCTGCTGCT GGCCACCGGC
AACGTCGGGG TCTCCGGCGG CGGGGCCAAT ATCTTCCGCG GTCACGACAA CGTCCAGGGG
GCCACCGACG TCGGCCCCAA CGCCCACATC CTGCCCGGCT ACTTCCCCCA CTCCGAGGGG
GGCTGGCGGC ACTGGGCCCG GGTCTGGGAC GTGGACTACG AGTACCTGCA GGGCCGGTTT
GACCCGGCGG AGTACGACGA CGGCTCCGGT GGCCAGGCGA AGCCCATGCA CATGCCGGGC
ATCCCGGTAT CGCGCTGGAT CGACGGGGTG CTCGAGGATG CGGACAACCT CTCGCAAAAA
GACAACATCC GCGCCGTGAT GTTCTGGGGC CACGCGCCCA ACAGTCAGAC CCGGCTGCCG
GAGATGAAGG AGGCCATGGA GAAGCTGGAT CTCCTGGTCA TCGCGGACCC CTATCCCACG
GTCTCCGCTG TGCTGCATGA CCGGCAGAAC GACACCTACC TGCTGCCCGC CTGCACCCAG
TTCGAGACCC GGGGCTCGGC CACCGCCTCC AACCGCTCCC TGCAGTGGCG CGACAAGGTC
ATCGAGCCGC TGTTCGAGGG CAAGCCGGAT CACGAGATCG CCTATCTGCT CGCCCGCAAG
CTCGGGTTTG CCGACGAGAT GTTCAAGAAC ATCCGCATCG AGGGCACCGA GCCGGTGGTG
GAGGATGTGC TGCGGGAGAT CAACCGGGGA ACCTGGACCA TCGGCTACAC CGGCCAGAGC
CCGGAGCGGC TCAAGAAGCA CATGGAGAAC CAGGCCACCT TCGATGTCGT CTCGCTGAAA
GCGGAGGGCG GTCCCTGTGA CGGAGAGTAC TACGGCCTGC CGTGGCCCTG CTGGGGCACG
GCGGAGATGG GTCACCCGGG TACCCCCAAC CTGTATGACA CCAGCAAGCA TGTGGCCGAG
GGCGGACTCA CCTTCCGCGC CCGTTTTGGC ACCGAGTACG AGGGCGAGAA CCTGTTGGCT
GAGGGCTCCT ACTCCAAGGG CTCGGCCATC CGGGACGGTT ACCCGGAGAT CACCGCCGAC
ATGATCAAGC AGCTCGGCTG GTGGGATGAT CTCACTGACG AGGAGAAGGC GCAGGCCGAG
GGCCGGGACT GGAAGACCGA TCTCTCCGGC GGGCTGCAGC GGGTGGCCAT CAAGCACGGT
CTGGCCCCCT TCGGTAACGC CAAGGCCCGC TGCTACGTCT GGAACTTCCC GGACCCCGTG
CCGGTTCACC GCGAGCCGCT GTACACCCCG CGGTATGACC TGGTGTCCGA CTACCCGACC
TACGAGGACC GCAAGGCTTT CTATCGGCTG CCCACCCGCT GGGCCTCCGT TCAGGCCCAG
GACTACTCCG GCGACTACCC GCTGATCCAC ACCAGCGGCC GCCTGGTGGA GTACGAGGGC
GGCGGTGAGG AGACCCGTTC CAACCCCTGG CTGGCGGAGC TCAAGCAGGA GATGTTCGTG
GAGATCAACC CGCGCGACGC CAACAACGCC GGGGTGCGGA ATAACGAGTG GGTCTGGCTG
GAGGGCCCCG AGGGCGGGCG CATCCGCATC AAGGCCCTGG TCACCCGTCG GGTGGCGGCG
GGCACGGTGT TCACGCCCTT CCACTTCGGC GGCCACTTCC AGGGTGAGGA CCTGCTGGCC
AAGTACCCGG AGGGGGCCTC CCCCTACGTG CGTGGCGAGT CCAACAATAC CGCCACCACC
TATGGGTACG ACTCGGTCAC GATGATGCAG GAGACCAAGA CCACCCTGTG CCGTGTGGTT
CCGGCCTGA
 
Protein sequence
MKLEKRKGSS AATAVGGFLG RAVDRRTFLR GSGLAVGGTA AAAAMLEPRM MRKAKASERA 
RFDPNADIKE VKTVCTHCSV GCGVIAEVQN GVWVGQEPNF DSPINLGAHC AKGASLRNHG
HSGRRTKYPM KLVNGRWERI GWEQAIEEIG DKLLQIREES GPDALWFAGS SKASNEGAYL
QRKFAAFWGS NNCDHQARIC HSTTVAGVAN TWGYGAMTNS YNDITKSNCI VMCGSNAAEA
HPVAMQMILR GKENGAKLIV MDPRFTRTAA HSDIFVRPRS GTDVALIYGI LWHIFENGWE
DSTYIRQRVW GMDRVREEVA HWTPEEVERV AGVGEEALRD VAKTIVDNSP MTFIWCMGGT
QHTIGNNYVR AYNHLLLATG NVGVSGGGAN IFRGHDNVQG ATDVGPNAHI LPGYFPHSEG
GWRHWARVWD VDYEYLQGRF DPAEYDDGSG GQAKPMHMPG IPVSRWIDGV LEDADNLSQK
DNIRAVMFWG HAPNSQTRLP EMKEAMEKLD LLVIADPYPT VSAVLHDRQN DTYLLPACTQ
FETRGSATAS NRSLQWRDKV IEPLFEGKPD HEIAYLLARK LGFADEMFKN IRIEGTEPVV
EDVLREINRG TWTIGYTGQS PERLKKHMEN QATFDVVSLK AEGGPCDGEY YGLPWPCWGT
AEMGHPGTPN LYDTSKHVAE GGLTFRARFG TEYEGENLLA EGSYSKGSAI RDGYPEITAD
MIKQLGWWDD LTDEEKAQAE GRDWKTDLSG GLQRVAIKHG LAPFGNAKAR CYVWNFPDPV
PVHREPLYTP RYDLVSDYPT YEDRKAFYRL PTRWASVQAQ DYSGDYPLIH TSGRLVEYEG
GGEETRSNPW LAELKQEMFV EINPRDANNA GVRNNEWVWL EGPEGGRIRI KALVTRRVAA
GTVFTPFHFG GHFQGEDLLA KYPEGASPYV RGESNNTATT YGYDSVTMMQ ETKTTLCRVV
PA
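
The gene and protein sequences above are related by translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: the function names are hypothetical, the codon assignments are those of the NCBI standard table (which table 11 shares, differing only in its permitted start codons), and alternative start codons are not handled because this gene begins with ATG.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence and the final nine bases.
print(translate("ATGAAACTCG AGAAAAGAAA GGGCAGCTCG"))  # MKLEKRKGSS
print(translate("CCGGCCTGA"))  # PA
```

The two fragments translate to the first ten and the last two residues of the protein sequence listed above. Passing the full gene sequence to gc_percent gives a value to compare with the GC content field.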