Gene Information |
Locus tag | Mlg_1286 |
Symbol | |
ID | 4269052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1488097 |
End bp | 1490985 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638126036 |
Product | formate dehydrogenase alpha subunit |
Protein accession | YP_742125 |
Protein GI | 114320442 |
COG category | [C] Energy production and conversion; [R] General function prediction only |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing; [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | |
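The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions agree with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the "Start bp" and "End bp" rows of this record.
length = gene_length_bp(1488097, 1490985)
print(length)                     # 2889, matching "Gene Length"
print(protein_length_aa(length))  # 962, matching "Protein Length"
```

The division by three applies here only because the record lists the gene as not spliced.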
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.15303 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTCG AGAAAAGAAA GGGCAGCTCG GCAGCCACTG CCGTGGGCGG CTTTCTCGGA CGGGCCGTGG ACCGGCGCAC CTTCTTGAGG GGTTCCGGCC TCGCTGTCGG GGGCACGGCG GCGGCCGCCG CCATGCTCGA GCCGCGGATG ATGCGCAAGG CGAAGGCGTC GGAGCGCGCC CGGTTTGACC CCAATGCCGA CATCAAGGAG GTGAAGACGG TCTGCACCCA CTGCTCCGTG GGGTGCGGTG TGATCGCCGA AGTGCAGAAC GGTGTCTGGG TGGGGCAGGA GCCCAATTTC GACAGCCCGA TCAACCTCGG GGCGCACTGT GCCAAGGGCG CCTCGTTACG TAACCACGGC CACTCGGGGC GGCGCACCAA GTACCCGATG AAGCTGGTCA ATGGCCGCTG GGAGCGTATC GGCTGGGAGC AGGCCATCGA GGAGATCGGC GACAAGCTGC TGCAGATCCG CGAGGAGTCG GGTCCGGACG CACTCTGGTT CGCCGGCTCG TCCAAGGCCA GCAACGAGGG GGCCTACCTG CAGCGCAAGT TCGCGGCCTT CTGGGGCTCG AACAACTGTG ACCACCAGGC GCGTATCTGC CACTCCACCA CGGTGGCGGG GGTGGCGAAC ACCTGGGGTT ATGGTGCCAT GACCAACTCC TACAACGACA TCACGAAGTC CAACTGCATC GTGATGTGCG GCTCCAATGC GGCCGAGGCC CACCCGGTCG CCATGCAGAT GATCCTGCGG GGAAAGGAGA ATGGCGCCAA ACTGATTGTT ATGGATCCGC GTTTCACCCG CACCGCCGCC CACTCCGATA TCTTCGTGCG GCCCCGGTCG GGCACCGACG TGGCGCTGAT CTACGGCATC CTGTGGCATA TCTTCGAGAA CGGCTGGGAG GACAGCACCT ACATCCGCCA GCGGGTCTGG GGCATGGATC GGGTGCGCGA GGAGGTGGCC CACTGGACCC CCGAAGAGGT GGAGCGGGTG GCCGGCGTCG GCGAGGAGGC GCTGCGCGAT GTGGCCAAGA CCATCGTCGA CAACAGCCCG ATGACCTTTA TTTGGTGTAT GGGCGGCACC CAGCACACGA TCGGTAACAA CTACGTGCGC GCCTACAATC ACCTGCTGCT GGCCACCGGC AACGTCGGGG TCTCCGGCGG CGGGGCCAAT ATCTTCCGCG GTCACGACAA CGTCCAGGGG GCCACCGACG TCGGCCCCAA CGCCCACATC CTGCCCGGCT ACTTCCCCCA CTCCGAGGGG GGCTGGCGGC ACTGGGCCCG GGTCTGGGAC GTGGACTACG AGTACCTGCA GGGCCGGTTT GACCCGGCGG AGTACGACGA CGGCTCCGGT GGCCAGGCGA AGCCCATGCA CATGCCGGGC ATCCCGGTAT CGCGCTGGAT CGACGGGGTG CTCGAGGATG CGGACAACCT CTCGCAAAAA GACAACATCC GCGCCGTGAT GTTCTGGGGC CACGCGCCCA ACAGTCAGAC CCGGCTGCCG GAGATGAAGG AGGCCATGGA GAAGCTGGAT CTCCTGGTCA TCGCGGACCC CTATCCCACG GTCTCCGCTG TGCTGCATGA CCGGCAGAAC GACACCTACC TGCTGCCCGC CTGCACCCAG TTCGAGACCC GGGGCTCGGC CACCGCCTCC AACCGCTCCC TGCAGTGGCG CGACAAGGTC ATCGAGCCGC TGTTCGAGGG CAAGCCGGAT CACGAGATCG CCTATCTGCT CGCCCGCAAG CTCGGGTTTG CCGACGAGAT GTTCAAGAAC ATCCGCATCG AGGGCACCGA GCCGGTGGTG 
GAGGATGTGC TGCGGGAGAT CAACCGGGGA ACCTGGACCA TCGGCTACAC CGGCCAGAGC CCGGAGCGGC TCAAGAAGCA CATGGAGAAC CAGGCCACCT TCGATGTCGT CTCGCTGAAA GCGGAGGGCG GTCCCTGTGA CGGAGAGTAC TACGGCCTGC CGTGGCCCTG CTGGGGCACG GCGGAGATGG GTCACCCGGG TACCCCCAAC CTGTATGACA CCAGCAAGCA TGTGGCCGAG GGCGGACTCA CCTTCCGCGC CCGTTTTGGC ACCGAGTACG AGGGCGAGAA CCTGTTGGCT GAGGGCTCCT ACTCCAAGGG CTCGGCCATC CGGGACGGTT ACCCGGAGAT CACCGCCGAC ATGATCAAGC AGCTCGGCTG GTGGGATGAT CTCACTGACG AGGAGAAGGC GCAGGCCGAG GGCCGGGACT GGAAGACCGA TCTCTCCGGC GGGCTGCAGC GGGTGGCCAT CAAGCACGGT CTGGCCCCCT TCGGTAACGC CAAGGCCCGC TGCTACGTCT GGAACTTCCC GGACCCCGTG CCGGTTCACC GCGAGCCGCT GTACACCCCG CGGTATGACC TGGTGTCCGA CTACCCGACC TACGAGGACC GCAAGGCTTT CTATCGGCTG CCCACCCGCT GGGCCTCCGT TCAGGCCCAG GACTACTCCG GCGACTACCC GCTGATCCAC ACCAGCGGCC GCCTGGTGGA GTACGAGGGC GGCGGTGAGG AGACCCGTTC CAACCCCTGG CTGGCGGAGC TCAAGCAGGA GATGTTCGTG GAGATCAACC CGCGCGACGC CAACAACGCC GGGGTGCGGA ATAACGAGTG GGTCTGGCTG GAGGGCCCCG AGGGCGGGCG CATCCGCATC AAGGCCCTGG TCACCCGTCG GGTGGCGGCG GGCACGGTGT TCACGCCCTT CCACTTCGGC GGCCACTTCC AGGGTGAGGA CCTGCTGGCC AAGTACCCGG AGGGGGCCTC CCCCTACGTG CGTGGCGAGT CCAACAATAC CGCCACCACC TATGGGTACG ACTCGGTCAC GATGATGCAG GAGACCAAGA CCACCCTGTG CCGTGTGGTT CCGGCCTGA
|
Protein sequence | MKLEKRKGSS AATAVGGFLG RAVDRRTFLR GSGLAVGGTA AAAAMLEPRM MRKAKASERA RFDPNADIKE VKTVCTHCSV GCGVIAEVQN GVWVGQEPNF DSPINLGAHC AKGASLRNHG HSGRRTKYPM KLVNGRWERI GWEQAIEEIG DKLLQIREES GPDALWFAGS SKASNEGAYL QRKFAAFWGS NNCDHQARIC HSTTVAGVAN TWGYGAMTNS YNDITKSNCI VMCGSNAAEA HPVAMQMILR GKENGAKLIV MDPRFTRTAA HSDIFVRPRS GTDVALIYGI LWHIFENGWE DSTYIRQRVW GMDRVREEVA HWTPEEVERV AGVGEEALRD VAKTIVDNSP MTFIWCMGGT QHTIGNNYVR AYNHLLLATG NVGVSGGGAN IFRGHDNVQG ATDVGPNAHI LPGYFPHSEG GWRHWARVWD VDYEYLQGRF DPAEYDDGSG GQAKPMHMPG IPVSRWIDGV LEDADNLSQK DNIRAVMFWG HAPNSQTRLP EMKEAMEKLD LLVIADPYPT VSAVLHDRQN DTYLLPACTQ FETRGSATAS NRSLQWRDKV IEPLFEGKPD HEIAYLLARK LGFADEMFKN IRIEGTEPVV EDVLREINRG TWTIGYTGQS PERLKKHMEN QATFDVVSLK AEGGPCDGEY YGLPWPCWGT AEMGHPGTPN LYDTSKHVAE GGLTFRARFG TEYEGENLLA EGSYSKGSAI RDGYPEITAD MIKQLGWWDD LTDEEKAQAE GRDWKTDLSG GLQRVAIKHG LAPFGNAKAR CYVWNFPDPV PVHREPLYTP RYDLVSDYPT YEDRKAFYRL PTRWASVQAQ DYSGDYPLIH TSGRLVEYEG GGEETRSNPW LAELKQEMFV EINPRDANNA GVRNNEWVWL EGPEGGRIRI KALVTRRVAA GTVFTPFHFG GHFQGEDLLA KYPEGASPYV RGESNNTATT YGYDSVTMMQ ETKTTLCRVV PA
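The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last few codons. The sketch below builds the codon table from the compact NCBI-style string for translation table 11, whose amino-acid assignments are the same as the standard code. The `translate` helper is hypothetical and handles only unambiguous A/C/G/T input; it is not the database's own tooling.

```python
from itertools import product

# Codons in T, C, A, G order with the third base varying fastest,
# paired with the amino-acid string for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing used above
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAACTCG AGAAAAGAAA GGGCAGCTCG"))  # MKLEKRKGSS
# Last 12 bases of the gene sequence, ending in the TGA stop codon.
print(translate("GTTCCGGCCTGA"))  # VPA
```

Both outputs match the corresponding ends of the protein sequence listed above. Running `translate` on the full gene sequence would be the natural extension.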