Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mchl_5043 |
Symbol | |
ID | 7113646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | + |
Start bp | 5391054 |
End bp | 5394023 |
Gene Length | 2970 bp |
Protein Length | 989 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643527737 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_002423736 |
Protein GI | 218532920 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.331865 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.361747 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAACG GCCCCGAACC GCACGGCAAC AAGATCGAAC AGCCCGAGAT CCGCGCCGAC GAACGTCAGG ATGCCGGCGG GCCGGCAAAT GGCGCGCCAT CGACCTCCGG CGGCGCCTAC TCGCAGGGCG CCAAGTCGGG TGGCCAGGCC GCGCCGGACC CTTCCGGCAC CTACGGCATC AAGGACGCCC CGGTCGCGCC TGCGACCATC GCCTTCGAGT TCGACGGCCA ACAGGTCGAG GCCCAGCCCG GCGAGACGAT CTGGGCGGTC GCCAAGCGCC TCGGCACGCA TATTCCGCAT CTCTGCCACA AGCCCGATCC CGGCTACCGG CCGGACGGCA ATTGCCGCGC CTGCATGGTC GAGATCGAGG GCGAGCGCGT GCTCGCCGCC TCGTGCAAGC GCACGCCCGC CATCGGGATG AAGGTGAAGT CGGCCACCGA GCGCGCCACC AAGGCCCGCG CCATGGTGCT CGAACTGCTC GTGGCCGATC AGCCCGAGCG TGCGACCTCG CATGACCCGT CGTCGCATTT CTGGGTGCAG GCCGACGTTC TCGACGTGAC CGAGAGCCGC TTCCCGGCGG CCGAGCGCTG GACCAGCGAC GTCAGCCATC CGGCGATGAG CGTCAACCTC GACGCCTGCA TCCAGTGCAA TCTCTGTGTG CGCGCCTGCC GCGAGGTTCA GGTCAACGAC GTGATCGGCA TGGCCTACCG CGCCGCGGGC TCCAAGGTCG TGTTCGACTT CGACGATCCG ATGGGTGGCT CCACCTGCGT CGCCTGCGGC GAGTGCGTCC AGGCCTGCCC GACCGGCGCG CTGATGCCGG CCGCCTATCT CGACGCCAAC CAGACCCGGA CGGTCTATCC CGACCGCGAG GTGAAGTCGC TCTGCCCCTA TTGCGGCGTC GGCTGCCAAG TCTCCTACAA GGTCAAGGAC GAGCGCATCG TCTACGCCGA GGGCGTGAAC GGACCGGCCA ACCAGAACCG GCTCTGCGTG AAGGGCCGCT TCGGCTTCGA CTACGTCCAC CACCCCCACC GCCTGACGGT GCCGCTGATC CGCCTGGAGA ACGTGCCCAA GGACGCCAAC GATCAGGTCG ATCCGGCGAA CCCCTGGACG CATTTCCGCG AGGCGACTTG GGACGAGGCG CTCGACCGCG CGGCGGGCGG CCTGAAGACG ATCCGTGACA CCAACGGGCG CAAGGCGCTG GCGGGCTTCG GCTCGGCCAA GGGTTCGAAC GAGGAGGCGT ACCTCTTCCA GAAGCTCGTC CGCCTCGGCT TCGGCACCAA CAACGTCGAT CACTGCACGC GCCTGTGCCA CGCCTCGTCG GTGGCCGCGC TGATGGAAGG CCTGAATTCC GGCGCCGTCA CCGCTCCCTT CTCGGCAGCG CTCGACGCCG AAGTCATCGT CGTCATCGGC GCCAACCCGA CCGTGAACCA TCCGGTCGCG GCGACCTTCC TCAAGAACGC GGTGAAGCAG CGCGGCGCCA AGCTGATCAT CATGGACCCG CGGCGCCAGA CGCTCTCGCG CCACGCCTAT CGGCACCTCG CCTTCCGCCC CGGCTCGGAC GTGGCGATGC TCAACGCGAT GCTCAACGTG ATCGTCACGG AGGGCCTCTA CGACGAGCAG TACATCGCCG GCTACACCGA GAACTTCGAG GCCCTGCGCG AGAAGATCGT CGACTTCACG CCGGAGAAGA TGGCTTCGGT CTGCGGCATC GACGCCGAGA CCCTGCGCGA GGTCGCCCGG CTCTACGCCC GGGCCAAGTC GTCGCTCATC TTCTGGGGCA TGGGCGTCAG CCAGCACGTC CACGGCACCG ACAACTCTCG CTGCCTGATC GCGCTCGCCC TCATCACCGG CCAGATCGGC CGGCCCGGCA CCGGCCTGCA CCCGTTGCGC GGTCAGAACA ACGTCCAGGG CGCGTCCGAT GCCGGCCTGA TCCCGATGGT CTACCCGGAC TATCAGTCGG TCGAGAAGGA CGCGGTGCGC GAGCTGTTCG AGGAGTTCTG GGGCCAGTCC CTCGATCCGC AGAAGGGCCT CACCGTGGTC GAGATCATGC GCGCGATCCA CGCGGGCGAG ATCCGGGGCA TGTTCGTCGA GGGCGAGAAC CCGGCGATGT CCGACCCCGA CCTCAACCAC GCCCGCCACG CGCTGGCGAT GCTCGACCAC CTCGTGGTAC AGGACCTGTT CCTGACTGAG ACCGCCTTCC ACGCCGACGT GGTGCTGCCG GCCTCGGCCT TTGCCGAGAA GGCCGGCACC TTCACCAACA CCGACCGGCG CGTGCAGATC GCCCAGCCCG TCGTCGCCCC TCCGGGCGAT GCGCGCCAGG ATTGGTGGAT CATCCAGGAA CTGGCCCGAC GCCTCGACCT CGACTGGAAC TACGGCGGCC CGGCCGACAT CTTCGCCGAA ATGGCGCAGG TGATGCCGTC CTTGAACAAC ATCACCTGGG AGCGTCTGGA GCGCGAGGGG GCGGTGACCT ATCCGGTCGA TGCCCCGGAC CAGCCTGGCA ACGAGATCAT CTTCTATGCC GGCTTCCCGA CCGAGAGCGG TCGCGCCAAG ATCGTGCCCG CGGCGATCGT GCCGCCGGAC GAGGTGCCGG ACGACGAGTT CCCGATGGTG CTCTCGACCG GCCGCGTGCT CGAACACTGG CACACCGGCT CGATGACCCG GCGCGCGGGC GTGCTCGACG CGCTGGAGCC GGAGGCGGTG GCGTTCATGG CACCCAAGGA GCTCTACCGG CTCGGTCTCC GGCCCGGCGG GTCGATGCGG TTGGAAACAC GGCGCGGCGC CGTCGTGTTG AAGGTCCGCT CCGACCGGGA CGTGCCGATC GGCATGATCT TCATGCCCTT CTGCTACGCG GAAGCCGCTG CCAACCTTCT GACCAACCCC GCCCTCGACC CCCTCGGAAA GATTCCCGAG TTCAAATTCT GCGCAGCCCG CGTCGTCCCC GCGGAGGCTG CGCCGATGGC CGCCGAGTAA
|
Protein sequence | MSNGPEPHGN KIEQPEIRAD ERQDAGGPAN GAPSTSGGAY SQGAKSGGQA APDPSGTYGI KDAPVAPATI AFEFDGQQVE AQPGETIWAV AKRLGTHIPH LCHKPDPGYR PDGNCRACMV EIEGERVLAA SCKRTPAIGM KVKSATERAT KARAMVLELL VADQPERATS HDPSSHFWVQ ADVLDVTESR FPAAERWTSD VSHPAMSVNL DACIQCNLCV RACREVQVND VIGMAYRAAG SKVVFDFDDP MGGSTCVACG ECVQACPTGA LMPAAYLDAN QTRTVYPDRE VKSLCPYCGV GCQVSYKVKD ERIVYAEGVN GPANQNRLCV KGRFGFDYVH HPHRLTVPLI RLENVPKDAN DQVDPANPWT HFREATWDEA LDRAAGGLKT IRDTNGRKAL AGFGSAKGSN EEAYLFQKLV RLGFGTNNVD HCTRLCHASS VAALMEGLNS GAVTAPFSAA LDAEVIVVIG ANPTVNHPVA ATFLKNAVKQ RGAKLIIMDP RRQTLSRHAY RHLAFRPGSD VAMLNAMLNV IVTEGLYDEQ YIAGYTENFE ALREKIVDFT PEKMASVCGI DAETLREVAR LYARAKSSLI FWGMGVSQHV HGTDNSRCLI ALALITGQIG RPGTGLHPLR GQNNVQGASD AGLIPMVYPD YQSVEKDAVR ELFEEFWGQS LDPQKGLTVV EIMRAIHAGE IRGMFVEGEN PAMSDPDLNH ARHALAMLDH LVVQDLFLTE TAFHADVVLP ASAFAEKAGT FTNTDRRVQI AQPVVAPPGD ARQDWWIIQE LARRLDLDWN YGGPADIFAE MAQVMPSLNN ITWERLEREG AVTYPVDAPD QPGNEIIFYA GFPTESGRAK IVPAAIVPPD EVPDDEFPMV LSTGRVLEHW HTGSMTRRAG VLDALEPEAV AFMAPKELYR LGLRPGGSMR LETRRGAVVL KVRSDRDVPI GMIFMPFCYA EAAANLLTNP ALDPLGKIPE FKFCAARVVP AEAAPMAAE
|
| |