Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA1391 |
Symbol | |
ID | 3104957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | - |
Start bp | 1474796 |
End bp | 1477639 |
Gene Length | 2844 bp |
Protein Length | 947 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637170566 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_113849 |
Protein GI | 53804301 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCTAC TAAAAGACAA AGATTTCGGC ACGCCGCCCA GCACTTCCGA CAAAATGGTC ACCCTGGAAA TCGACGGCTT CACCGCCACC GTGCCCGAAG GCACCTCGGT CATGCGGGCG GCCGCCAGTA TCGGCATCGA CATCCCCAGG CTCTGTGCCA CCGACAGCCT GGAACCGTTC GGTTCCTGCC GTTTGTGCGT GGTCCAGATC GAAGGCGGGC GCGGGCTTCC GGCCTCCTGT ACCACGCCCG TGTTCGAGGG CATGAAGGTC GTGACCCAGA ACGACCGGCT GGCCCAGATC CGCCGCGGCA TCATGGAGCT GTACATCTCC GATCACCCCC TCGATTGCCT GACCTGTTCC GCCAATGGCA ACTGCGAGCT CCAGGACATG GCCGGCGTGG TCGGTCTGCG GGAAGTACGC TACGGCTACG AAGGCGCAAA CCATTTCGAT GCGAAAAAGG ACCTGAGCAA TCCGTACTTC CAGTTCGAGC CCTCCAAGTG CATCGTCTGC TCGCGCTGTG TCCGCGCCTG CGAACAGACC CAGGGCACCT TCGCTCTCAC CATCGACGGC CGCGGCTTCG ACTCCAAGAT CTCACCCGGA CAGAACCAGG CCTTCATGGA TTCCGAGTGC GTCTCCTGCG GCGCCTGTGT CCAGGCCTGC CCGACCGCTA CACTCATCGA GAAATCCGTG GTCGAAAAGG GCCAGGCCGA TCATTCGATC ATCACGACCT GTGCTTATTG CGGCGTCGGC TGTTCGTTCA AGGCGGAGAT GAAGGGTTCC GAGGTCGTAC GGATGGTCCC GGACAAGAAC GGCCAGGCCA ACCATGGCCA TTCCTGCGTC AAAGGCCGCT TCGCCTTCGG CTATGCGACC CATCCGGACC GTATCACCTC GCCGATGATC CGCAAGAGCA TCCATGATCC CTGGCAGAAA GTGAGCTGGG AAGAGGCGAT CAACTATGCC GCCTCGGAAT TCAAGCGCAT CCAGGCCAAA TACGGCAAGG ATTCGGTCGG AGGACTCACC TCCAGCCGCT GCACCAACGA GGAGGCCTAC CTGGTGCAGA AGCTGGTCCG CGCCGCTTTC GGCAACAACA ACGTCGACAC CTGCGCCCGC GTCTGCCACT CACCCACCGG TTATGGCCTC AAGCAGACCC TGGGCGAGTC GGCCGGGACC CAGACCTTCG ACTCGGTGAT GAAGTCCGAC GTGATCATGG TCATCGGCGC CAATCCCACC GACGGCCATC CGGTGTTCGG CTCGCTGATG AAGAAGCGAG TGCGCCAGGG CGCCAGGCTC ATCGTGGTCG ACCCGCGCGA CATCGACATG GTCCGCTCGC CGCACATCGA GGCCGATTAT CACCTGAAGC TCAAGCCCGG CACCAATGTC GCCGTGGTCA ACGCGCTCGC CCATGTCATC GTCAGAGAAG GGCTGGTCGA CGAAGCCTTC GTGAACGCGC GCTGCGAGGC GGAGTCCTTC AGCAAATGGA AGGATTTCGT CGCCCAGGAA CGCAACTCGC CGGAAGCCAT GGAGTCCGTC ACCGGCGTAC CGGCACAGAC CCTGCGCGAA GCGGCGCGGC TCTATGCCAC CGGCGGCAAT GCGGCGATCT ACTACGGCCT GGGCGTCACC GAGCACAGCC AGGGCTCGAC CACCGTGATC GCCATCGCCA ACCTGGCCAT GGCGACCGGC AACGTCGGCC GCGAGGGCGT GGGCGTCAAT CCGCTGCGCG GGCAGAACAA CGTGCAGGGT TCCTGCGACA TGGGCTCCTT CCCCCACGAG CTGCCAGGTT ATCGCCATGT CTCAGACAGC ACGGTGCGGG CCCAATTCGA AGCGGCCTGG AACACCCGGC TGGATCCCGA ACCAGGCCTG CGCATTCCCA ACATGTTCGC AGCCGCGCTG GAGGGCAGCT TCAAGGGACT CTACTGCGAG GGTGAGGACA TCGCCCAGTC CGACCCGGAC ACCCAGCACG TGTTCGCGGC GCTGGAGGCC ATGGAATGCG TCGTCGTTCA GGATCTGTTC CTGAACGAAA CCGCCAAGTT CGCCCACGTC TTCCTTCCCG GCGCCTCCTT CCTGGAAAAG AGCGGTACTT TCACCAATGC CGAACGACGC ATTTCGCCGG TACGCCGGGT GATGCCGCCG CTGGCCGGTT ACGAGGACTG GCAAGTGACG CAGATGCTGG CCAACGCCCT GGGTTATCCG ATGAACTACA GCCATGCCTC GGAGATCCTG GACGAAATCG CCAGCCTGGT GCCGACCTTC AGCAAGGTCA GCTTCAAGCG CCTGGACGAG GTCGGCAGCC TGCAGTGGCC ATGCAACGAG GAGGCGCCTA ACGGCACGCC CACCATGCAC GTCGACCATT TCGTGCGGGG GAAAGGCCGG TTCATGCTCA CGGAATACGT CCCCACAGAG GAACGCACCA GCAGCAAATT CCCGCTGATC CTGACTACCG GCCGCATCCT GTCGCAGTAC AACGTGGGCG CGCAAACCCG CCGCACTGCG AACACGGCCT GGCATTCGGA AGACCGTCTG GAGATCCATC CCCACGACGC CGAGAACCGG GGCATCAAGG ACGGCGACTG GGTCGGCATC AAGAGCCGGC AGGGCGAGAC GGTGCTGCGG GCGGTGGTTT CCGAGCGCAT GCAACCCGGG GTGGTCTACA CCACCTTCCA CTTCCCGGAA TCCGGCGCCA ACGTCATCAC CACCGACAAT TCGGACTGGG CCACCAATTG TCCGGAGTAC AAGGTCACCG CCGTGCAGGT TACCAAGGTC AATGCGCCTT CGGAGTGGCA GCAGAAATTC CGCAGCTTCA CGGAAGAGCA ATTGTCCTTC CTGGAAAAGG CCACGGCGGG CTGA
|
Protein sequence | MALLKDKDFG TPPSTSDKMV TLEIDGFTAT VPEGTSVMRA AASIGIDIPR LCATDSLEPF GSCRLCVVQI EGGRGLPASC TTPVFEGMKV VTQNDRLAQI RRGIMELYIS DHPLDCLTCS ANGNCELQDM AGVVGLREVR YGYEGANHFD AKKDLSNPYF QFEPSKCIVC SRCVRACEQT QGTFALTIDG RGFDSKISPG QNQAFMDSEC VSCGACVQAC PTATLIEKSV VEKGQADHSI ITTCAYCGVG CSFKAEMKGS EVVRMVPDKN GQANHGHSCV KGRFAFGYAT HPDRITSPMI RKSIHDPWQK VSWEEAINYA ASEFKRIQAK YGKDSVGGLT SSRCTNEEAY LVQKLVRAAF GNNNVDTCAR VCHSPTGYGL KQTLGESAGT QTFDSVMKSD VIMVIGANPT DGHPVFGSLM KKRVRQGARL IVVDPRDIDM VRSPHIEADY HLKLKPGTNV AVVNALAHVI VREGLVDEAF VNARCEAESF SKWKDFVAQE RNSPEAMESV TGVPAQTLRE AARLYATGGN AAIYYGLGVT EHSQGSTTVI AIANLAMATG NVGREGVGVN PLRGQNNVQG SCDMGSFPHE LPGYRHVSDS TVRAQFEAAW NTRLDPEPGL RIPNMFAAAL EGSFKGLYCE GEDIAQSDPD TQHVFAALEA MECVVVQDLF LNETAKFAHV FLPGASFLEK SGTFTNAERR ISPVRRVMPP LAGYEDWQVT QMLANALGYP MNYSHASEIL DEIASLVPTF SKVSFKRLDE VGSLQWPCNE EAPNGTPTMH VDHFVRGKGR FMLTEYVPTE ERTSSKFPLI LTTGRILSQY NVGAQTRRTA NTAWHSEDRL EIHPHDAENR GIKDGDWVGI KSRQGETVLR AVVSERMQPG VVYTTFHFPE SGANVITTDN SDWATNCPEY KVTAVQVTKV NAPSEWQQKF RSFTEEQLSF LEKATAG
|
| |