Gene Information |
Locus tag | Mchl_0421 |
Symbol | |
ID | 7116429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | + |
Start bp | 406654 |
End bp | 409596 |
Gene Length | 2943 bp |
Protein Length | 980 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643523221 |
Product | molybdopterin oxidoreductase |
Protein accession | YP_002419285 |
Protein GI | 218528469 |
COG category | [C] Energy production and conversion; [R] General function prediction only |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing; [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
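The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(406654, 409596)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 2943 980
```

Under these assumptions the computed values agree with the Gene Length (2943 bp) and Protein Length (980 aa) fields of this record.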
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.273483 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.81793 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGATCA AGCGCAAGAG CGGCGACGCT GCTCGCACCA AGCACCAGGC CGTCGCCGCC GGCCTCGCCG CGGGCGTGCT CGACCGCCGC GCCTTCCTCC GCAAGTCCGG GCTGACCGCC GGCGCGCTCG CCGCGGCCGG CACGATCCAG CTCGGCTCGG TGCGCAAGGC GCAGGCCGCC GGCTCCTCCG CCGTCGGGCC GGACACGGTC ATCAAGAAGA ACGTCTGCAC CCACTGCTCG GTGGGCTGTA CGGTGACGGC CGAAGTCGTG AACGGCGTCT GGGTCGGTCA GGAGCCGTCC TATGCCAGCC CGATCAACCG CGGCACGCAC TGCGCCAAGG GCGCGGCGAT CCGCGAGCTC GTCTCGTCCG ACCGCCGCCT CAAGTACCCG ATGAAGCTCG AAGGCGGGCA GTGGAAGCGG ATCTCGTGGG ATCAGGCATA CGAGGAGATC GGCGACAAGC TGGTCCAGAT CCGCGAGAAG AACGGCGCTG ACTCGGTCTA CTGGCTGGGT TCGGCCAAGT TCACCAACGA AGCCTCCTAC CTCATGCGCA AGTTCGCGGC CCTGTGGGGC ACGAACTCGA TCGACCATCA GGCACGCATC TGCCACTCGA CGACGGTGGC GGGCGTGGCC AACACCTGGG GCTACGGCGC CCAGACCAAT TCCTACAACG ACATCCGCAA CGCCAAGACG ATGATCATCC TCGGCGGCAA CCCCGCCGAG GCGCACCCGA TCTCCATGCA GCACGTGCTG TCGGGCAAGG AGATCAACCG CGCGAACATG ATCGTCATCG ATCCGCGCTT CACCCGCACC GCGGCGCACG CCACCGAATA CGTGCGCATC CGCTCCGGCA CCGACATTCC GGTGGTCTGG GGCATCCTCT GGCACATCTT CCAGAATGGC TGGGAGGACA AGGAGTTCAT CGCCCAGCGC GTCTACGCGA TGGACGACGT GCGCAAGGAA GTCGCCAAGT GGACGCCCGA CGAGGTCGAG CGCGTCTCCG GCGTGCCGGG CGAACAGCTC AAGCGCGTGG CGGAGAAGTT CGCCAAAGAG AAGCCGGCGA CCCTGATCTG GTGCATGGGC GCGACCCAGC ACACGGTCGG CACCGCCAAC GTGCGCGCGC TGTGCATCCT GTGCCTGGCC ACCGGCAATG TCGGCAAACC GGGCACGGGT GCCAACATCT TCCGCGGCCA CACCAACGTG CAGGGCGCGA CCGATCTCGG TCTCGATGTG ACGTCGCTGC CGCTCTATTA CGGCCTGGTC GAGGGCGGCT GGCGCCATTG GGCCCGCGTC TGGGACGTGG AGTACGATTG GCTGCAATCG CGCTTCGATG AAGTTCCGGC GAAGGGCGGC CGCAAGGCGC GCACCCGCAA GGAAAACATG GAGGCACCCG GCATCACCTC GACCCGGTGG TTCGATGCCG TGAACCTGCC GCCGGAGCAG ATCGACCAGC GCAGCCCGAT CAAGGCGTTC ATGGTGTTCG GCCACGGCGG CAACACCGTG ACCCGCATGC CCGAAGCCAT TGACGGCATG AATAAGCTCG AGTTGCTGGT CGTCGCCGAC CCGCACCCGA CCACCTTCGC AGCGCTCGAT GCCCGGCAGG ACAACACCTA CCTCCTGCCG ATCTGCACCT CGCTGGAGAT GGACGGCTCG CGCACGGCCT CGAACCGCTC GATCCAGTGG GGCGAGCAGA TCGTGAAGCC GGCTTTCGAG TCGAAGAGCG ACTACGAAGT CCTCTACCGC CTCGCGCAGA AGCTCGGTTT TGCCGACAAG CTCTGCAAGA ACATCAAGAT CGTCGACGGC 
GCCCCCGAAG CGGAAGACAT CCTGCGGGAG ATCAATCGCG GCGGCTGGTC GACCGGCTAT TGCGGCCAGT CGCCGGAGCG GCTGAAGGCG CATATGCGCA ACCAGCACAA GTTCGACCTT GTGACCTTGC GCGCGCCCAA GGACGATCCG GAGGTCGGCG GCGACTATTA CGGTCTGCCC TGGCCGTGCT GGGGCAAGCC GGAGCTGCGC CATCCCGGCT CGGCCATCCT GTACAACACC AGCCTCCACG TGAAGGACGG CGGCGGCGGC TTCCGCGCGC GGTTCGGCAC AGAACGCAAC GGCCAGACCC TGCTTGCCGA GAATTCGTTC TCGAAGGGCT CGGACCTGAC CGACGGCTAC CCCGAATTCA CCTTCGGGGT GTTCAAAAAG CTCGGCTGGG ACAAGGACCT GACGCCGGAC GAACTCGCCA CGATCCTCAA GATCGGCGGC GAGAAGCCCG ACACCGTGAG CTGGGCCACC GACCTCTCGG GCGGCATCCA GCGCGTCTGC CTCGATCACG GCGTCTCGCC CTTCGGCAAC GGCAAGGCCC GGGCCAATGC CTGGAATCTG CCCGACCCGG TGCCGGTGCA CCGCGAGCCG GTCTACTCGC CCCGTCCCGA ACTCGTGGCG AAGTACCCGA CCCGCCCCGA CGAGCGGCAA TTGCGCATGC CGAATATCGG CTTCTCGGTG CAGAAGTCGG TGGTCGATCG CGGCGTCGCC AAGGACTTCC CGATCATCCT CACCTCCGGC CGCCTCGTGG AATACGAGGG CGGCGGCGAG GAGACCCGTT CGAACCCGTG GCTCGCCGAG TTGCAGCAGG ACATGTTCGT CGAGATCAAC ACCGGCGATG CGGCCGCGCG CGGCATCAAG GACGGTCAGT GGGTCTGGGT GTCGGGGCCT GAGAACGGCG CCAAGACCAA GGTCAAGGCG CTGGTGACCG ACCGCGTCGG CAAGGGCGTG GCGTTCATGC CCTTCCATTT CTCCGGTTGG TACCAGGGCA AGGACATGCG CGACTTCTAC CCGAAGGGCA CCGACCCGGT GGTGCTCGGC GAGAGCGTGA ACACTGTGAC GACCTATGGC TTCGATCCTG TGACGGGCAT GCAGGAAACG AAGTGCACCC TGTGCCAGAT CGCAGCGGCG TAA
|
Protein sequence | MLIKRKSGDA ARTKHQAVAA GLAAGVLDRR AFLRKSGLTA GALAAAGTIQ LGSVRKAQAA GSSAVGPDTV IKKNVCTHCS VGCTVTAEVV NGVWVGQEPS YASPINRGTH CAKGAAIREL VSSDRRLKYP MKLEGGQWKR ISWDQAYEEI GDKLVQIREK NGADSVYWLG SAKFTNEASY LMRKFAALWG TNSIDHQARI CHSTTVAGVA NTWGYGAQTN SYNDIRNAKT MIILGGNPAE AHPISMQHVL SGKEINRANM IVIDPRFTRT AAHATEYVRI RSGTDIPVVW GILWHIFQNG WEDKEFIAQR VYAMDDVRKE VAKWTPDEVE RVSGVPGEQL KRVAEKFAKE KPATLIWCMG ATQHTVGTAN VRALCILCLA TGNVGKPGTG ANIFRGHTNV QGATDLGLDV TSLPLYYGLV EGGWRHWARV WDVEYDWLQS RFDEVPAKGG RKARTRKENM EAPGITSTRW FDAVNLPPEQ IDQRSPIKAF MVFGHGGNTV TRMPEAIDGM NKLELLVVAD PHPTTFAALD ARQDNTYLLP ICTSLEMDGS RTASNRSIQW GEQIVKPAFE SKSDYEVLYR LAQKLGFADK LCKNIKIVDG APEAEDILRE INRGGWSTGY CGQSPERLKA HMRNQHKFDL VTLRAPKDDP EVGGDYYGLP WPCWGKPELR HPGSAILYNT SLHVKDGGGG FRARFGTERN GQTLLAENSF SKGSDLTDGY PEFTFGVFKK LGWDKDLTPD ELATILKIGG EKPDTVSWAT DLSGGIQRVC LDHGVSPFGN GKARANAWNL PDPVPVHREP VYSPRPELVA KYPTRPDERQ LRMPNIGFSV QKSVVDRGVA KDFPIILTSG RLVEYEGGGE ETRSNPWLAE LQQDMFVEIN TGDAAARGIK DGQWVWVSGP ENGAKTKVKA LVTDRVGKGV AFMPFHFSGW YQGKDMRDFY PKGTDPVVLG ESVNTVTTYG FDPVTGMQET KCTLCQIAAA
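The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a minimal Python sketch, not the method the database uses. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, so no special handling is needed here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest), paired with
# the amino acid string shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    Alternative start codons are not converted to M; that is an assumption
    that holds for sequences beginning with ATG.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCTGATCA AGCGCAAGAG CGGCGACGCT"))  # prints: MLIKRKSGDA
```

The ten residues produced from the first 30 bases match the start of the protein sequence listed in this record.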