Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_0389 |
Symbol | |
ID | 5835634 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 429790 |
End bp | 432732 |
Gene Length | 2943 bp |
Protein Length | 980 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641366173 |
Product | molybdopterin oxidoreductase |
Protein accession | YP_001637882 |
Protein GI | 163849839 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGATCA AGCGCAAGAG CGGCGACGCT GCTCGCACCA AGCACCAGGC CGTCGCCGCC GGCCTCGCCG CGGGCGTGCT CGACCGCCGC GCCTTCCTCC GCAAGTCCGG GCTGACCGCC GGCGCGCTCG CCGCGGCCGG CACGATCCAG CTCGGCTCGG TGCGCAAGGC GCAGGCCGCC GGCTCCTCCG CCGTCGGGCC GGACACGGTC ATCAAGAAGA ACGTCTGCAC CCACTGCTCG GTGGGCTGCA CGGTGACGGC CGAAGTCGTG AACGGCGTCT GGGTCGGGCA GGAGCCGTCC TATGCCAGCC CGATCAACCG CGGCACGCAC TGCGCCAAGG GCGCGGCGAT CCGCGAGCTC GTCTCGTCCG ACCGCCGCCT CAAGTACCCG ATGAAGCTGG AAGGCGGGCA GTGGAAGCGG ATCTCGTGGG ATCAGGCCTA CGAGGAGATC GGCGACAAGC TGGTCCAGAT CCGCGAGAAG AACGGCGCCG ACTCGGTCTA CTGGCTGGGT TCGGCCAAGT TCACCAACGA AGCCTCCTAC CTGATGCGCA AGTTCGCGGC CCTGTGGGGC ACGAACTCAA TCGACCATCA GGCGCGCATC TGCCACTCGA CCACGGTGGC GGGCGTGGCC AACACCTGGG GCTACGGCGC CCAGACGAAC TCCTACAACG ATATCCGCAA CGCCAAGACG ATGATCATCC TCGGCGGCAA CCCCGCTGAG GCGCACCCGA TCTCCATGCA GCACGTGCTG TCGGGCAAGG AGATCAACCG CGCGAACATG ATCGTCATCG ATCCGCGCTT CACCCGCACC GCGGCGCACG CCACCGAATA CGTGCGCATC CGCTCCGGCA CCGACATTCC GGTGGTCTGG GGCATCCTCT GGCACATCTT CCAGAATGGC TGGGAGGACA AGGAGTTCAT CGCCCAGCGC GTCTACGCGA TGGACGACGT GCGCAAGGAA GTCGCCAAGT GGACGCCCGA CGAGGTCGAG CGCGTCTCCG GCGTGCCGGG CGAGCAGCTC AAGCGCGTGG CGGAGAAGTT CGCCAAGGAG AAGCCGGCGA CGCTGATCTG GTGCATGGGC GCGACCCAGC ACACGGTCGG CACCGCCAAC GTGCGCGCGC TGTGCATCCT GTGCCTGGCC ACAGGCAATG TCGGCAAGCC GGGCACGGGC GCCAACATCT TCCGCGGCCA CACCAACGTG CAGGGCGCGA CCGATCTCGG CCTCGATGTG ACGTCGCTGC CGCTCTATTA CGGCCTCGTC GAGGGCGGCT GGCGCCACTG GGCCCGCGTC TGGGACGTGG AGTACGAGTG GCTGCAATCG CGCTTCGATG AGGTTCCGGC GAAGGGCGGC CGCAAGGCGC GCACCCGCAA GGAAAACATG GAGGCGCCCG GCATCACCTC GACCCGGTGG TTCGATGCCG TGAACCTGCC GCCGGAGCAG ATCGACCAGC GCAGCCCGAT CAAAGCGTTC ATGGTGTTCG GCCACGGCGG CAACACCGTG ACCCGCATGC CCGAAGCCAT CGACGGCATG AACAAGCTCG AGCTGCTGGT CGTCGCCGAC CCGCACCCGA CCACCTTCGC GGCGCTCGAT GCCCGGGAGG ACAACACCTA CCTCCTGCCG ATCTGCACCT CGCTGGAGAT GGACGGCTCG CGCACGGCCT CGAACCGCTC GATCCAGTGG GGCGAGCAGA TCGTGAAGCC GGCCTTCGAG TCGAAGAGCG ACTACGAAGT CCTTTACCGC CTCGCGCAGA AGCTCGGCTT CGCCGACAAG CTCTGCAAGA ACATCAAGAT CGTCGATGGC GCCCCCGAAG CGGAAGACAT CCTGCGGGAG ATCAATCGTG GCGGCTGGTC GACCGGCTAT TGCGGCCAGT CGCCGGAGCG GCTGAAGGCG CATATGCGCA ACCAGCACAA GTTCGACCTT GTGACCTTGC GCGCGCCCAA GGACGATCCG GAGGTCGGCG GCGACTATTA CGGTCTGCCC TGGCCGTGCT GGGGCAAGCC GGAGCTGCGC CATCCGGGCT CGGCCATCCT GTACAACACC AGTCTCCACG TGAAGGACGG CGGCGGCGGC TTCCGCGCGC GGTTCGGCAC CGAGCGCAAC GGCCAGACCC TGCTCGCCGA GAATTCCTTC TCGAAGGGCT CGGACCTGAC CGACGGCTAC CCCGAATTCA CCTTCGGGGT GTTCAAGAAG CTCGGCTGGG ACAAGGACCT GACGCCGGAC GAACTCGCCA CGATCCTCAA GATCGGCGGT GAGAAGCCCG ACACCGTGAG CTGGGCGACC GACCTCTCGG GCGGCATCCA GCGCGTCTGC CTCGACCACG GCGTCTCGCC CTTCGGCAAC GGCAAGGCCC GGGCCAATGC CTGGAACCTG CCCGACCCGG TGCCTGTCCA CCGCGAGCCG GTCTACTCGC CCCGTCCCGA ACTCGTGGCG AAGTACCCAA CCCGCCCCGA CGAGCGGCAA TTGCGCATGC CGAATATCGG CTTCTCGGTG CAGAAGTCGG TGGTCGATCG CGGCGTCGCC AAGGACTTCC CGATCATCCT CACCTCCGGT CGCCTCGTGG AATACGAGGG CGGCGGCGAG GAGACCCGGT CGAACCCGTG GCTCGCCGAG CTGCAGCAGG ACATGTTCGT CGAGATCAAC ACCGGCGATG CCGCCGCGCG CGGCATCAAG GACGGTCAGT GGGTCTGGGT GTCGGGCCCT GAGAACGGCG CCAAGACCAA GGTTAAGGCG CTGGTGACCG ACCGCGTCGG CAAGGGCGTG GCGTTCATGC CCTTCCACTT CTCCGGTTGG TACCAGGGCA AGGACATGCG CGACTTCTAC CCGAAGGGCA CCGACCCGGT GGTGCTCGGC GAGAGCGTGA ACACTGTGAC GACCTACGGC TTCGATCCTG TGACGGGCAT GCAGGAAACG AAGTGCACCC TGTGCCAGAT CGCAGCGGCG TAA
|
Protein sequence | MLIKRKSGDA ARTKHQAVAA GLAAGVLDRR AFLRKSGLTA GALAAAGTIQ LGSVRKAQAA GSSAVGPDTV IKKNVCTHCS VGCTVTAEVV NGVWVGQEPS YASPINRGTH CAKGAAIREL VSSDRRLKYP MKLEGGQWKR ISWDQAYEEI GDKLVQIREK NGADSVYWLG SAKFTNEASY LMRKFAALWG TNSIDHQARI CHSTTVAGVA NTWGYGAQTN SYNDIRNAKT MIILGGNPAE AHPISMQHVL SGKEINRANM IVIDPRFTRT AAHATEYVRI RSGTDIPVVW GILWHIFQNG WEDKEFIAQR VYAMDDVRKE VAKWTPDEVE RVSGVPGEQL KRVAEKFAKE KPATLIWCMG ATQHTVGTAN VRALCILCLA TGNVGKPGTG ANIFRGHTNV QGATDLGLDV TSLPLYYGLV EGGWRHWARV WDVEYEWLQS RFDEVPAKGG RKARTRKENM EAPGITSTRW FDAVNLPPEQ IDQRSPIKAF MVFGHGGNTV TRMPEAIDGM NKLELLVVAD PHPTTFAALD AREDNTYLLP ICTSLEMDGS RTASNRSIQW GEQIVKPAFE SKSDYEVLYR LAQKLGFADK LCKNIKIVDG APEAEDILRE INRGGWSTGY CGQSPERLKA HMRNQHKFDL VTLRAPKDDP EVGGDYYGLP WPCWGKPELR HPGSAILYNT SLHVKDGGGG FRARFGTERN GQTLLAENSF SKGSDLTDGY PEFTFGVFKK LGWDKDLTPD ELATILKIGG EKPDTVSWAT DLSGGIQRVC LDHGVSPFGN GKARANAWNL PDPVPVHREP VYSPRPELVA KYPTRPDERQ LRMPNIGFSV QKSVVDRGVA KDFPIILTSG RLVEYEGGGE ETRSNPWLAE LQQDMFVEIN TGDAAARGIK DGQWVWVSGP ENGAKTKVKA LVTDRVGKGV AFMPFHFSGW YQGKDMRDFY PKGTDPVVLG ESVNTVTTYG FDPVTGMQET KCTLCQIAAA
|
| |