Gene Mext_0389 details


Gene Information

Locus tag: Mext_0389
Symbol:
ID: 5835634
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 429790
End bp: 432732
Gene Length: 2943 bp
Protein Length: 980 aa
Translation table: 11
GC content: 67%
IMG OID: 641366173
Product: molybdopterin oxidoreductase
Protein accession: YP_001637882
Protein GI: 163849839
COG category: [C] Energy production and conversion
              [R] General function prediction only
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
        [COG3383] Uncharacterized anaerobic dehydrogenase
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
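
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(429790, 432732)
print(length, protein_length_aa(length))  # prints "2943 980"
```

Both results agree with the Gene Length and Protein Length fields of this record.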


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGATCA AGCGCAAGAG CGGCGACGCT GCTCGCACCA AGCACCAGGC CGTCGCCGCC 
GGCCTCGCCG CGGGCGTGCT CGACCGCCGC GCCTTCCTCC GCAAGTCCGG GCTGACCGCC
GGCGCGCTCG CCGCGGCCGG CACGATCCAG CTCGGCTCGG TGCGCAAGGC GCAGGCCGCC
GGCTCCTCCG CCGTCGGGCC GGACACGGTC ATCAAGAAGA ACGTCTGCAC CCACTGCTCG
GTGGGCTGCA CGGTGACGGC CGAAGTCGTG AACGGCGTCT GGGTCGGGCA GGAGCCGTCC
TATGCCAGCC CGATCAACCG CGGCACGCAC TGCGCCAAGG GCGCGGCGAT CCGCGAGCTC
GTCTCGTCCG ACCGCCGCCT CAAGTACCCG ATGAAGCTGG AAGGCGGGCA GTGGAAGCGG
ATCTCGTGGG ATCAGGCCTA CGAGGAGATC GGCGACAAGC TGGTCCAGAT CCGCGAGAAG
AACGGCGCCG ACTCGGTCTA CTGGCTGGGT TCGGCCAAGT TCACCAACGA AGCCTCCTAC
CTGATGCGCA AGTTCGCGGC CCTGTGGGGC ACGAACTCAA TCGACCATCA GGCGCGCATC
TGCCACTCGA CCACGGTGGC GGGCGTGGCC AACACCTGGG GCTACGGCGC CCAGACGAAC
TCCTACAACG ATATCCGCAA CGCCAAGACG ATGATCATCC TCGGCGGCAA CCCCGCTGAG
GCGCACCCGA TCTCCATGCA GCACGTGCTG TCGGGCAAGG AGATCAACCG CGCGAACATG
ATCGTCATCG ATCCGCGCTT CACCCGCACC GCGGCGCACG CCACCGAATA CGTGCGCATC
CGCTCCGGCA CCGACATTCC GGTGGTCTGG GGCATCCTCT GGCACATCTT CCAGAATGGC
TGGGAGGACA AGGAGTTCAT CGCCCAGCGC GTCTACGCGA TGGACGACGT GCGCAAGGAA
GTCGCCAAGT GGACGCCCGA CGAGGTCGAG CGCGTCTCCG GCGTGCCGGG CGAGCAGCTC
AAGCGCGTGG CGGAGAAGTT CGCCAAGGAG AAGCCGGCGA CGCTGATCTG GTGCATGGGC
GCGACCCAGC ACACGGTCGG CACCGCCAAC GTGCGCGCGC TGTGCATCCT GTGCCTGGCC
ACAGGCAATG TCGGCAAGCC GGGCACGGGC GCCAACATCT TCCGCGGCCA CACCAACGTG
CAGGGCGCGA CCGATCTCGG CCTCGATGTG ACGTCGCTGC CGCTCTATTA CGGCCTCGTC
GAGGGCGGCT GGCGCCACTG GGCCCGCGTC TGGGACGTGG AGTACGAGTG GCTGCAATCG
CGCTTCGATG AGGTTCCGGC GAAGGGCGGC CGCAAGGCGC GCACCCGCAA GGAAAACATG
GAGGCGCCCG GCATCACCTC GACCCGGTGG TTCGATGCCG TGAACCTGCC GCCGGAGCAG
ATCGACCAGC GCAGCCCGAT CAAAGCGTTC ATGGTGTTCG GCCACGGCGG CAACACCGTG
ACCCGCATGC CCGAAGCCAT CGACGGCATG AACAAGCTCG AGCTGCTGGT CGTCGCCGAC
CCGCACCCGA CCACCTTCGC GGCGCTCGAT GCCCGGGAGG ACAACACCTA CCTCCTGCCG
ATCTGCACCT CGCTGGAGAT GGACGGCTCG CGCACGGCCT CGAACCGCTC GATCCAGTGG
GGCGAGCAGA TCGTGAAGCC GGCCTTCGAG TCGAAGAGCG ACTACGAAGT CCTTTACCGC
CTCGCGCAGA AGCTCGGCTT CGCCGACAAG CTCTGCAAGA ACATCAAGAT CGTCGATGGC
GCCCCCGAAG CGGAAGACAT CCTGCGGGAG ATCAATCGTG GCGGCTGGTC GACCGGCTAT
TGCGGCCAGT CGCCGGAGCG GCTGAAGGCG CATATGCGCA ACCAGCACAA GTTCGACCTT
GTGACCTTGC GCGCGCCCAA GGACGATCCG GAGGTCGGCG GCGACTATTA CGGTCTGCCC
TGGCCGTGCT GGGGCAAGCC GGAGCTGCGC CATCCGGGCT CGGCCATCCT GTACAACACC
AGTCTCCACG TGAAGGACGG CGGCGGCGGC TTCCGCGCGC GGTTCGGCAC CGAGCGCAAC
GGCCAGACCC TGCTCGCCGA GAATTCCTTC TCGAAGGGCT CGGACCTGAC CGACGGCTAC
CCCGAATTCA CCTTCGGGGT GTTCAAGAAG CTCGGCTGGG ACAAGGACCT GACGCCGGAC
GAACTCGCCA CGATCCTCAA GATCGGCGGT GAGAAGCCCG ACACCGTGAG CTGGGCGACC
GACCTCTCGG GCGGCATCCA GCGCGTCTGC CTCGACCACG GCGTCTCGCC CTTCGGCAAC
GGCAAGGCCC GGGCCAATGC CTGGAACCTG CCCGACCCGG TGCCTGTCCA CCGCGAGCCG
GTCTACTCGC CCCGTCCCGA ACTCGTGGCG AAGTACCCAA CCCGCCCCGA CGAGCGGCAA
TTGCGCATGC CGAATATCGG CTTCTCGGTG CAGAAGTCGG TGGTCGATCG CGGCGTCGCC
AAGGACTTCC CGATCATCCT CACCTCCGGT CGCCTCGTGG AATACGAGGG CGGCGGCGAG
GAGACCCGGT CGAACCCGTG GCTCGCCGAG CTGCAGCAGG ACATGTTCGT CGAGATCAAC
ACCGGCGATG CCGCCGCGCG CGGCATCAAG GACGGTCAGT GGGTCTGGGT GTCGGGCCCT
GAGAACGGCG CCAAGACCAA GGTTAAGGCG CTGGTGACCG ACCGCGTCGG CAAGGGCGTG
GCGTTCATGC CCTTCCACTT CTCCGGTTGG TACCAGGGCA AGGACATGCG CGACTTCTAC
CCGAAGGGCA CCGACCCGGT GGTGCTCGGC GAGAGCGTGA ACACTGTGAC GACCTACGGC
TTCGATCCTG TGACGGGCAT GCAGGAAACG AAGTGCACCC TGTGCCAGAT CGCAGCGGCG
TAA
 
Protein sequence
MLIKRKSGDA ARTKHQAVAA GLAAGVLDRR AFLRKSGLTA GALAAAGTIQ LGSVRKAQAA 
GSSAVGPDTV IKKNVCTHCS VGCTVTAEVV NGVWVGQEPS YASPINRGTH CAKGAAIREL
VSSDRRLKYP MKLEGGQWKR ISWDQAYEEI GDKLVQIREK NGADSVYWLG SAKFTNEASY
LMRKFAALWG TNSIDHQARI CHSTTVAGVA NTWGYGAQTN SYNDIRNAKT MIILGGNPAE
AHPISMQHVL SGKEINRANM IVIDPRFTRT AAHATEYVRI RSGTDIPVVW GILWHIFQNG
WEDKEFIAQR VYAMDDVRKE VAKWTPDEVE RVSGVPGEQL KRVAEKFAKE KPATLIWCMG
ATQHTVGTAN VRALCILCLA TGNVGKPGTG ANIFRGHTNV QGATDLGLDV TSLPLYYGLV
EGGWRHWARV WDVEYEWLQS RFDEVPAKGG RKARTRKENM EAPGITSTRW FDAVNLPPEQ
IDQRSPIKAF MVFGHGGNTV TRMPEAIDGM NKLELLVVAD PHPTTFAALD AREDNTYLLP
ICTSLEMDGS RTASNRSIQW GEQIVKPAFE SKSDYEVLYR LAQKLGFADK LCKNIKIVDG
APEAEDILRE INRGGWSTGY CGQSPERLKA HMRNQHKFDL VTLRAPKDDP EVGGDYYGLP
WPCWGKPELR HPGSAILYNT SLHVKDGGGG FRARFGTERN GQTLLAENSF SKGSDLTDGY
PEFTFGVFKK LGWDKDLTPD ELATILKIGG EKPDTVSWAT DLSGGIQRVC LDHGVSPFGN
GKARANAWNL PDPVPVHREP VYSPRPELVA KYPTRPDERQ LRMPNIGFSV QKSVVDRGVA
KDFPIILTSG RLVEYEGGGE ETRSNPWLAE LQQDMFVEIN TGDAAARGIK DGQWVWVSGP
ENGAKTKVKA LVTDRVGKGV AFMPFHFSGW YQGKDMRDFY PKGTDPVVLG ESVNTVTTYG
FDPVTGMQET KCTLCQIAAA
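
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a listing could be cleaned up, checked for GC content, and translated. It assumes the codon assignments of the standard genetic code, which translation table 11 shares for elongation (table 11 differs in the set of permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, also used by table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return sum(1 for base in s if base in "GC") / len(s) if s else 0.0


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGCTGATCA AGCGCAAGAG CGGCGACGCT"))  # prints "MLIKRKSGDA"
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.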