Gene M446_3087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3087 
Symbol 
ID6133478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3415013 
End bp3417976 
Gene Length2964 bp 
Protein Length987 aa 
Translation table11 
GC content69% 
IMG OID641643278 
Productmolybdopterin oxidoreductase 
Protein accessionYP_001769931 
Protein GI170741276 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0589388 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAAGA AGCGCAGAGG CGGCGCCGCG GGCCGCGCGA CCCCCTCCCC GGCCGCCGGC 
ACCCACCATT CCCTGACCGC AGGCGCTCTG CCAGAGACCC TGGATCGCCG CAGCTTCCTG
CGCCGCTCCG GCCTCGCCGC GGGCGGGCTC GCGGCCGCCG GGGGCCTCCA GCTCGGCGCC
GTGCGTCCCG CCCGGGCCGC CGGCTCGGCG GTGAGCGGGC CCGAGGTGAC GATCCGCAAG
AACATCTGCA CGCATTGCTC GGTCGGCTGC ACGGTGACGG CCGAGGTGGT GAACGGGGTC
TGGGTCGGCC AGGAACCCTC CTGGGCGAGC CCGATCAACC GCGGCACCCA CTGCGCCAAG
GGTGCGGCGA TCCGCGAACT GGTGCACGGC GAGCGCCGCC TGAAATACCC GATCAAGCTC
GAGAACGGCG AGTGGAAGCG GATCTCCTGG GAGCAGGCGA TCGACGAGAT CGGCGCCAAG
CTCCTCGCCA TCCGCGAGAA GAGCGGCGCG GATTCGGTCT ACTGGCTCGG CTCTGCCAAG
TTCACCAACG AAGCGGCCTA CCTGTTCCGC AAGTTCGGGG CGTTCTGGGG CACCAACAAC
GTCGACCATC AGGCCCGGAT CTGCCACTCG ACCACCGTGG CGGGCGTCGC CAACACCTGG
GGCTACGGCG CCCAGACCAA TTCCTACAAC GATATCCGCA ACGCCAAGAC CATGATCATC
ATGGGCGGCA ACCCGGCCGA GGCGCATCCG ATCTCGATGC AGCACGTGCT CTCGGGCAAG
GAGATCAACC GGGCGAACCT GATCGTCATC GACCCGCGCT TCACCCGCAC GGCCGCCCAC
GCCACCGAGT ACGTCCGCCT GCGCTCGGGC ACCGACATCC CGGTGATCTG GGGCATGCTC
TGGCACATCT TCCAGAACGG CTGGGAGGAC AAGGAGTTCA TCCGCAGCCG CGTCTACGCC
ATGGAGGATG TCCGCAAGGA GGTCGCCAAG TGGACCCCCG AGGAGGTCGA GCGGGTCTCC
GGCGTGCCGG GCGAGCAGCT GCGGCGCGTG GCCGAACTCT TCGCCAAGGA GAAGCCCGCG
ACCCTGATCT GGTGCATGGG CGCGACCCAG CACACGGTCG GCACCGCCAA CGTGCGGGCC
TTCTGCATCC TGTGCCTCGC GACCGGCAAC GTCGGCAAGC CCGGCACGGG CGCCAACATC
TTCCGCGGGC ACACCAACGT CCAGGGCGCG ACCGATCTCG GCCTCGATGT GACGACGCTG
CCGCTCTATT ACGGGCTGGT CGAGGGCGGC TGGCGGCACT GGGCCCGGGT CTGGGACGTC
GAGTACGAGT GGCTGCAATC GCGCTTCGAC GAGGTGCCGA CGCTGGCCGG GCGCAAGGCG
CGCTCGCGCA AGGAGAACAT GGAGGCGCCC GGGATCACCT CGACGCGCTG GTTCGACGCC
GTCACCTTGC CGGCCGACCA GGTCGACCAG CGGGCGCCCC TGCGGGCCAT GATGGTGTTC
GGCCACGGCG GCAACACCGT GACGCGGCTG CCCGAGGCCG TGAAGGGCAT GAGCGCCCTC
GACCTCCTGG TCGTGGCCGA CCCGCACCCG ACCACCTTCG CGGCGCTCGA CGCGCGGCGG
GACAACACCT ACCTCTTGCC GATCTGCACC TCGCTGGAGA TGGACGGGTC GCGCACGGCC
TCGAACCGCT CGCTGCAATG GGGCGAGCAG ATCGTGAAGC CCGCCTTCGA ATCGAAGAAC
GACTACGAGG TGATCTACCG GCTCGCCGCC AAGCTCGGCT TCGCCGACCG GATGTTCAAG
AACATCAAGG TAACGGACGG CGTGCCGGAG GCGGAGGACC TGCTGCGGGA GATCAACCGC
GGCGGCTGGT CGACGGGCTA TTGCGGGCAG AGCCCGGAGC GCCTGAAGGC GCACATGCGC
AACCAGCACA AGTTCGATCT CGTGACGCTG CGCGCGCCCA AGGACGATCC GGAGGTCGGC
GGCGACTATT ACGGCCTGCC CTGGCCGTGC TGGGGCAAGC CGGAGATCCG CCATCCCGGC
ACGCACATCC TCTACAACAC GGCCCTGCAC GTGAAGGACG GCGGCGGCAC CTTCCGGGCG
CGGTTCGGCA CCGAGCGCAA CGGCCAGACC CTGCTGGCCG AGGATTCCTT CTCCGTCGGC
TCCGACCTGA CAGACGGCTA CCCCGAATTC ACCATGGGTG TCTTCAAGAA GCTCGGCTGG
GACAGGGACC TGACGCCCGA GGAGATGGCC GTCATCACCC GGATCGGCGG CAATAACATC
GACGCGGTGA GCTGGGCCAC CGACCTGTCG GGCGGCATCC AGCGGGTCTG CCTGGAGCAC
GGCGTCACGC CCTTCGGCAA TGGCAAGGCG AGGGCGAATG CCTGGAACCT GCCCGACCCG
GTGCCGGTGC ACCGGGAGCC GATCTACTCG CCCCGGCCCG ACCTCGTCGC CAAGTACCCG
ACCCGGCCCG ACGAGCGCCA GTTCCGCATG CCGAACCTCG GCTTCTCGGT CCAGAAGGCG
GCGGTGGACC GCGCCGTCGC CAGGGACTTC CCGATCATCC TGTCGTCGGG CCGCCTCGTC
GAGTACGAGG GCGGCGGCGA GGAGACGCGC TCGAATCCCT GGCTCGCCGA GTTGCAGCAG
GACATGTTCG TGGAGGTGAA CACGCAGGAC GCGGCCGAGC GCGGCATCAG GGACGGGCAG
TTCGTCTGGG TCTACGGGCC GGAGAACGGC GCCAAGACCA AGGTGAAGGC GCTGGTGACC
GACCGGGTGG CGAAGGGCGT GGCCTGGATG CCGTTCCACT TCTCGGGCTG GTACCAGGGC
AAGGACATGC GCGCCTTCTA CCCGAAGGGC ACCGACCCCG TGGTGCTGGG CGAGAGCGTC
AACACGGTCA CGACCTACGG CTTCGACCCC GTCACGGGGA TGCAGGAGAC CAAGGTCACC
CTCTGCCAGA TCCAGGCCGC GTAA
 
Protein sequence
MLKKRRGGAA GRATPSPAAG THHSLTAGAL PETLDRRSFL RRSGLAAGGL AAAGGLQLGA 
VRPARAAGSA VSGPEVTIRK NICTHCSVGC TVTAEVVNGV WVGQEPSWAS PINRGTHCAK
GAAIRELVHG ERRLKYPIKL ENGEWKRISW EQAIDEIGAK LLAIREKSGA DSVYWLGSAK
FTNEAAYLFR KFGAFWGTNN VDHQARICHS TTVAGVANTW GYGAQTNSYN DIRNAKTMII
MGGNPAEAHP ISMQHVLSGK EINRANLIVI DPRFTRTAAH ATEYVRLRSG TDIPVIWGML
WHIFQNGWED KEFIRSRVYA MEDVRKEVAK WTPEEVERVS GVPGEQLRRV AELFAKEKPA
TLIWCMGATQ HTVGTANVRA FCILCLATGN VGKPGTGANI FRGHTNVQGA TDLGLDVTTL
PLYYGLVEGG WRHWARVWDV EYEWLQSRFD EVPTLAGRKA RSRKENMEAP GITSTRWFDA
VTLPADQVDQ RAPLRAMMVF GHGGNTVTRL PEAVKGMSAL DLLVVADPHP TTFAALDARR
DNTYLLPICT SLEMDGSRTA SNRSLQWGEQ IVKPAFESKN DYEVIYRLAA KLGFADRMFK
NIKVTDGVPE AEDLLREINR GGWSTGYCGQ SPERLKAHMR NQHKFDLVTL RAPKDDPEVG
GDYYGLPWPC WGKPEIRHPG THILYNTALH VKDGGGTFRA RFGTERNGQT LLAEDSFSVG
SDLTDGYPEF TMGVFKKLGW DRDLTPEEMA VITRIGGNNI DAVSWATDLS GGIQRVCLEH
GVTPFGNGKA RANAWNLPDP VPVHREPIYS PRPDLVAKYP TRPDERQFRM PNLGFSVQKA
AVDRAVARDF PIILSSGRLV EYEGGGEETR SNPWLAELQQ DMFVEVNTQD AAERGIRDGQ
FVWVYGPENG AKTKVKALVT DRVAKGVAWM PFHFSGWYQG KDMRAFYPKG TDPVVLGESV
NTVTTYGFDP VTGMQETKVT LCQIQAA