Gene Mext_1809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1809 
Symbol 
ID5831943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2031223 
End bp2033028 
Gene Length1806 bp 
Protein Length601 aa 
Translation table11 
GC content64% 
IMG OID641367608 
Productmethanol/ethanol family PQQ-dependent dehydrogenase 
Protein accessionYP_001639279 
Protein GI163851236 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4993] Glucose dehydrogenase 
TIGRFAM ID[TIGR03075] PQQ-dependent dehydrogenase, methanol/ethanol family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGCGG TACATCTCCT CGCACTCGGT GCGGGTCTCG CGGCTGCAAG CCCGGCCCTC 
GCCAACGAAA GCGTTCTGAA GGGCGTCGCC AACCCGGCGG AGCAGGTGCT CCAGACGGTC
GATTACGCCA ACACCCGCTA TTCCAAGCTC GACCAGATCA ACGCCAGCAA CGTCAAGAAC
CTCCAGGTTG CCTGGACCTT CTCGACCGGC GTGCTGCGCG GCCACGAGGG CTCCCCGCTC
GTCGTCGGCA ACATCATGTA CGTCCACACC CCCTTCCCGA ACATCGTCTA CGCGCTGGAC
CTCGACCAGG GCGCCAAGAT CGTGTGGAAG TACGAGCCGA AGCAGGATCC GTCCGTGATC
CCGGTCATGT GCTGTGACAC GGTCAACCGT GGTCTGGCCT ACGCCGACGG CGCGATCCTC
CTGCACCAGG CCGACACCAC GCTCGTCTCG CTCGACGCCA AGTCCGGCAA GGTGAACTGG
TCGGTCAAGA ACGGCGACCC GTCCAAGGGT GAGACCAACA CCGCCACCGT TCTCCCGGTG
AAGGACAAGG TCATCGTCGG CATCTCCGGC GGCGAGTTCG GCGTGCAGTG CCACGTCACC
GCCTACGACC TGAAGTCCGG CAAGAAGGTG TGGCGCGGCT ACTCGATCGG CCCGGACGAT
CAGCTGATCG TCGACCCCGA GAAGACCACC TCGCTCGGCA AGCCGATCGG CAAGGACTCC
TCGCTGAAGA CCTGGGAAGG CGATCAGTGG AAGACCGGCG GCGGCTGCAC CTGGGGCTGG
TTCTCCTACG ATCCCAAGCT CGACCTGATG TATTACGGCT CGGGCAACCC CTCCACCTGG
AACCCCAAGC AGCGTCCGGG CGACAACAAG TGGTCGATGA CCATTTGGGC GCGTAACCCC
GACACCGGCA TGGCCAAGTG GGTCTACCAG ATGACCCCCC ACGACGAGTG GGACTTCGAC
GGCATCAACG AGATGATCCT CACGGATCAG AAGTTCGACG GCAAGGACCG TCCGCTGCTG
ACGCACTTCG ATCGTAACGG CTTCGGCTAC ACGCTCGACC GCGCCACCGG TGAAGTGCTC
GTCGCCGAGA AGTTCGATCC GGTTGTGAAC TGGGCCACCA AGGTCGACCT GGACAAGGGT
TCCAAGACCT ACGGCCGTCC GCTGGTCGTG TCGAAGTACT CGACCGAGCA GAACGGTGAA
GACGTGAACT CGAAGGGCAT CTGCCCGGCG GCTCTCGGCA CCAAGGACCA GCAGCCGGCG
GCCTTTTCGC CCAAGACCGG CCTGTTCTAC GTGCCCACCA ACCACGTCTG CATGGACTAC
GAGCCGTTCC GGGTGACCTA CACCCCGGGC CAGCCCTACG TCGGTGCGAC CCTCTCCATG
TACCCGGCTC CGGGCTCGCA TGGCGGCATG GGCAACTTCA TCGCCTGGGA CAACCTCCAG
GGTAAGATCA AGTGGTCCAA CCCCGAGCAG TTCTCGGCTT GGGGCGGCGC GCTCGCCACT
GCCGGTGACG TGGTGTTCTA CGGCACGCTC GAAGGCTTCC TGAAGGCCGT CGACTCGAAG
ACGGGTAAGG AACTGTACAA GTTCAAGACC CCGTCGGGCA TCATCGGCAA CGTGATGACC
TACGAGCACA AGGGCAAGCA GCACGTCGCC GTCCTCTCCG GCGTCGGCGG CTGGGCCGGC
ATCGGCCTCG CGGCCGGCCT GACCGACCCG AACGCCGGTC TCGGCGCGGT GGGTGGCTAT
GCGGCCCTGT CGAGCTACAC CAACCTCGGT GGCCAGCTCA CGGTCTTCTC GCTGCCGAAC
AACTAA
 
Protein sequence
MRAVHLLALG AGLAAASPAL ANESVLKGVA NPAEQVLQTV DYANTRYSKL DQINASNVKN 
LQVAWTFSTG VLRGHEGSPL VVGNIMYVHT PFPNIVYALD LDQGAKIVWK YEPKQDPSVI
PVMCCDTVNR GLAYADGAIL LHQADTTLVS LDAKSGKVNW SVKNGDPSKG ETNTATVLPV
KDKVIVGISG GEFGVQCHVT AYDLKSGKKV WRGYSIGPDD QLIVDPEKTT SLGKPIGKDS
SLKTWEGDQW KTGGGCTWGW FSYDPKLDLM YYGSGNPSTW NPKQRPGDNK WSMTIWARNP
DTGMAKWVYQ MTPHDEWDFD GINEMILTDQ KFDGKDRPLL THFDRNGFGY TLDRATGEVL
VAEKFDPVVN WATKVDLDKG SKTYGRPLVV SKYSTEQNGE DVNSKGICPA ALGTKDQQPA
AFSPKTGLFY VPTNHVCMDY EPFRVTYTPG QPYVGATLSM YPAPGSHGGM GNFIAWDNLQ
GKIKWSNPEQ FSAWGGALAT AGDVVFYGTL EGFLKAVDSK TGKELYKFKT PSGIIGNVMT
YEHKGKQHVA VLSGVGGWAG IGLAAGLTDP NAGLGAVGGY AALSSYTNLG GQLTVFSLPN
N