Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_1809 |
Symbol | |
ID | 5831943 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2031223 |
End bp | 2033028 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641367608 |
Product | methanol/ethanol family PQQ-dependent dehydrogenase |
Protein accession | YP_001639279 |
Protein GI | 163851236 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4993] Glucose dehydrogenase |
TIGRFAM ID | [TIGR03075] PQQ-dependent dehydrogenase, methanol/ethanol family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGCGG TACATCTCCT CGCACTCGGT GCGGGTCTCG CGGCTGCAAG CCCGGCCCTC GCCAACGAAA GCGTTCTGAA GGGCGTCGCC AACCCGGCGG AGCAGGTGCT CCAGACGGTC GATTACGCCA ACACCCGCTA TTCCAAGCTC GACCAGATCA ACGCCAGCAA CGTCAAGAAC CTCCAGGTTG CCTGGACCTT CTCGACCGGC GTGCTGCGCG GCCACGAGGG CTCCCCGCTC GTCGTCGGCA ACATCATGTA CGTCCACACC CCCTTCCCGA ACATCGTCTA CGCGCTGGAC CTCGACCAGG GCGCCAAGAT CGTGTGGAAG TACGAGCCGA AGCAGGATCC GTCCGTGATC CCGGTCATGT GCTGTGACAC GGTCAACCGT GGTCTGGCCT ACGCCGACGG CGCGATCCTC CTGCACCAGG CCGACACCAC GCTCGTCTCG CTCGACGCCA AGTCCGGCAA GGTGAACTGG TCGGTCAAGA ACGGCGACCC GTCCAAGGGT GAGACCAACA CCGCCACCGT TCTCCCGGTG AAGGACAAGG TCATCGTCGG CATCTCCGGC GGCGAGTTCG GCGTGCAGTG CCACGTCACC GCCTACGACC TGAAGTCCGG CAAGAAGGTG TGGCGCGGCT ACTCGATCGG CCCGGACGAT CAGCTGATCG TCGACCCCGA GAAGACCACC TCGCTCGGCA AGCCGATCGG CAAGGACTCC TCGCTGAAGA CCTGGGAAGG CGATCAGTGG AAGACCGGCG GCGGCTGCAC CTGGGGCTGG TTCTCCTACG ATCCCAAGCT CGACCTGATG TATTACGGCT CGGGCAACCC CTCCACCTGG AACCCCAAGC AGCGTCCGGG CGACAACAAG TGGTCGATGA CCATTTGGGC GCGTAACCCC GACACCGGCA TGGCCAAGTG GGTCTACCAG ATGACCCCCC ACGACGAGTG GGACTTCGAC GGCATCAACG AGATGATCCT CACGGATCAG AAGTTCGACG GCAAGGACCG TCCGCTGCTG ACGCACTTCG ATCGTAACGG CTTCGGCTAC ACGCTCGACC GCGCCACCGG TGAAGTGCTC GTCGCCGAGA AGTTCGATCC GGTTGTGAAC TGGGCCACCA AGGTCGACCT GGACAAGGGT TCCAAGACCT ACGGCCGTCC GCTGGTCGTG TCGAAGTACT CGACCGAGCA GAACGGTGAA GACGTGAACT CGAAGGGCAT CTGCCCGGCG GCTCTCGGCA CCAAGGACCA GCAGCCGGCG GCCTTTTCGC CCAAGACCGG CCTGTTCTAC GTGCCCACCA ACCACGTCTG CATGGACTAC GAGCCGTTCC GGGTGACCTA CACCCCGGGC CAGCCCTACG TCGGTGCGAC CCTCTCCATG TACCCGGCTC CGGGCTCGCA TGGCGGCATG GGCAACTTCA TCGCCTGGGA CAACCTCCAG GGTAAGATCA AGTGGTCCAA CCCCGAGCAG TTCTCGGCTT GGGGCGGCGC GCTCGCCACT GCCGGTGACG TGGTGTTCTA CGGCACGCTC GAAGGCTTCC TGAAGGCCGT CGACTCGAAG ACGGGTAAGG AACTGTACAA GTTCAAGACC CCGTCGGGCA TCATCGGCAA CGTGATGACC TACGAGCACA AGGGCAAGCA GCACGTCGCC GTCCTCTCCG GCGTCGGCGG CTGGGCCGGC ATCGGCCTCG CGGCCGGCCT GACCGACCCG AACGCCGGTC TCGGCGCGGT GGGTGGCTAT GCGGCCCTGT CGAGCTACAC CAACCTCGGT GGCCAGCTCA CGGTCTTCTC GCTGCCGAAC AACTAA
|
Protein sequence | MRAVHLLALG AGLAAASPAL ANESVLKGVA NPAEQVLQTV DYANTRYSKL DQINASNVKN LQVAWTFSTG VLRGHEGSPL VVGNIMYVHT PFPNIVYALD LDQGAKIVWK YEPKQDPSVI PVMCCDTVNR GLAYADGAIL LHQADTTLVS LDAKSGKVNW SVKNGDPSKG ETNTATVLPV KDKVIVGISG GEFGVQCHVT AYDLKSGKKV WRGYSIGPDD QLIVDPEKTT SLGKPIGKDS SLKTWEGDQW KTGGGCTWGW FSYDPKLDLM YYGSGNPSTW NPKQRPGDNK WSMTIWARNP DTGMAKWVYQ MTPHDEWDFD GINEMILTDQ KFDGKDRPLL THFDRNGFGY TLDRATGEVL VAEKFDPVVN WATKVDLDKG SKTYGRPLVV SKYSTEQNGE DVNSKGICPA ALGTKDQQPA AFSPKTGLFY VPTNHVCMDY EPFRVTYTPG QPYVGATLSM YPAPGSHGGM GNFIAWDNLQ GKIKWSNPEQ FSAWGGALAT AGDVVFYGTL EGFLKAVDSK TGKELYKFKT PSGIIGNVMT YEHKGKQHVA VLSGVGGWAG IGLAAGLTDP NAGLGAVGGY AALSSYTNLG GQLTVFSLPN N
|
| |