Gene Information |
Locus tag | Mext_4150 |
Symbol | |
ID | 5832505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 4617081 |
End bp | 4618961 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641369940 |
Product | methanol/ethanol family PQQ-dependent dehydrogenase |
Protein accession | YP_001641590 |
Protein GI | 163853547 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4993] Glucose dehydrogenase |
TIGRFAM ID | [TIGR03075] PQQ-dependent dehydrogenase, methanol/ethanol family |
|
|
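The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the Start bp and End bp values are 1-based inclusive positions on the replicon and that the final codon of the unspliced CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Mext_4150.
length = gene_length(4617081, 4618961)
print(length)                  # 1881
print(protein_length(length))  # 626
```

Both results agree with the Gene Length (1881 bp) and Protein Length (626 aa) rows of the record.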
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.383489 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.812853 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGGT TTGTGACATC AGTCTCGGCC TTGGCGATGC TGGCGCTCGC GCCGGCCGCG CTGTCGAGCG GGGCCTACGC CAACGATAAG CTGGTCGAGC TGTCGAAGAG CGACGACAAC TGGGTGATGC CCGGCAAGAA CTACGATTCG AACAACTTCA GCGACCTGAA GCAGATCAAC AAGGGCAACG TGAAGCAGCT TCGGCCGGCT TGGACGTTCT CGACCGGCTT GCTGAACGGC CACGAGGGTG CGCCGCTCGT CGTCGACGGC AAGATGTACA TCCACACCTC GTTCCCGAAC AACACCTTCG CTCTCGGCCT CGACGATCCG GGCACGATCC TGTGGCAGGA CAAGCCGAAG CAGAATCCGG CCGCCCGCGC CGTCGCCTGC TGTGACCTCG TCAACCGCGG CCTCGCCTAC TGGCCCGGCG ACGGCAAGAC CCCCGCGCTG ATCCTCAAGA CCCAGCTCGA CGGCAACGTG GCCGCCCTCA ACGCCGAGAC CGGCGAGACG GTGTGGAAGG TCGAGAACTC CGACATCAAG GTCGGCTCGA CGCTCACGAT CGCCCCCTAT GTCGTCAAGG ACAAGGTCAT CATCGGTTCC TCGGGCGCCG AACTCGGCGT GCGCGGCTAC CTGACCGCCT ACGACGTGAA GACCGGCGAG CAGGTGTGGC GCGCCTACGC CACGGGTCCG GACAAGGACC TGCTGCTGGC CTCCGACTTC AACATCAAGA ACCCCCATTA CGGCCAGAAG GGCCTCGGCA CCGGCACCTG GGAGGGCGAT GCCTGGAAGA TCGGCGGCGG CACCAACTGG GGCTGGTACG CCTACGATCC GGGCACGAAC CTGATCTACT TCGGCACCGG CAACCCGGCG CCGTGGAACG AGACCATGCG TCCGGGCGAC AACAAGTGGA CGATGACGAT CTTCGGCCGC GATGCCGACA CGGGTGAAGC CAAGTTCGGC TACCAGAAGA CCCCGCACGA CGAGTGGGAC TATGCCGGCG TCAACGTCAT GATGCTCTCC GAGCAGAAGG ACAAGGACGG CAAGGCCCGC AAGCTGCTGA CCCACCCGGA CCGCAACGGC ATCGTCTACA CGCTCGACCG GACCGACGGC GCGCTCGTCT CGGCGAACAA GCTCGACGAC ACGGTCAACG TGTTCAAGTC GGTGGATCTC AAGACCGGCC AGCCGGTGCG CGATCCCGAA TACGGCACCC GGATGGACCA CCTCGCCAAG GACATCTGCC CCTCGGCGAT GGGTTACCAC AACCAGGGTC ACGACTCGTA CGATCCGAAG CGTGAACTGT TCTTCATGGG CATCAACCAC ATCTGCATGG ATTGGGAGCC CTTCATGCTT CCCTATCGTG CGGGTCAGTT CTTCGTCGGC GCGACGCTGA ACATGTATCC GGGCCCGAAG GGCGACCGTC AGAACTACGA AGGTCTCGGC CAGATCAAGG CGTACAACGC GATCACCGGC GACTATAAGT GGGAGAAGAT GGAGCGCTTC GCCGTGTGGG GCGGCACCAT GGCCACCGCA GGCGATCTCG TCTTCTACGG CACGCTCGAC GGCTACCTGA AGGCGCGCGA CTCCGACACG GGTGATCTTC TCTGGAAGTT CAAGATCCCG TCCGGCGCCA TCGGCTACCC GATGACCTAC ACCCACAAGG GCACGCAATA CGTCGCCATC TACTACGGCG TCGGCGGCTG GCCGGGTGTC GGCCTCGTGT TCGACCTCGC CGACCCGACC GCCGGTCTCG GCGCGGTGGG CGCCTTCAAG AAGCTCGCCA ACTACACCCA GATGGGTGGC 
GGCGTGGTGG TGTTCTCGCT CGACGGCAAG GGTCCCTACG ACGATCCGAA CGTCGGCGAG TGGAAGTCAG CCGCCAAGTA A
|
Protein sequence | MSRFVTSVSA LAMLALAPAA LSSGAYANDK LVELSKSDDN WVMPGKNYDS NNFSDLKQIN KGNVKQLRPA WTFSTGLLNG HEGAPLVVDG KMYIHTSFPN NTFALGLDDP GTILWQDKPK QNPAARAVAC CDLVNRGLAY WPGDGKTPAL ILKTQLDGNV AALNAETGET VWKVENSDIK VGSTLTIAPY VVKDKVIIGS SGAELGVRGY LTAYDVKTGE QVWRAYATGP DKDLLLASDF NIKNPHYGQK GLGTGTWEGD AWKIGGGTNW GWYAYDPGTN LIYFGTGNPA PWNETMRPGD NKWTMTIFGR DADTGEAKFG YQKTPHDEWD YAGVNVMMLS EQKDKDGKAR KLLTHPDRNG IVYTLDRTDG ALVSANKLDD TVNVFKSVDL KTGQPVRDPE YGTRMDHLAK DICPSAMGYH NQGHDSYDPK RELFFMGINH ICMDWEPFML PYRAGQFFVG ATLNMYPGPK GDRQNYEGLG QIKAYNAITG DYKWEKMERF AVWGGTMATA GDLVFYGTLD GYLKARDSDT GDLLWKFKIP SGAIGYPMTY THKGTQYVAI YYGVGGWPGV GLVFDLADPT AGLGAVGAFK KLANYTQMGG GVVVFSLDGK GPYDDPNVGE WKSAAK
|
| |
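The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. The sketch below is an assumption-laden illustration rather than the method used to build this record: it builds the codon assignments of NCBI translation table 11 (which match the standard code; the tables differ only in permitted start codons), strips the spaces and line breaks used in the display, and stops at the first stop codon. It does not handle alternative start codons or ambiguous bases. The names `clean`, `translate`, and `gc_fraction` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display and normalize to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 21 bases of the gene sequence shown above.
print(translate("ATGAGCAGGT TTGTGACATC AGTCTCGGCC"))  # MSRFVTSVSA
print(translate("TGGAAGTCAG CCGCCAAGTA A"))           # WKSAAK
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content rows to be compared against the nucleotide data.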