Gene Mext_4150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4150 
Symbol 
ID5832505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4617081 
End bp4618961 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content64% 
IMG OID641369940 
Productmethanol/ethanol family PQQ-dependent dehydrogenase 
Protein accessionYP_001641590 
Protein GI163853547 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4993] Glucose dehydrogenase 
TIGRFAM ID[TIGR03075] PQQ-dependent dehydrogenase, methanol/ethanol family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.383489 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.812853 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGGT TTGTGACATC AGTCTCGGCC TTGGCGATGC TGGCGCTCGC GCCGGCCGCG 
CTGTCGAGCG GGGCCTACGC CAACGATAAG CTGGTCGAGC TGTCGAAGAG CGACGACAAC
TGGGTGATGC CCGGCAAGAA CTACGATTCG AACAACTTCA GCGACCTGAA GCAGATCAAC
AAGGGCAACG TGAAGCAGCT TCGGCCGGCT TGGACGTTCT CGACCGGCTT GCTGAACGGC
CACGAGGGTG CGCCGCTCGT CGTCGACGGC AAGATGTACA TCCACACCTC GTTCCCGAAC
AACACCTTCG CTCTCGGCCT CGACGATCCG GGCACGATCC TGTGGCAGGA CAAGCCGAAG
CAGAATCCGG CCGCCCGCGC CGTCGCCTGC TGTGACCTCG TCAACCGCGG CCTCGCCTAC
TGGCCCGGCG ACGGCAAGAC CCCCGCGCTG ATCCTCAAGA CCCAGCTCGA CGGCAACGTG
GCCGCCCTCA ACGCCGAGAC CGGCGAGACG GTGTGGAAGG TCGAGAACTC CGACATCAAG
GTCGGCTCGA CGCTCACGAT CGCCCCCTAT GTCGTCAAGG ACAAGGTCAT CATCGGTTCC
TCGGGCGCCG AACTCGGCGT GCGCGGCTAC CTGACCGCCT ACGACGTGAA GACCGGCGAG
CAGGTGTGGC GCGCCTACGC CACGGGTCCG GACAAGGACC TGCTGCTGGC CTCCGACTTC
AACATCAAGA ACCCCCATTA CGGCCAGAAG GGCCTCGGCA CCGGCACCTG GGAGGGCGAT
GCCTGGAAGA TCGGCGGCGG CACCAACTGG GGCTGGTACG CCTACGATCC GGGCACGAAC
CTGATCTACT TCGGCACCGG CAACCCGGCG CCGTGGAACG AGACCATGCG TCCGGGCGAC
AACAAGTGGA CGATGACGAT CTTCGGCCGC GATGCCGACA CGGGTGAAGC CAAGTTCGGC
TACCAGAAGA CCCCGCACGA CGAGTGGGAC TATGCCGGCG TCAACGTCAT GATGCTCTCC
GAGCAGAAGG ACAAGGACGG CAAGGCCCGC AAGCTGCTGA CCCACCCGGA CCGCAACGGC
ATCGTCTACA CGCTCGACCG GACCGACGGC GCGCTCGTCT CGGCGAACAA GCTCGACGAC
ACGGTCAACG TGTTCAAGTC GGTGGATCTC AAGACCGGCC AGCCGGTGCG CGATCCCGAA
TACGGCACCC GGATGGACCA CCTCGCCAAG GACATCTGCC CCTCGGCGAT GGGTTACCAC
AACCAGGGTC ACGACTCGTA CGATCCGAAG CGTGAACTGT TCTTCATGGG CATCAACCAC
ATCTGCATGG ATTGGGAGCC CTTCATGCTT CCCTATCGTG CGGGTCAGTT CTTCGTCGGC
GCGACGCTGA ACATGTATCC GGGCCCGAAG GGCGACCGTC AGAACTACGA AGGTCTCGGC
CAGATCAAGG CGTACAACGC GATCACCGGC GACTATAAGT GGGAGAAGAT GGAGCGCTTC
GCCGTGTGGG GCGGCACCAT GGCCACCGCA GGCGATCTCG TCTTCTACGG CACGCTCGAC
GGCTACCTGA AGGCGCGCGA CTCCGACACG GGTGATCTTC TCTGGAAGTT CAAGATCCCG
TCCGGCGCCA TCGGCTACCC GATGACCTAC ACCCACAAGG GCACGCAATA CGTCGCCATC
TACTACGGCG TCGGCGGCTG GCCGGGTGTC GGCCTCGTGT TCGACCTCGC CGACCCGACC
GCCGGTCTCG GCGCGGTGGG CGCCTTCAAG AAGCTCGCCA ACTACACCCA GATGGGTGGC
GGCGTGGTGG TGTTCTCGCT CGACGGCAAG GGTCCCTACG ACGATCCGAA CGTCGGCGAG
TGGAAGTCAG CCGCCAAGTA A
 
Protein sequence
MSRFVTSVSA LAMLALAPAA LSSGAYANDK LVELSKSDDN WVMPGKNYDS NNFSDLKQIN 
KGNVKQLRPA WTFSTGLLNG HEGAPLVVDG KMYIHTSFPN NTFALGLDDP GTILWQDKPK
QNPAARAVAC CDLVNRGLAY WPGDGKTPAL ILKTQLDGNV AALNAETGET VWKVENSDIK
VGSTLTIAPY VVKDKVIIGS SGAELGVRGY LTAYDVKTGE QVWRAYATGP DKDLLLASDF
NIKNPHYGQK GLGTGTWEGD AWKIGGGTNW GWYAYDPGTN LIYFGTGNPA PWNETMRPGD
NKWTMTIFGR DADTGEAKFG YQKTPHDEWD YAGVNVMMLS EQKDKDGKAR KLLTHPDRNG
IVYTLDRTDG ALVSANKLDD TVNVFKSVDL KTGQPVRDPE YGTRMDHLAK DICPSAMGYH
NQGHDSYDPK RELFFMGINH ICMDWEPFML PYRAGQFFVG ATLNMYPGPK GDRQNYEGLG
QIKAYNAITG DYKWEKMERF AVWGGTMATA GDLVFYGTLD GYLKARDSDT GDLLWKFKIP
SGAIGYPMTY THKGTQYVAI YYGVGGWPGV GLVFDLADPT AGLGAVGAFK KLANYTQMGG
GVVVFSLDGK GPYDDPNVGE WKSAAK