Gene M446_5752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5752 
Symbol 
ID6131305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp6313995 
End bp6315794 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content65% 
IMG OID641645860 
Productmethanol/ethanol family PQQ-dependent dehydrogenase 
Protein accessionYP_001772474 
Protein GI170743819 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4993] Glucose dehydrogenase 
TIGRFAM ID[TIGR03075] PQQ-dependent dehydrogenase, methanol/ethanol family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0649397 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGCGG TCCATCTCCT AGCGTTGAGC GCGGGTATCC TGGGCGTGAC GCCCGCCCTG 
GCCGATGATC TCCAGAAGGA GATCAACAAC CCGGCACAGC AGGTTCTGCA AACCGTCGAC
TACGCCAACA CCCGGTTCTC GAAGCTCGAC CAGATCAACA CCGGCAACGT CAACAAGCTG
CAGGTGGCCT GGACGTTCTC GACCGGCGTG CTGCGCGGTC ACGAGGGCGC GCCGCTGGTC
GTCGGCGACA TGATGTACGT GCACACGCCG TTCCCGAACA TCGTGTACGC GCTCGACCTG
AACCAGGACG GCAAGATCGT CTGGAAGTAC GAGCCGAAGC AGGATCCGAA CGTGATCCCG
GTGATGTGCT GCGACACGGT CAACCGTGGC CTGGCCTACG CCGACGGCAA GATCATCCTG
CACCAAGCCG ACACGACGGT CGTCGCCCTC GACGCCAAGA CCGGCAAGGT CGCGTGGTCG
GTCGTCAACG GCGACCCCAA GAAGGGCGAG ACCAACACGG CGACCGTTCT TCCGGTCAAG
GACAAGGTCA TCGTCGGCAT CTCGGGCGGT GAGTACGGCG TGCGCTGCCA CGTGACGGCC
TACAGCCTGA AGGACGGCAA GAAGATCTGG CGCGGCTACT CGATGGGCCC GGATGACGAG
ATGCTCGTCA ACCCGGAGAA GACCACCTCC CTCGGCAAGC CGGTCGGCAA GGACTCTTCG
CTGAAGACCT GGGAAGGCGA TCAGTGGAAG CAGGGCGGCG GCTGCACCTG GGGCTGGTAC
TCCTACGACC CCAAGCTGAA CCTGATGTAT TACGGTTCGG GCAATCCCTC GACCTGGAAC
CCGAAGCAGC GCCCCGGCGA CAACAAGTGG TCGATGACCG TTTGGGCGCG TGACGTCGAC
AGCGGCGAGG CCAAGTGGGT CTACCAGATG ACCCCCCACG ACGAGTGGGA CTACGACGGC
ATCAACGAGA TGATCCTCAC GGATCAGAAG GTTGGTGACA AGGAGCGTCC GCTCCTGACC
CACTTCGACC GCAACGGCTT CGGCTACACC CTCGACCGCG CGACCGGCGA GCTGCTCGTC
GCCGAGAAGT ACGATCCGGT GGTGAACTGG GCCTCCAAGG TCGACATGGA CAAGTCGTCC
AAGACCTACG GCCGTCCGCT GGTGCAGGCC AAGTACTCGA CCGAGCAGAA CGGCGAGGAC
GTGAACTCGA AGGGCATCTG CCCGGCGGCT CTGGGCTCCA AGGACCAGCA GCCGGCGGCC
TACTCGCCCA AGACCAACCT GTTCTACGTG CCGACCAACC ACGTCTGCAT GGACTACGAG
CCGTTCCGGG TGAGCTACAC CCCGGGCCAG CCCTATGTCG GTGCGACCCT GTCGATGTAC
CCGGCCCCGA ACAGCCACGG CGGCATGGGC AACTTCATCG CCTGGGACGG CGTCAACGGC
AAGATCAAGT GGTCGGTCCC CGAGCAGTTC TCGGTGTGGT CGGGCGCTCT CGCCACCGCC
GGCGACGTGG TGTTCTACGG CACCCTCGAG GGCTACCTGA AGGCCGTCGA CGCCAAGTCG
GGTAAGGAAC TCTACAAGTT CAAGACCCCG TCGGGCATCA TCGGCAACGT GATGACCTAC
GAGCACAAGG GCAAGCAGAA CGTCGCGGTC CTGTCGGGCG TCGGCGGCTG GGCGGGCATC
GGCCTCGCGG CCGGCCTGAC CGACCCGAAT GCCGGCCTCG GCGCGGTGGG CGGCTACGCG
GCCCTCTCGA ACTACACCAA CCTGGGCGGC CAGCTCACCG TCTTCACGCT GCCGAACTAA
 
Protein sequence
MRAVHLLALS AGILGVTPAL ADDLQKEINN PAQQVLQTVD YANTRFSKLD QINTGNVNKL 
QVAWTFSTGV LRGHEGAPLV VGDMMYVHTP FPNIVYALDL NQDGKIVWKY EPKQDPNVIP
VMCCDTVNRG LAYADGKIIL HQADTTVVAL DAKTGKVAWS VVNGDPKKGE TNTATVLPVK
DKVIVGISGG EYGVRCHVTA YSLKDGKKIW RGYSMGPDDE MLVNPEKTTS LGKPVGKDSS
LKTWEGDQWK QGGGCTWGWY SYDPKLNLMY YGSGNPSTWN PKQRPGDNKW SMTVWARDVD
SGEAKWVYQM TPHDEWDYDG INEMILTDQK VGDKERPLLT HFDRNGFGYT LDRATGELLV
AEKYDPVVNW ASKVDMDKSS KTYGRPLVQA KYSTEQNGED VNSKGICPAA LGSKDQQPAA
YSPKTNLFYV PTNHVCMDYE PFRVSYTPGQ PYVGATLSMY PAPNSHGGMG NFIAWDGVNG
KIKWSVPEQF SVWSGALATA GDVVFYGTLE GYLKAVDAKS GKELYKFKTP SGIIGNVMTY
EHKGKQNVAV LSGVGGWAGI GLAAGLTDPN AGLGAVGGYA ALSNYTNLGG QLTVFTLPN