Gene Msil_0471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0471 
Symbol 
ID7091203 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp522216 
End bp524093 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content58% 
IMG OID643463801 
ProductPQQ-dependent dehydrogenase, methanol/ethanol family 
Protein accessionYP_002360806 
Protein GI217976659 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4993] Glucose dehydrogenase 
TIGRFAM ID[TIGR03075] PQQ-dependent dehydrogenase, methanol/ethanol family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCACAC TTTTAACCTC TGTATCCGTG GCCACGCTCG TGGCCATTTC TCAGTTCGGG 
TTTGCGCCGA TTGCATCGGC GAACGACAAG CTAAACCAAC TGTCGCAGAG CGACGACAAT
TGGGTGATGC CGGGCAAGGA TTATAAATCC GACAACTTTA GCAAGCTCAC CCAGATCAAC
GCGAGCAACG TCAAGCAGCT AAAACCGGCC TGGTCCTTCT CGACTGGCGT TTTGAATGGT
CATGAAGGCG CGCCCCTCGT CGTCGACGGC AAGATGTTTA TCTCGTCGCC CTTTCCCAAC
ACGACTTTTG CGCTCGATCT CGACGAGCCG GGCGTGATCC TTTGGGAAGA CAAGCCCAAG
CAAAATCCCG CGGCGCGCGC GGTCGCATGC TGTGACGTCG TCAATCGCGG CCTCGCCTAT
TGGCCGGGCG ACGGCAAGAC GCCCTCGCTG ATCCTCAAAA CGCAGCTTGA CGGCCATGTC
GTCGCCTTGA ACGCGGCGAC GGGCGAGGTT TACTGGAAGG TCGAGAATTC CGACATCAAG
GTGGGCTCGA CGCTCACCAT CGCGCCTTAT GTTGTAAAAG ATATCGTTCT TGTCGGGTCC
TCAGGCGCCG AGCTTGGCGT GCGCGGATAT GTGACGGGAT ACGACGTGAA AACGGGCGAG
CAGAAATGGC GCGCTTATGC GACCGGCCCG GATTCGGACC TGCTTCTCGC CGACGATTTC
AATATCAAGA ACCCCCATTA TGGCCAGAAA GGTCTTGGCA CTTCCACCTG GGAAGGCGAC
GCCTGGAAGA TCGGCGGCGG CACAAACTGG GGCTGGTACG CATTCGACCC CGACACCAAC
ATGACTTATT TCGGCACCGG CAACCCGGCG CCATGGAACG AAACCATGCG TCCAGGCGAC
AATAAATGGA CCATGACGAT TTTCGGCCGT GACGTCGACA CCGGCAAAGC CCATTTCGGC
TATCAGAAAA CCCCTCACGA CGAATGGGAT TTCGCCGGCG TCAATGTCAT GATGCTGTCC
GATCAGAAAG ACAAGGACGG CAAGCTGCGC AAGCTTCTGA CCCATCCGGA TCGCAACGGC
ATCGTCTACA CGCTTGACCG GACCAATGGC GACCTCGTCA GCGCCAACAA GATCGACGAC
ACCGTCAACG TCTTCAAGCA GGTCGATCTG AAGTCTGGCA CACCGGTCCG CGATCCCGAG
TTCGGCACGC GTATGGACCA TCTCGCTCGC GAGGTGTGTC CGTCGGCGAT GGGCTATCAC
AATCAGGGTC ATGACTCCTA CGACGCCAAC AAGCAGCTCT TCTACATGGG CATCAATCAT
ATCTGCATGG ATTGGGAGCC TTTCATGCTT CCCTACCGCG CAGGTCAGTT CTTTGTGGGC
GCGACGCTGA ACATGTATCC CGGTCCGAAG GGCGACCGTC AGAACTATGA AGGCCTCGGA
CAGATCAAGG CCTACAACGC CATCACCGGC AAATTCAAAT GGGAGAAGAT GGAGAAATTC
GCGGTTTGGG GCGGCACCAT GTCCACGGCT GGCGACCTTG TCTTCTACGG CACGCTCGAC
GGTTTCATCA AGGCGCGCAA CACCGACACC GGAGAGCTGC TGTGGAAGTT CAAGCTTCCG
TCCGGCTCCA TCGGATATCC GATTACCTAC ACCCATAAGG GCATTCAATA TGTCGCCATC
AACTATGGCG TCGGCGGATG GCCGGGCGTT GGCCTCGTGT TCGACCTTCA GGATCCGACC
GCCGGCCTTG GCGCCGTCGG CGCGTTCAAG AAGCTCGCGC ACTACACCCA GATGGGCGGC
GGCACGATGG TCTTCTCGCT CGACGGGAAG GGCCCCTATG ACGACGTGAA TCTCGGCGAG
TTTGCATCAA GCAAATAG
 
Protein sequence
MRTLLTSVSV ATLVAISQFG FAPIASANDK LNQLSQSDDN WVMPGKDYKS DNFSKLTQIN 
ASNVKQLKPA WSFSTGVLNG HEGAPLVVDG KMFISSPFPN TTFALDLDEP GVILWEDKPK
QNPAARAVAC CDVVNRGLAY WPGDGKTPSL ILKTQLDGHV VALNAATGEV YWKVENSDIK
VGSTLTIAPY VVKDIVLVGS SGAELGVRGY VTGYDVKTGE QKWRAYATGP DSDLLLADDF
NIKNPHYGQK GLGTSTWEGD AWKIGGGTNW GWYAFDPDTN MTYFGTGNPA PWNETMRPGD
NKWTMTIFGR DVDTGKAHFG YQKTPHDEWD FAGVNVMMLS DQKDKDGKLR KLLTHPDRNG
IVYTLDRTNG DLVSANKIDD TVNVFKQVDL KSGTPVRDPE FGTRMDHLAR EVCPSAMGYH
NQGHDSYDAN KQLFYMGINH ICMDWEPFML PYRAGQFFVG ATLNMYPGPK GDRQNYEGLG
QIKAYNAITG KFKWEKMEKF AVWGGTMSTA GDLVFYGTLD GFIKARNTDT GELLWKFKLP
SGSIGYPITY THKGIQYVAI NYGVGGWPGV GLVFDLQDPT AGLGAVGAFK KLAHYTQMGG
GTMVFSLDGK GPYDDVNLGE FASSK