Gene Msil_3149 details

Gene Information

Locus tag: Msil_3149
Symbol:
ID: 7093809
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 3460200
End bp: 3462095
Gene Length: 1896 bp
Protein Length: 631 aa
Translation table: 11
GC content: 61%
IMG OID: 643466459
Product: PQQ-dependent dehydrogenase, methanol/ethanol family
Protein accession: YP_002363420
Protein GI: 217979273
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4993] Glucose dehydrogenase
TIGRFAM ID: [TIGR03075] PQQ-dependent dehydrogenase, methanol/ethanol family
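
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive positions and that the gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(3460200, 3462095)
print(length)                  # 1896
print(protein_length(length))  # 631
```

Both results match the Gene Length and Protein Length fields of this record.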


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTTAC GGAAAAGTTT ACCGTGGCGC GCGGCCTTCG CCGGTTTGGC GATGGCGTCG 
CTGTCGGGCG CCGCCTGGGC CGGCTCCGAT GAAGAAATCA TAAAGAACAG CAAGAATCCG
GATCTCTGGC CGGGGATGGG GCAAAATCTC GGCCTGCAGC GTCACAGCGA ATTGAAGGAC
ATCAACAAGG ACAACGTCAG CAATCTTCAG ATGTCCTGGT CGCAATCCTC GGGAGCTCTG
CGCGGACACG AAGGGCAGCC AGTCGTTGTC GACGTCGGCG GAAAGCCGAT GATGTTTTTC
GTCAGCGCTT GGCCGAACAT CGTGCAGGCG CTCGATCTTT CGGACCCCGA CAATCCGGTT
CAGGTCTGGT CCTACAACAA GGGGACCGAT CGCGATGTGT CCGCCGTGCC GCGCGCCTGC
TGCGACGTGG TCAACCGCGG CGTGAACTAC GCCGATGGCA AGCTGCTGTT CAACACGCTG
GACGGCTTCC TCATCGCGCT CGACGCCAAG ACCGGCCAAG AATTGTGGGT CGTCAAGCAC
GCCTTTCCCG AACATGGCGA GACGGTGACG AGCGCGCCGT TGATCGCCAA GGACAAGGTG
ATCGTCGGCT TCGGCGGCGA TGAATTCGCT GCGCGCGGAC GGCTCGAAGC CTATGACCTC
GCGACCGGCG ACCTCGCCTG GCGGTGCCAG AGCAATGGGA CGGACAAGGA CGTCTGCCTG
ACCCCCGACA CCAACAAGGC GCATCCCGAA CATGGGACTT ATGGCCACGA TATCGGCCTC
TCGAGCTATC CCGGCGACGA GTGGAAGCGT GGCGGCGGGT CGCCCTGGGC CTGGTACAGC
TATGATCCCG AGCTCGGCCT TGTTTACGCC TCCACCGGCA ACCCGGGGAA CTGGAGCCCG
ACGACGCGTT GCGGCGCCGA CACGGATGCG GAATGCAATT CAGGCAAATG GGACAACAAA
TGGTCGATGA CGATCTTCGC CCGCAAGGTC GACACCGGCG AAGTCGTCTG GGCCTACCAG
ATGACGCCTT TCGATCAATG GGACTATGAC GGCGTCAACG AGAATATCCT TGTCGACATG
CCGAATGTCG ACGGCAAGCC CGTCAAGGCG CTTGTGCACT TCGATCGCAA CGGCTTCGCC
TATGTCCTCG ATCGGACAGA CGGCAATCTC CTGCGCGCCC ACAAATTCGT CACCGTAAAC
TGGGCCGAAA AAGTCGATCT CAAGACCGGA CGGCCGGTCA AGGTGCCGGA GCACTCACCC
TTCAAGATCG GCGTCAACAC GCAGGCCTGT CCCTCGGCGA TGGGCGGCAA GGATCAGCAG
CCGGCTTCGG TCGACCCCAA GGATCCAACC AATTTCTATG TGCCGACGAA CAATTGGTGC
ATGGAAGATG AGCCCCAGGC CCGCACCCAT ACGCAGCAGG GCACGGTTTA TGTCTTCGCC
AATGTCTACA TGTATCCGGA AAAGCCGGGA GTGACGGGCA AGCTCAAGAA GTTCGACGTG
CTGACCGGCA AGACCGCCTG GGAGGTCCCG GACGCCTATC CGAACTGGAG CGGCACGCTG
AACACGGCGG GCGGCCTCGT CTTCTACGGA AGCCTTAACG GCGACTTCCG CGCCGTCGAC
CGCGATAACG GCAAAGTGCT TTGGCAGCGC AAGCTCGGCT CCGGCATCAT CGGCAACCCG
ATCGCTTACA AGATCAAGGG CCATGAATAT ATCTCGGTGT TCGCGGGCAT CGGCGGCTGG
ATCGGCCTGC CGGCGGTGGC GGGCCTCGAT CTCGAAGACA AGTTCGGCGC GATCGGTTCG
ACGGCTCTGA CCAAGGTCAT CGGACTGAAC AAGATCCCGC AGGGCGGCGC CCTCTATACG
TTCCGAGTGC CGGACAAGGA GGCCACAGCC CACTAG
 
Protein sequence
MNLRKSLPWR AAFAGLAMAS LSGAAWAGSD EEIIKNSKNP DLWPGMGQNL GLQRHSELKD 
INKDNVSNLQ MSWSQSSGAL RGHEGQPVVV DVGGKPMMFF VSAWPNIVQA LDLSDPDNPV
QVWSYNKGTD RDVSAVPRAC CDVVNRGVNY ADGKLLFNTL DGFLIALDAK TGQELWVVKH
AFPEHGETVT SAPLIAKDKV IVGFGGDEFA ARGRLEAYDL ATGDLAWRCQ SNGTDKDVCL
TPDTNKAHPE HGTYGHDIGL SSYPGDEWKR GGGSPWAWYS YDPELGLVYA STGNPGNWSP
TTRCGADTDA ECNSGKWDNK WSMTIFARKV DTGEVVWAYQ MTPFDQWDYD GVNENILVDM
PNVDGKPVKA LVHFDRNGFA YVLDRTDGNL LRAHKFVTVN WAEKVDLKTG RPVKVPEHSP
FKIGVNTQAC PSAMGGKDQQ PASVDPKDPT NFYVPTNNWC MEDEPQARTH TQQGTVYVFA
NVYMYPEKPG VTGKLKKFDV LTGKTAWEVP DAYPNWSGTL NTAGGLVFYG SLNGDFRAVD
RDNGKVLWQR KLGSGIIGNP IAYKIKGHEY ISVFAGIGGW IGLPAVAGLD LEDKFGAIGS
TALTKVIGLN KIPQGGALYT FRVPDKEATA H
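
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they relate, the code below strips that whitespace, computes GC content, and translates DNA to protein. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code (table 11 differs in the start codons it permits; this gene begins with ATG, so no special handling is attempted here). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table in the conventional TCAG ordering used for the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq_text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = clean("ATGAATTTAC GGAAAAGTTT ACCGTGGCGC")
print(translate(first_blocks))  # MNLRKSLPWR
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence through `clean` and `gc_content` would give a value to compare against the GC content field.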