Gene Msil_0408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0408 
Symbol 
ID7093567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp447294 
End bp449435 
Gene Length2142 bp 
Protein Length713 aa 
Translation table11 
GC content66% 
IMG OID643463738 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_002360744 
Protein GI217976597 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGCTCG ATCTCAAGAC ACTTCTCGCC GACATGACGC TGGAGGAAAA AATCGGCCAG 
CTCACCATGA CCGCCGGAAG CTATGCGGTC ACCGGGCCGA TCGTTGGCGG CGACGTTGAG
GCCGCAATTC GCGCCGGGCG CATCGGCAGC CTGCTCAATC TTTGGGGCGC AACGGAAATC
GCCGCTATTC AAAAGATTGC GATGGAGGAG TCGCGTCTCG GCATTCCGCT GATCGTCGGC
TTCGACGTTC TGCACGGCCA TCGCATGATC TTTCCCCTTC CGCTCGCGGA AGCCTGCGCC
TTCGATCCAG CGCTCTGGCG CGCGACCGCG CGGGCCGCGG CGCGGGAGGC GGCCGCCGAT
GGCGTCTCCA TGGTCTTTGC GCCGATGCTC GACGTCGCGC GCGATCCGCG CTGGGGCCGG
ATTGCCGAGG GCGCCGGCGA AGACCCCTTC GTCACATGCG AATTCGCCAA GGCCAAGATC
GCCGGTTTTC AAGGCGAGGA TCTCGCGGCT CCGGCGGCCG TCGCCGCCGT CGCCAAGCAT
TTTTGCGCCT ATGGCGCCGC AGAGGGAGGC CGCGACTACG CCTCCGCGGA TGTGTCCGAG
CGCGCTTTGC ACGAAGTCTA CCTGCCGCCT TTCGCCGCGG CGATTGAGGC CGGCTGCGCA
GCCATCATGC CGGCCTTCAT GGATCTCGCC GGCGTCCCCA TGACCGCGCA TCAACCGCTT
CTGCGCGGCT GGCTGCGCGA GGCGCGGGGC TTTGAAGGCG TCATCGTCAG CGATTACAAC
GCGTTGGCGG AACTGATGCG CCATGGCGTC GCCGCCAATC TGATTGAGGC GGCCGCGCTG
GCGCTGCGGG CGGGCGTCGA CATCGACATG ATGAGTTCGG CTTACGCCGA CGGCCTGCCG
CAGGCTTTGG CGCGGGGCCT TGTCACGGAG GAGGACATCG ACGCCTGCGT GCATCGGGTG
CTGGAGCTAA AGCAAAAACT TGGGCTGTTC GACGGCGTTC TTCGGGGCGC AAGGCGCCAA
GGCCGCGACA GCGCGGAGCC GGCGCTTGCG CGAGACGCCG CGCGGCGTTC GATCGTTCTT
TTGACCAATC ATGGCGCGCT GCCGCTTTCC CCTTCTTTGC GAAAAATCGC CTTGATCGGC
CCCCTGTCGG AGGCCGCGGG CGAAATGGAT GGCTCATGGG CCGCCGCCGG CGACAGAAAC
GCCGCCGTTT CCATTCTGGA CGGGCTGACG GCCGCGCTTC CTACAACCGA GATGTTTTGT
GCGGCGGGCG TCGCCGTCGA TTATGGGAAC GCGGCCGGCA TTGCTGACGC GATGGCGCTC
TGCGAGGAGG CGGAGCTGAT CATTCTCTGT CTTGGCGAAA GCGCCGAAAT GAGCGGGGAG
GCGGCCTCGC GCGCCGATCT CGGCCTGCCC GGGCGACAAA GAGAGCTTGC CGAAGCCGCG
CTCGGTCTGG GGCAGAAACT TGGGCGCCCC GTCGTCGCCC TGCTCTCGAG CGGACGTCCG
CTCACGCTGC CCTGGCTTTT CGAGCGCGCC GACGCTGTGG CGGCGACCTG GTTCCTGGGC
GCTGAAGCCG GCCATGCGAT TGCCGATGTT TTGACTGGCC AGTTCAATCC GGTCGGACGT
CTCGCGCTAA GCTGGCCCCG CGCGGTCGGA CAGATCCCGC TGTTTTATGC GGCCCGGCCG
ACGGGCCGGC CCTTCGCCGC CGAAGATCAT TACACGACCA AATATATCGA TTGCGCCGTG
GAGCCGCAGT TTCCGTTCGG CCATGGCCTG TCCTACAGCC CGTTTTCGCT CGAAGAATTC
GCTGCCGGCC GGCAGAGTTT CTGCGCGCGC GACACGCTCG ACTTTTCGGT CGAGGTCCAC
AATCTTGGAC CGCTCGACGG CGAGGCGACC GTGTTCCTCT TTGCGCGCGA TCTCGTCGCC
TCGGTGGCGC GGCCGGTGCT CGAACTGAAG CGCTTCGGCA AAATCGCCTT GCCGGCGGGC
GAGAGCGGCA CATTACGCTT CGCGCTTGCG GCAAGCGAAC TCGCCTTTCC CGGCGTCGAT
TTTCGGCCCT GCTTCGAGCC CGGCGCTTTT GAATTTTCAG TTGGATTCGA TGCCGATCCA
AGGCGGCACA GGAAGCTGAG CCTGCAGGCG ACGGCGCCTT AA
 
Protein sequence
MALDLKTLLA DMTLEEKIGQ LTMTAGSYAV TGPIVGGDVE AAIRAGRIGS LLNLWGATEI 
AAIQKIAMEE SRLGIPLIVG FDVLHGHRMI FPLPLAEACA FDPALWRATA RAAAREAAAD
GVSMVFAPML DVARDPRWGR IAEGAGEDPF VTCEFAKAKI AGFQGEDLAA PAAVAAVAKH
FCAYGAAEGG RDYASADVSE RALHEVYLPP FAAAIEAGCA AIMPAFMDLA GVPMTAHQPL
LRGWLREARG FEGVIVSDYN ALAELMRHGV AANLIEAAAL ALRAGVDIDM MSSAYADGLP
QALARGLVTE EDIDACVHRV LELKQKLGLF DGVLRGARRQ GRDSAEPALA RDAARRSIVL
LTNHGALPLS PSLRKIALIG PLSEAAGEMD GSWAAAGDRN AAVSILDGLT AALPTTEMFC
AAGVAVDYGN AAGIADAMAL CEEAELIILC LGESAEMSGE AASRADLGLP GRQRELAEAA
LGLGQKLGRP VVALLSSGRP LTLPWLFERA DAVAATWFLG AEAGHAIADV LTGQFNPVGR
LALSWPRAVG QIPLFYAARP TGRPFAAEDH YTTKYIDCAV EPQFPFGHGL SYSPFSLEEF
AAGRQSFCAR DTLDFSVEVH NLGPLDGEAT VFLFARDLVA SVARPVLELK RFGKIALPAG
ESGTLRFALA ASELAFPGVD FRPCFEPGAF EFSVGFDADP RRHRKLSLQA TAP