Gene Msil_0226 details

Gene Information

Locus tag: Msil_0226
Symbol:
ID: 7090543
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 252329
End bp: 253942
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 64%
IMG OID: 643463560
Product: alpha amylase catalytic region
Protein accession: YP_002360569
Protein GI: 217976422
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
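
The length fields above are consistent with the coordinates. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption that matches the stated gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 252329
end_bp = 253942

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be
# the stop codon, which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1614 537
```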


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.547651
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGACG GCCGCCCCCC CGCGATCCTC GGCGACTGGT GGCGCCCCGG CGCGATCTAC 
CAGATCTATC CGCGCTCCTT CCAGGATTCG GGCGGCGACG GCATTGGCGA TCTCGAAGGG
ATCCGGCGCC GTCTCGATTA TCTCGTCGGC CTTGGCGTCG ACGCGATCTG GATTTCGCCG
TTCTATCCCT CGCCGATGCA TGACTTCGGC TATGACGTCT CCAATTATTG CGACGTCGAT
CCGATTTTCG GCTCCCTTCG CGATTTCGAT CTTCTTCTGG CGGACGCGCA TCGGAGCGGC
CTCAAGATCG TCCTGGATTT CGTGCCGAAC CACACCTCGA TCGAGCATGA ATGGTTCGCT
GCAAGCCGAC AAAGGCGGGA CGACAAGAGC GACTGGTATA TCTGGCGCGA CGGCGCGCCC
TCCGGCGGCC CGCCCAACAA TTGGCGCAGC CATTTTGGCG GCCCGGCCTG GAGTTTCGAT
TCCGCGCGCG GCCAATATTA TTATCACGCC TTTTTGCCGC AGCAGCCCGA CCTCAACTGG
CGCAATCCCA AGGTCAAGGC GGCGATGTTC GACGTGCTGC GGTTCTGGCT GCGGCGCGGC
GTCGACGGTT TTCGCGTCGA CGTCATTTCG CAGCTCATGA AGGATGAAGC GCTGCGCGAC
AATCCGGCAA ATCCCGGTTG GACGCCGCTC CGGCCGCAGA TCGAGGAGCT GCTTCAGCTC
TATTCCGGCG ATCAGGATGA TATTCATCCT TTGATTGCGG AGATGCGCGG CGTTCTCGCC
GAATTTGGCG ATCCTTTGCT GATCGGCGAG ATCTATCTGC CGATGGAGCG CCTTGTCGCT
TATTACGGCG CGGCGCTTTC CGGCGCGCAT CTGCCGTTCA ATTTTCAGCT TCTCGAAACC
CCCTGGCAGG CTGAATCGCT TGGCGCGATG ATCGCCTCCT ACGAGGCCCT TTTGCCGGAG
GGCGCGTGGC CGAACTGGGT CCTCAGCAAC CACGATCGGC CGCGTGTGGC GACGCGCGTT
GGCGACGCGC AGGCGCGCGT CGCGACGATG CTGCTCCTGA CCCTGCGCGG CACGCCGACG
CTGTATTATG GGGATGAGCT CGGGATCGGC CATGTCGACA TTTCGCCGCC CCGCATCCGC
GACCCCTGGG CCCTGCGCGA ACCCTCGCTC GCGGTGGGGC GCGATCCGGT GCGCACGCCC
ATGCAATGGG ACGACAGCGC CAACGCCGGC TTCTCGACGC ATGAGCCATG GCTGCCGCTG
ACGCCGGACT GGCCAGAGCG GAACGTCGAG CGCTTCGAGG CGGAGCCCGC ATCGCTGCTT
CATCTGACGC GCCGCCTGCT CCACTACCGC CGCGATCATC GCACGCTGTC GCTCGGCTCA
TGGCGCCTGC TGGCGAGCAG CAATGAACTG CTCGCCTATG AACGCCGCTC CGGGCAAGAG
ACGACAATCG TCGTGCTCAA TCTCGGCGGC GCGTCGCAGC TTTGGCGGCT CGATCCCGCG
GGCTCGTCGT TTTGCGTGGC GATTTCGACT TATTGTGACC GGGCGGGCGA ACGCGTCGAT
CAAGTGCTGC GCCTGCGGCC GGATGAGGGC GTTGTGCTCG CGGTGTTGGG CTGA
 
Protein sequence
MTDGRPPAIL GDWWRPGAIY QIYPRSFQDS GGDGIGDLEG IRRRLDYLVG LGVDAIWISP 
FYPSPMHDFG YDVSNYCDVD PIFGSLRDFD LLLADAHRSG LKIVLDFVPN HTSIEHEWFA
ASRQRRDDKS DWYIWRDGAP SGGPPNNWRS HFGGPAWSFD SARGQYYYHA FLPQQPDLNW
RNPKVKAAMF DVLRFWLRRG VDGFRVDVIS QLMKDEALRD NPANPGWTPL RPQIEELLQL
YSGDQDDIHP LIAEMRGVLA EFGDPLLIGE IYLPMERLVA YYGAALSGAH LPFNFQLLET
PWQAESLGAM IASYEALLPE GAWPNWVLSN HDRPRVATRV GDAQARVATM LLLTLRGTPT
LYYGDELGIG HVDISPPRIR DPWALREPSL AVGRDPVRTP MQWDDSANAG FSTHEPWLPL
TPDWPERNVE RFEAEPASLL HLTRRLLHYR RDHRTLSLGS WRLLASSNEL LAYERRSGQE
TTIVVLNLGG ASQLWRLDPA GSSFCVAIST YCDRAGERVD QVLRLRPDEG VVLAVLG
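
The sequences above are printed in space-separated blocks of ten characters. The following is a rough sketch of how they might be joined into contiguous strings, checked for GC content, and translated. The helper names (`join_blocks`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table is the standard amino acid assignment, which translation table 11 shares; alternative start codons are not handled here. Only the first line of each sequence is used to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino acid assignments (it differs only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and first two blocks of the protein.
first_dna_line = join_blocks(
    "ATGACGGACG GCCGCCCCCC CGCGATCCTC GGCGACTGGT GGCGCCCCGG CGCGATCTAC"
)
first_protein_blocks = join_blocks("MTDGRPPAIL GDWWRPGAIY")

print(translate(first_dna_line) == first_protein_blocks)  # True
```

Applied to the full gene sequence, `gc_fraction` could be compared with the GC content listed in the Gene Information section, and `translate` with the full protein sequence.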