Gene Msil_0405 details


Gene Information

Locus tag: Msil_0405
Symbol:
ID: 7093564
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 442505
End bp: 443938
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 63%
IMG OID: 643463735
Product: Glucan 1,4-alpha-glucosidase
Protein accession: YP_002360741
Protein GI: 217976594
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3387] Glucoamylase and related glycosyl hydrolases
TIGRFAM ID:
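
The coordinate and length fields in this record are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from inclusive 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(442505, 443938)
print(length, protein_length_aa(length))  # 1434 477
```

Under these assumptions the result matches the Gene Length (1434 bp) and Protein Length (477 aa) fields.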


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.878783
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCTC CCTTCAAACA ACAGGCGTCG CTCGATGTCT GGATGCGCCG TCAATATGGG 
CTTTCCGCCG CGAAAATGAT GAGCGCCATT TCACGGGTCG ACCTCGTCAA GGAGCGGCGC
GGTTTTGGAC GGCTGTTGCG GCCGGCGAAA GGCTCGATCC TCGCTTCGCC CGTGATCGCC
GCCTATGATC CGGACCCCGA CTATTTCTTC CACTGGCTGC GCGATTCTGC CGTTATCATT
GACGCCTTGC GGCTGTTGAT CGAGAGCGAG GAAATCGCGC GGCCCGAGGG CCTTGCGCAT
TTTTCCGATT TCCTCGGCTT CAGCCTGACG CTGTGCGGTC TCGACGGGCG CTCATTTCTG
GCGGGAGCCG GCGATTACCG GGCGAAGGTC GAGCCTCATT TCGCGCAATT TCTGCGGCCC
GAGGCGGATC TTCTCGCCAT CTCAGGCGAC GATATCTTGG GCGAACCGAG ATTCGATCCC
GACGGTTCGA TCGACATCCT GAAATGGTCG CGGCCGCAAC ATGATGGGCC GGCCCTGCGC
GTGCTCGCGG TCGCCCGATT CTGCCAGTCG GTCGGCCCGG GGCCGGACAT TTTCAAGCAG
GCGGAAGAGC TCATCATCCG CGATCTCGGC TTCACCTTCG CCCGCTGGCG CGCGCCATCT
TTCGATATCT GGGAGGAAGA GCTCGGGCGC CACTATTATA CGCAGCTCGT CCAATGCGAG
GCGTTGCGCG AGGGCGGCTT ATGGCTCGAG TCGCGCGGCG CAATCGAGAG CGCCACCGCA
TATCTTGACG CATCGCAGGA GATCGCCGCG GGCCTCGACG ACTTCTGGAG CGCGCCGCAA
GGTTTTGTAC GAAGCCGCAT CGCCGCGGCG GGCTCCGGCC CGCAAAAGGA GCTCGACATC
GCGACCGTGC TCGCTGTGAT CCACGCCGGG CGCGAGGCTG GCGCGCATAG CGTGTGCGAC
TCGAGACTGA TTGCGACGCT TGGCCGGCTC GAAGCGCTTT TCGCCGATGC CTATACAATC
AACACGACAC AACCGGGCGC AGACGCCCCG GCAATGGGCC GCTATGACGG CGATCGCTAC
TACAGCGGCG GCGCCTATTT TTTCTCGACG CTCGGCGCGG CAGAATTCCA TTTCAAAGCG
GCGCAGGCCG TGGCGAAGGG TTTCCTCCAC GACGCCTCGG AATGGGCGCG CATCGGGCTC
GATTCAAAAC ATGACGGTCA TCATCTCTTT GAGGCGCTCC TGCGGTGCGG CGATCTGTTC
ATGACGACGG TCGCCGCCTA CACGTCGGAG AACGGCGACC TCTCCGAACA ATTCGATCAA
ACGACTGGCG TCCAGACATC GGCTAAAAAT CTCGCCTGGA GCCACGCCGC TTTCATCAGC
GCCTACGCCA GCCGGGAGAA GGCGCTTCGT TCCGCCAAAG GCGTCTCGCC GTGA
 
Protein sequence
MSAPFKQQAS LDVWMRRQYG LSAAKMMSAI SRVDLVKERR GFGRLLRPAK GSILASPVIA 
AYDPDPDYFF HWLRDSAVII DALRLLIESE EIARPEGLAH FSDFLGFSLT LCGLDGRSFL
AGAGDYRAKV EPHFAQFLRP EADLLAISGD DILGEPRFDP DGSIDILKWS RPQHDGPALR
VLAVARFCQS VGPGPDIFKQ AEELIIRDLG FTFARWRAPS FDIWEEELGR HYYTQLVQCE
ALREGGLWLE SRGAIESATA YLDASQEIAA GLDDFWSAPQ GFVRSRIAAA GSGPQKELDI
ATVLAVIHAG REAGAHSVCD SRLIATLGRL EALFADAYTI NTTQPGADAP AMGRYDGDRY
YSGGAYFFST LGAAEFHFKA AQAVAKGFLH DASEWARIGL DSKHDGHHLF EALLRCGDLF
MTTVAAYTSE NGDLSEQFDQ TTGVQTSAKN LAWSHAAFIS AYASREKALR SAKGVSP
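
The relationship between the gene sequence and the protein sequence can be checked by translating the nucleotides codon by codon. The sketch below is a hedged illustration, not the method the database itself uses. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, which does not matter here because this gene begins with ATG. The helper names (`clean`, `gc_content`, `translate`) are hypothetical.

```python
from itertools import product

# Codon-to-residue assignments in the usual T, C, A, G order.
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base and stop at the first stop codon.

    Ambiguous bases such as N are not handled and raise KeyError.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown in this record.
print(translate("ATGAGCGCTC CCTTCAAACA ACAGGCGTCG"))  # MSAPFKQQAS
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after removing spaces) is one way to confirm the two agree; `gc_content` can be compared with the GC content field in the same way.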