Gene Msil_1821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1821 
Symbol 
ID7094100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1984846 
End bp1985874 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content66% 
IMG OID643465148 
ProductAlcohol dehydrogenase GroES domain protein 
Protein accessionYP_002362128 
Protein GI217977981 
COG category[R] General function prediction only 
COG ID[COG1064] Zn-dependent alcohol dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.162011 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCAGGA TGATGAAGGC CGCGGTCGTG CGCGAATTCG GCAAGCCGCT CGTGATCCAA 
GACGCGCCCA TCCCAACGCC AGGACCCGGC GAGGTTCTCG TCAAGGTCGC GGCCTGCGGC
GTCTGCCACA CCGATTTGCA CGCCGCCGAT GGCGATTGGC CGGTCAAGCC CGCGCCGCCG
TTCATTCCGG GCCATGAGGT CGCAGGCATC GTCGCGGCTC TCGGTCCCGG CGTCACCGAT
CTTAAAGAGG GCGACGCCGT CGGCGTCGCC TGGCTGCATG ATTCTTGCCT GCGCTGCGAA
TATTGCGAAA CAGGGTGGGA AACCTTGTGC GAGCATCAAC ACAACACAGG CTATAACGTC
AACGGCGGCT TCGCTGAATA TGTGATCGCG GCCGCCCCCT TCGCAGCGAA GCTGCCGACG
AATATCGACT TCGCGGAGAT CGCGCCGATC CTCTGCGCCG GGGTCACCAC CTACAAGGGC
ATCAAGGAAA CCGAAGCAAG GCCCGGCGAA TGGCTCGCCA TTTCGGGCGT CGGCGGGCTT
GGCCATGTCG GCATCCAATA TGCCAAAGCG ATGGGCCTGC ATGTCGCCGC GCTCGACATC
GCGCCCGAAA AGCTCGACCT CGCCATGGCG GCGGGCGCGG ACATCGCCAT CGACGCGCGA
GAGCCGGACG CCGTGGCGCA AGTCATCAAG GCGACGGGCG GCGGCGCCCA TGGCGTGCTG
GTGACGGCCG TCTCGCCGCC GGCCTTCGGC CAAGCCATTC GTCTCGTGCG CCGCAACGGC
ACCGTGAGCC TCGTCGGCCT GCCGCCCGGC GACTTCCCGA CGCCGATCTT CGAGGTGGTG
CTGAAGCGCA TCACGATTCG CGGCTCGATC GTCGGCACGC GCCGCGACCT CGACGAGGCG
ATCGCCTTCG CCGCCGAGGG CAAGGTCAAG GCGCAGATCG CGCGGGCGCC GCTCGAAGAC
ATCAACGATA TTTTCGCAAA GCTGAAGGCC GGCGAGATCG AGGGACGGAT GGTTCTCGAT
TTTCCGTGA
 
Protein sequence
MVRMMKAAVV REFGKPLVIQ DAPIPTPGPG EVLVKVAACG VCHTDLHAAD GDWPVKPAPP 
FIPGHEVAGI VAALGPGVTD LKEGDAVGVA WLHDSCLRCE YCETGWETLC EHQHNTGYNV
NGGFAEYVIA AAPFAAKLPT NIDFAEIAPI LCAGVTTYKG IKETEARPGE WLAISGVGGL
GHVGIQYAKA MGLHVAALDI APEKLDLAMA AGADIAIDAR EPDAVAQVIK ATGGGAHGVL
VTAVSPPAFG QAIRLVRRNG TVSLVGLPPG DFPTPIFEVV LKRITIRGSI VGTRRDLDEA
IAFAAEGKVK AQIARAPLED INDIFAKLKA GEIEGRMVLD FP