Gene Msil_3685 details

Gene Information

Locus tag: Msil_3685
Symbol:
ID: 7093039
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 4046193
End bp: 4047188
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 66%
IMG OID: 643466972
Product: aldo/keto reductase
Protein accession: YP_002363931
Protein GI: 217979784
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
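
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that the start and end coordinates are both inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 4046193
end_bp = 4047188
gene_length_bp = 996
protein_length_aa = 331


def span_length(start: int, end: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)        # True
print(coding_codons(gene_length_bp) == protein_length_aa)     # True
```

Under these assumptions, 996 bp corresponds to 332 codons, of which the last is the stop codon, leaving 331 amino acids.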


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.181793
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACGC GTCAACTTGG GGCCAAAGGG CCGCCCGTCT CGGCCATTGG CCTTGGCTGC 
ATGGGAATGT CGGATTTCTA TGGACCGTCC GATCGCGAGG AAAGCATCGC GACGATCCAC
GCCGCGCTCG ACGCGGGCGT GACGCTGCTC GATACCGGCG ATTTCTATGG CATGGGCCAT
AATGAGATGC TGATCGCCGA GGCGCTGCAG GGGGTGAGCC GCGAGGCCTT TCAGGTCAGC
GTCAAATTCG GGGCGCAACG CGATCCTTCC GGCGCCTGGA TCGGGTTCGA CGCGCGCCCC
CAGGCGGTGA AGACTTCGCT GACCTATTCG CTGCGCCGGT TGCGGCTCGA TTACGTTGAC
GTCTATCGCC CGGCTCGCCT CGATCCGCAT GTGCCGATCG AGGACACGGT CGGCGCCATC
GCCGATATGG TCAAGGCCGG CTATGTCAGG GAAATCGGCC TGTCCGAGGT CGGCAGCGAG
ACGCTGCGGC GCGCGGCGGC CGTGCATGCG ATCGCCGATC TGCAAATCGA ATATTCGCTG
ATCTCGCGCG GCATCGAGGG CGGCGTTCTC TCCACATGTC GCGAACTTGG GATTGCGCTC
ACCGCCTATG GCGTACTGTC GCGCGGGCTG ATCAGCGGGC ATTGGCGCCC CGGGCCGCTC
GAGCCAGGCG ATTTTCGCTC GCGCAGCCCG CGTTTCCAGG AGGGCAATGT CGACAAGAAT
CTCCAGCTCG TCGAGGCGCT GCGGAAACTG GCGGCAGAGA AGGGCGCAAG CGTCGCGCAG
ATTGCGATCG CCTGGGTTCT GGCGCAGGGC GAGGACATCA TCCCGCTCAT CGGCGCGCGG
CGGCGCGACC GCCTCGCTGA GGCTCTTGGC GCCCTCAACG TCACGCTGAC GCCGAAGGAT
ATTTCTGCGA TCGAGGCGAT CGCGCCCAAA GGCGCCGCCG CGGGCGAGCG CTACGACGCC
CCGCAAATGG CCTTTCTCGA CAGCGAGCGG GGGTAG
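
A GC content figure such as the one listed for this gene can be recomputed from the sequence text. The sketch below assumes GC content means the percentage of G and C bases over the whole gene sequence; the record does not state how its value was derived, and `gc_percent` is a hypothetical helper name. Whitespace is stripped first because the sequence is printed in blocks of ten bases.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Spaces and line breaks are removed first, since the sequence
    is displayed in space-separated blocks.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in cleaned)
    return 100.0 * gc_count / len(cleaned)


# First ten-base block of the gene sequence above: 4 of 10 bases are G or C.
print(gc_percent("ATGAAAACGC"))  # 40.0
```

Passing the full gene sequence text to the same function gives the gene-wide value.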
 
Protein sequence
MKTRQLGAKG PPVSAIGLGC MGMSDFYGPS DREESIATIH AALDAGVTLL DTGDFYGMGH 
NEMLIAEALQ GVSREAFQVS VKFGAQRDPS GAWIGFDARP QAVKTSLTYS LRRLRLDYVD
VYRPARLDPH VPIEDTVGAI ADMVKAGYVR EIGLSEVGSE TLRRAAAVHA IADLQIEYSL
ISRGIEGGVL STCRELGIAL TAYGVLSRGL ISGHWRPGPL EPGDFRSRSP RFQEGNVDKN
LQLVEALRKL AAEKGASVAQ IAIAWVLAQG EDIIPLIGAR RRDRLAEALG ALNVTLTPKD
ISAIEAIAPK GAAAGERYDA PQMAFLDSER G