Gene Msil_3210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3210 
Symbol 
ID7090625 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3524847 
End bp3525875 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content64% 
IMG OID643466518 
Productaldo/keto reductase 
Protein accessionYP_002363479 
Protein GI217979332 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.0255002 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTACA GGCGTCTTGG ACGATCCGGC TTTAGCGTGC CGGTTCTCAG CTTCGGCACC 
GGCACTTTCG GCGGCAAGGG CGAGCTCTTC GCCGCCTGGG GCGCGACCGA CGTCAAGGAG
GCGAGCCGGC TCGTCGATAT CTGTCTCGAC GCGGGGCTGA CGATGTTCGA CAGCGCCGAC
ATCTACTCGT CGGGCGTTGC CGAACAGGTG CTCGGCGAGG CCATCAAGGG CCGCCGCAAT
CAGGTCCTGA TCTCGACCAA AGCGACCTTT CGCTCCGGCC CCGGCCCGAA CGAGGTCGGC
TCGTCGCGCT TCCATTTGAT CGAGGCCGTC GAAGCCGCTC TAAAGCGCCT GCAGACCGAC
CATATCGATC TCTTTCAGCT GCATGCCTTC GACGCCATGA CTCCCGTCGA GGAGACGCTG
TCGGCGCTCG ACGATCTGAT CCGCGCGGGA AAAATTCGCT ACATCGGCTG CTCGAACTTT
TCGGGCTGGC ATCTGATGAA GTCGCTCGCC ACAGCAGACC GCTATAATCT GCCACGCTAC
ATCGCGAATC AGGCTTATTA TTCGCTGGTC GGGCGCGACT ATGAATGGGA GCTGATGCCG
CTCGGGCTTG ACGAAGGCGT CGGCGCGATG GTCTGGAGCC CTCTTGGCTG GGGCCGGCTC
ACCGGCAAGA TCAGGCGCGG GCAGCCGCTG CCCGAGGTCA GTCGGCTGCA CAAGACCAAG
GACATCGGGC CGCAGGTGGA GGACGATTAT CTCTACGACG TCGTCGACGC CCTCGATGAG
ATCGCCAAGG AAACCGGCAA GTCGGTTCCG CAGATCGCGC TCAACTGGCT GTTGCAGCGT
CCGACCGTGT CGAGCGTCAT CATCGGCGCG CGCGACGAGG AGCAGCTGAA GCAAAATCTG
GGGGCGGTCG GCTGGTCCCT TGCCGCGGAG CAGATCGCAA AGCTCGACGC GGCAAGCCAG
CGCGAGCCCG CCTATCCCTA TTGGCACCAG CGCGGCACCT TTGTGGAGCG CAATCCTCTG
CCGGTCTGA
 
Protein sequence
MDYRRLGRSG FSVPVLSFGT GTFGGKGELF AAWGATDVKE ASRLVDICLD AGLTMFDSAD 
IYSSGVAEQV LGEAIKGRRN QVLISTKATF RSGPGPNEVG SSRFHLIEAV EAALKRLQTD
HIDLFQLHAF DAMTPVEETL SALDDLIRAG KIRYIGCSNF SGWHLMKSLA TADRYNLPRY
IANQAYYSLV GRDYEWELMP LGLDEGVGAM VWSPLGWGRL TGKIRRGQPL PEVSRLHKTK
DIGPQVEDDY LYDVVDALDE IAKETGKSVP QIALNWLLQR PTVSSVIIGA RDEEQLKQNL
GAVGWSLAAE QIAKLDAASQ REPAYPYWHQ RGTFVERNPL PV