Gene Msil_2079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2079 
Symbol 
ID7091445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2253237 
End bp2254319 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content67% 
IMG OID643465403 
Productalpha/beta hydrolase fold protein 
Protein accessionYP_002362380 
Protein GI217978233 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR02427] 3-oxoadipate enol-lactonase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGA TCATCGTTGC GGGCGAGCCG TTCAATATCC GTCTTGACGG CGACGCGGAT 
GCGCCGGTTC TCATCCTGGC CCATCAGCTC GGCGGCGCGC TCCAGGTCTG GGATCGCCTC
GCGCCCGCCC TGAGCGAGCG CTTCCGCGTG CTGCGCTATG ATTCGCGCGG CCATGGATCG
AGCGTCGCCA ACCCTGGGCC CTACTCGATC GCCGGGCTGG CGCGGGACGC AATCGGACTT
CTCGACGCGC TGCAAATCGA AAAAGCCCAT TGGATCGGCC TCTCCATGGG CGCGATCGTC
GGCCAGGCCG CGATGCTGCT GGCCCCAGCC CGCATCGGCC GCGCCGTGCT CGCCAATACG
GCGGCGCAAC TCGGGACGCC CGATTTATGG AACGCGCGGA TCAGCGCGAT GCGCGCGGAC
GGCGGCGCGG GCATAGCGTC GGCGACGCAG GAGCGCTGGT TCACGCCGGA ATTTTGCGAG
GCCGAACCGG CCGCCGTCAA AGCCGTCATG GATGATTTCC GCGCGACTCC GGTCGAGGGC
TATGCGTCCG CCTGCGGCGC GCTGCGCGAC GTCGATCTGC GCGAAGCTAT CCGGTCCATC
AGCCATGAAA CCTTGGTGAT CGTCGGCGCG CGCGACCCAT CCGCGCCGCC CGCCCTTGGC
GCCTATGTCG CGAGCGTCAT CGAGGGCGCG AGGCTCGTGA CGCTGGAGAC CTCGCATATT
TCTCCGGTAG AGGATGTAGA GGGTTTCCTC GAGGCGACGC TCGAATTTTT GACCGCGCCC
GAACCCGTCG CCCGGCCGGC GCTGGCCGCG GGCAAGACGG CGCCACGGCG CAAATCGCTG
CAGCGGGTGC TGGCCGGCCG CGCGCCGGCG AAAAAAGCTC CCGCCAAAAA AGCCGCGGCC
AAGAAGGCTC CGGCGAAAAA GGCGCCGGCC AAGAAATCCG CCGCGAAAAA AACTGCGCCG
AAAAAAAGCG CGGCCAAAAA AGCCGTTTCC GTCAAGAAGG GGACGAAGAA GCCGGGCGGG
TCGCGCCGCA AAGGCGCCCG CGGATCGCTG CGAACGGTCC GCGCCAGCAA ACTCAAACAA
TAA
 
Protein sequence
MSEIIVAGEP FNIRLDGDAD APVLILAHQL GGALQVWDRL APALSERFRV LRYDSRGHGS 
SVANPGPYSI AGLARDAIGL LDALQIEKAH WIGLSMGAIV GQAAMLLAPA RIGRAVLANT
AAQLGTPDLW NARISAMRAD GGAGIASATQ ERWFTPEFCE AEPAAVKAVM DDFRATPVEG
YASACGALRD VDLREAIRSI SHETLVIVGA RDPSAPPALG AYVASVIEGA RLVTLETSHI
SPVEDVEGFL EATLEFLTAP EPVARPALAA GKTAPRRKSL QRVLAGRAPA KKAPAKKAAA
KKAPAKKAPA KKSAAKKTAP KKSAAKKAVS VKKGTKKPGG SRRKGARGSL RTVRASKLKQ