Gene Msil_0421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0421 
Symbol 
ID7093580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp463042 
End bp464031 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content61% 
IMG OID643463751 
ProductGlutathione S-transferase domain protein 
Protein accessionYP_002360757 
Protein GI217976610 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0435] Predicted glutathione S-transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.865646 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACTGC TCGTCGACGG CGTCTGGCGC GATCAATGGT ATGACACGCA AAGCCATGGC 
GGACGGTTTG AGCGCGACGC CGCGAAATTC CGCAACTGGA TCACCCCGGA CGGCGCCCCG
GGCCCATCGG GGCGCGGCGG CTTCAAGGCC GAGCCCGGCC GCTATCATCT CTACGCCGCC
TATTTCTGTC CCTGGGCGCA TCGCACGCTG ATCTTTCGCG AGCTCAAGGG CCTCGCGCCG
CTGATCGACG TCTCGATCGT CAATTGGCTG ATGCGCGAGA ACGGCATCAC CTTCGCGCCG
GCCGACGGCG TGATTGGCGA TCCGCTCTTT GGCGCGCGCA ATCTCTATGA GATCTATCAA
GCCGCCGATC CCGCCTATAG CGGCCGGGTG ACCGTGCCGA CGCTGTGGGA CAAAGAGACG
AAGACGATCG TCTCGACCGA ATCCTCCGAA ATCATCCGCA TGTTCAATTC AGCCTTCGAC
GGCGTCGGGG CGGCGGCGGG GGATTATTAT CCCCCGGAAT TGCGCGACGA AATCGACGCG
CTCAACGCGC GGATTTATCC GACGGTGAAC AACGGCGTCT ATCGCGCCGG CTTTGCGACG
ACGCAGGCGG CCTATGAGGA GGCGATCGGC CCGCTGTTCG AGACGCTGGA TTATCTTGAG
GCGTTGCTGG CCGAGCGCCG CTATCTCTGC GGCGAGCAGA TGACAGAGGC GGACATAAGG
CTCCTCACGA CTCTTTTGCG CTTCGACATC GTCTATGTCG GCCATTTCAA ATGCAATGTG
CGGCGGATCG CTGATTATCC AAATCTTTGG GCCTATGTGC GCGACCTCTA CCAGACCGGA
ACGATCGCAA ACACCTTTCG GCCAGACCAC ATCAAGGGCC ACTATTATCA GAGCCATCTG
CAGATCAATC CGACCGGAAT CGTCTCTGTC GGGCCAAGCA TCGATTTCTC CGCCCCGCAC
GACCGCGCGC GGCTTGGCGG GAGTGGTTAA
 
Protein sequence
MGLLVDGVWR DQWYDTQSHG GRFERDAAKF RNWITPDGAP GPSGRGGFKA EPGRYHLYAA 
YFCPWAHRTL IFRELKGLAP LIDVSIVNWL MRENGITFAP ADGVIGDPLF GARNLYEIYQ
AADPAYSGRV TVPTLWDKET KTIVSTESSE IIRMFNSAFD GVGAAAGDYY PPELRDEIDA
LNARIYPTVN NGVYRAGFAT TQAAYEEAIG PLFETLDYLE ALLAERRYLC GEQMTEADIR
LLTTLLRFDI VYVGHFKCNV RRIADYPNLW AYVRDLYQTG TIANTFRPDH IKGHYYQSHL
QINPTGIVSV GPSIDFSAPH DRARLGGSG