Gene Msil_2142 details


Gene Information

Locus tag: Msil_2142
Symbol:
ID: 7093363
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 2316104
End bp: 2317096
Gene Length: 993 bp
Protein Length: 330 aa
Translation table: 11
GC content: 62%
IMG OID: 643465467
Product: putative sulfite oxidase subunit YedY
Protein accession: YP_002362443
Protein GI: 217978296
COG category: [R] General function prediction only
COG ID: [COG2041] Sulfite oxidase and related enzymes
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
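The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon; neither convention is stated in the record, but both are consistent with the values it lists. The function names are illustrative only.

```python
# Values copied from the record; the helper names are hypothetical.
START_BP = 2316104
END_BP = 2317096
GENE_LENGTH_BP = 993
PROTEIN_LENGTH_AA = 330


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 993
    print(expected_protein_length(GENE_LENGTH_BP))  # 330
```

Under these assumptions, 2317096 - 2316104 + 1 gives 993 bp, and 993 / 3 - 1 gives 330 aa, matching the Gene Length and Protein Length fields.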


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCTATC ATCGCCGCAA AGGCTGGGAG ATTCCGGAGC GCGAGGCGAC GCCCGAAGCG 
CTCTTTTTCG CGCGCCGCTC TCTCCTCAAG GCCGGCGTCG CCGCGGCGGC GCTGGCGGCG
GCTCCTTCCG CGCAGGCGTT CTCTTTGTTC GGCGGCGGCG ACAAGCCCGC CGCGCCGGAG
GCGCCCGATC AGACGCAAGG CCGTTATCCC GCAGGGCGCA ATGATCTGTT CAAGCTCGAC
CGCGATGTGA CGCCGGAGGA GATCAATTCC CACTACAATA ATTTTTACGA ATTCGGCTCG
GGCAAAGATA TTTTTGAGGC GGCGCAGGCG CTCAAGACTC GGCCCTGGAC CCTAAAAATC
GACGGTCTTG TCGAAGCCCC GAAGGAAATG GGGATCGACG ATCTCATTGC CTCCGCCCCG
CTCGAAGAGC GTCTTTACCG GCACCGTTGC GTCGAGGCCT GGGCGATGGC GATTCCCTGG
ACCGGCTTCC CGCTGAAACA CCTTGTCGAT CTGGCAAAGC CCCAGTCGGG CGCGAAATTT
GTGCGCTTCG AGACCTTTTT GGATCGCTCG ATGGCGCCGG GGCAGCGCCA GGTCTGGTAT
CCGTGGCCCT ATGGCGAGGG GCTGACCATG GCCGAGGCGT CAAACGATCT CGCCTTTCTC
GTCACCGGCG CCTATGGCAA GCCGCTCGGA AAGCAGTTCG GCGCGCCGCT GCGGCTGGCG
GTCCCGTGGA AATATGGGTT CAAGTCGATC AAATCGATCA CCAAAATTTC CTTCGTCGCC
GAGCGGCCGA AAACCTTCTG GGAGCAGCTG CAGGCGTCCG AATATGGCTT TTGGGCCAAT
GTGAACCCCG ACGTGCCGCA TCCGCGCTGG AGCCAGGCGA GCGAAGAGGT GCTGGGGACG
CATGAGCGCC GCAAGACGCA GATCTTCAAT GGCTACGGCG AATTCGTCGG CGGCCTCTAT
GTCGGGCTGG AGAAGGAGCG GCTTTACGTT TGA
 
Protein sequence
MFYHRRKGWE IPEREATPEA LFFARRSLLK AGVAAAALAA APSAQAFSLF GGGDKPAAPE 
APDQTQGRYP AGRNDLFKLD RDVTPEEINS HYNNFYEFGS GKDIFEAAQA LKTRPWTLKI
DGLVEAPKEM GIDDLIASAP LEERLYRHRC VEAWAMAIPW TGFPLKHLVD LAKPQSGAKF
VRFETFLDRS MAPGQRQVWY PWPYGEGLTM AEASNDLAFL VTGAYGKPLG KQFGAPLRLA
VPWKYGFKSI KSITKISFVA ERPKTFWEQL QASEYGFWAN VNPDVPHPRW SQASEEVLGT
HERRKTQIFN GYGEFVGGLY VGLEKERLYV
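
The gene and protein sequences above are printed in blocks of ten characters. A minimal sketch of how the gene sequence could be translated and its GC fraction computed is given below. It builds the codon table from the standard NCBI amino-acid string; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, which does not matter here because the gene begins with ATG. The function names are illustrative and do not come from the source database.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence shown above.
    print(translate("ATGTTCTATC ATCGCCGCAA AGGCTGGGAG"))  # MFYHRRKGWE
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to check that the two listings agree; `gc_fraction` on the same input can be compared with the GC content field.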