Gene Msil_0953 details

Gene Information

Locus tag: Msil_0953
Symbol:
ID: 7093632
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1038283
End bp: 1039302
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 68%
IMG OID: 643464292
Product: peptidase S1 and S6 chymotrypsin/Hap
Protein accession: YP_002361284
Protein GI: 217977137
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
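
The coordinate and length fields in this table can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, but both are consistent with the values shown. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1038283
END_BP = 1039302
GENE_LENGTH_BP = 1020
PROTEIN_LENGTH_AA = 339


def span_length(start: int, end: int) -> int:
    """Length of a feature whose start and end are both inclusive (assumed)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS of this length, not counting the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1020
    print(expected_protein_length(GENE_LENGTH_BP))  # 339
```

Under these assumptions the span from 1038283 to 1039302 covers 1020 bp, which is 340 codons: 339 residues plus one stop codon.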


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.00207331
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGACTATT TCACTTCCTT TGTGCTCGAT GAATCCGAGC CTGCGCCGCA GGATGACGCG 
GCCGCGCTGA CACGCGATTT CCCCGCGGAT GACGGGGCGC TGCTCGACGC CTATTCGAAA
AGCGTGACGC GCATCGTCGA GGAGGTCGGC CCGAGCGTCG TGCGGCTCGA CGTCAAGCGC
GGGGACGGCC GCAGCGGCGG CTCTGGCTCC GGCGTCATCG TCTCGCCGGA CGGGCTCATC
CTGACCAACA GCCATGTCGT CGGCGGCGCG CGCCGCGCAA CCGTGACGAC GCTGGACGGG
CGCAATCTGT CCGGCCGGGT CCTTGGCGAT GATCCAGACA CCGACCTCGC CTTGGTGCGG
GTCGATGAGA ACGTCACTTT GCCGGCGGCG CGGCTCGGCG ATTCGAAACG GCTGAAGCCG
GGTGAAATCG CGGTCGCCAT CGGCAATCCG CTCGGCTTCG ATTCGACCGT GACGGCGGGC
GTCATTTCGG CGCTCGGGCG TTCGCTGCGC TCGAACAATG GCCGCATGAT CGACGATGTG
ATCCAGACCG ACGCCGCGCT CAATCCCGGC AATTCCGGCG GACCGCTGGT CGCCTCGAAC
GGCGCCGTCA TCGGCGTCAA CACCGCGATC ATCGCTGGCG CGCAGGGCAT CTGCTTTGCG
GTCGCAGCGA ATACGGCGCG TTTCGTTCTT GGCGAACTCG TCGCCCATGG CCGCGTGCGC
CGCGCTTATC TCGGCGTCGG CGCCAGCACG ATCGTCCTGC CGCGCCGCAT CGCGCTCCGG
CTCGGCCTCG AGCAGACCAC GGGCGCGGTG ATCAGCCAGG TCGAAAAGGA TGGCCCCGCC
GATCACGCGG GCCTGCTTAC GGGCGATATC GTCCTTGCCG TCGATGGCGC GCCAGTCGCC
AGCGCTGGCG ATCTTCTGCG CTTGCTTGGC GCCGACAAGA TCAACCAGGT CGCGCCGCTC
GATATTCTGC GGCGCTCCGA CCGGCGCCGG TTCTGGGCCG CGCTGCGCGA GCGCGTTTGA
 
Protein sequence
MDYFTSFVLD ESEPAPQDDA AALTRDFPAD DGALLDAYSK SVTRIVEEVG PSVVRLDVKR 
GDGRSGGSGS GVIVSPDGLI LTNSHVVGGA RRATVTTLDG RNLSGRVLGD DPDTDLALVR
VDENVTLPAA RLGDSKRLKP GEIAVAIGNP LGFDSTVTAG VISALGRSLR SNNGRMIDDV
IQTDAALNPG NSGGPLVASN GAVIGVNTAI IAGAQGICFA VAANTARFVL GELVAHGRVR
RAYLGVGAST IVLPRRIALR LGLEQTTGAV ISQVEKDGPA DHAGLLTGDI VLAVDGAPVA
SAGDLLRLLG ADKINQVAPL DILRRSDRRR FWAALRERV
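
The protein sequence can be reproduced from the gene sequence by translating it codon by codon, and the GC content can be computed from the same string. The following is a minimal sketch in plain Python, not the database's own pipeline. It uses the amino-acid assignments of NCBI translation table 11, which are the same as the standard code (table 11 differs only in its permitted start codons). The helper names `clean`, `gc_content` and `translate` are hypothetical, and only the first 60-base row of the gene sequence is embedded to keep the example short.

```python
from itertools import product

# Codon order T, C, A, G at each position, matching the NCBI table layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First row of the gene sequence, with the 10-base spacing as displayed.
FIRST_ROW = "ATGGACTATT TCACTTCCTT TGTGCTCGAT GAATCCGAGC CTGCGCCGCA GGATGACGCG"


def clean(seq: str) -> str:
    """Remove whitespace and upper-case a displayed sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(translate(FIRST_ROW))  # MDYFTSFVLDESEPAPQDDA
    print(f"{gc_content(FIRST_ROW):.0%}")
```

Passing the full gene sequence to `translate` instead of `FIRST_ROW` should yield the protein sequence listed above, and `gc_content` on the full sequence can be compared with the GC content field in the Gene Information table.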