Gene Msil_3445 details


Gene Information

Locus tag: Msil_3445
Symbol:
ID: 7092468
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 3786853
End bp: 3788733
Gene Length: 1881 bp
Protein Length: 626 aa
Translation table: 11
GC content: 60%
IMG OID: 643466740
Product: Peptidase S53 propeptide
Protein accession: YP_002363701
Protein GI: 217979554
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4934] Predicted protease
TIGRFAM ID:
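
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3786853
end_bp = 3788733
gene_length_bp = 1881
protein_length_aa = 626

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, which encodes no residue,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the record: the span is 1881 bp, which is 627 codons, giving 626 residues after dropping the stop codon.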


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCGAT CACTGCGCAA CTATTTTGCT GCGGCAATGA TTTTTGGTGT GACGGGGGCG 
CAAGCCGGGA TTCCCTACCC GACGGCGGCG ACCCCTAAAG CGATCGACCG GGGCTTAATG
AAAAGCCTGG CGAGCAGCAG TGAAATCTCG GTGACGGTCG CGCTGCGGCC GCGCGACCCA
AACGGCGCGG AAGAGTTGCT CAGCGCGCTG ACGACGCCGG ATGACCCTCA GTTTCACAAA
TTTCTCACTC CGCAACAGTT CGCCGCGAAA TTCGGCCCGA GCGCGGCGGA TGTCGCGAAG
GTGATAGCGA CACTGAAGGG CTATGGCCTT CGGGTCGAGC AGGCGACGCC TTTCACGCTG
CGCGCAACTG GAACTCCTGC CAAAATTGAA AGCGCCTTTC ACGTAAGCCT GCACCAGTTC
GATGTTCCGG CGCAGGGCGG CGCGGCCGCC TACAGCTATC ACGCGCCGGC GACGCCTCCG
ACCGCGCCGG ACGCGGTGGC CGGGCTTATC TCCGGCATCG TCGGTCTGGA TACAAAGCCG
CATTTTCGGC CCCACGTTCA GAAGGCTCCG GCGGAGCTAA GCGCCGTCGA GGGGCAGCAG
CAGCAAAGCG GCAACCCAAG CCTTATAAAC CCTTTGGGGT CGCTTACGGT GGCAGATTTC
GCCCAGTATT ACGACGTGAA GCCGCTCTAC GCCGCCGGCG TCGCGGGCAA CGGCCGGACG
CTGGGAATCA TCACTCTTGC GAATTTCACT CCGAGCGACG CTTTCCATTA TTGGACAAGG
GTCGGGCTTG CGGTCGCGTC CAACCGAATG ACCCTCGTCA ACATCGATGG CGGCCCGGGC
GCCCCGAGTG ACGTTTCCGG CTCCGACGAG ACCACACTGG ATGTTGAGCA GTCCGGCGGA
TTGGCGCCGG GGGCCAGGAT GATTGTTTAT TTGGCGCCTA ACACGAACCA GGCTTTCTTC
GACGCCTTCG CCAAGGCGGT CAACGATAAT ACGGCCGATA CGGTATCGGT CAGCTGGGGC
GCTTGGGAGG GATTTGATCA GTCAACCGGA TTTACCAATT CGCTGCATAG CCTGCTCGTT
CAGGCTGCCG TACAAGGGCA GAGCTTCTTC GCCGCCGCTG GCGATGATGG CGCCTACGAC
GTCGATCGCG CGATCGGCGT CCAGGCAGGC GGCGTGACGG TCGATTATCC GGCGAGCGAT
CCGGCCATAA CAGCGGCAGG GGGCACGACG CTTGCCGGCA AGCAGGCGTT TACCGTCAAT
GGACGTCCCC TTGTCATCAA TGTCGCCAAG GAGCGCGTGT GGGGCTGGGA CTATCTTGAT
CCTGTCTGCA AAAAACGAAA ATTGGACCCA ATCGATTGCG GCATCTTCTC CGTGGGCGGT
GGCGGCGGGG TCAGCATTGT GTTTGGCATT CCTGATTATC AAACAGTGAC CAAGAGAAAA
GGCGCTGTGC CGATTCCGGG GATAAAGACA AGCGCAAAGG GGGAGACAGT GCAAGGCGCG
CGGCTGCCGG CTGGTTTCCA GGGGCGCAAC GTGCCGGACA TTTCAGCAAA TGCCGATCCC
AATACCGGAT ATTCGATGGA TTATACCTCT AACATTCACG GCTTCCGCAC GACTACTTTC
AATGGCGGCA CCAGCTTCGT CGCGCCTCAA TTTAATGGCG TCACGGCTCT GCTCTGCCAG
AAAGCAAATA GCAGGCTCGG TTTGATCAAT AACCCTCTCT ACAGTTTAGT GAGAGCGAAT
GCTGGCAAGA AGGCAGGCGG ACCAATAAGA TCCATCGCGA CCGGCGACAA TTGGTTTTAC
AAAGGCGCAC AGGGTTATAG CCCGGCCGCC GGGGCCGGCG TGCTCGACGT GACGAAACTA
GCAACCGAAC CTGGATTCTA G
 
Protein sequence
MNRSLRNYFA AAMIFGVTGA QAGIPYPTAA TPKAIDRGLM KSLASSSEIS VTVALRPRDP 
NGAEELLSAL TTPDDPQFHK FLTPQQFAAK FGPSAADVAK VIATLKGYGL RVEQATPFTL
RATGTPAKIE SAFHVSLHQF DVPAQGGAAA YSYHAPATPP TAPDAVAGLI SGIVGLDTKP
HFRPHVQKAP AELSAVEGQQ QQSGNPSLIN PLGSLTVADF AQYYDVKPLY AAGVAGNGRT
LGIITLANFT PSDAFHYWTR VGLAVASNRM TLVNIDGGPG APSDVSGSDE TTLDVEQSGG
LAPGARMIVY LAPNTNQAFF DAFAKAVNDN TADTVSVSWG AWEGFDQSTG FTNSLHSLLV
QAAVQGQSFF AAAGDDGAYD VDRAIGVQAG GVTVDYPASD PAITAAGGTT LAGKQAFTVN
GRPLVINVAK ERVWGWDYLD PVCKKRKLDP IDCGIFSVGG GGGVSIVFGI PDYQTVTKRK
GAVPIPGIKT SAKGETVQGA RLPAGFQGRN VPDISANADP NTGYSMDYTS NIHGFRTTTF
NGGTSFVAPQ FNGVTALLCQ KANSRLGLIN NPLYSLVRAN AGKKAGGPIR SIATGDNWFY
KGAQGYSPAA GAGVLDVTKL ATEPGF