Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_1362 |
Symbol | |
ID | 7091700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 1470470 |
End bp | 1472020 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643464700 |
Product | peptidase M48 Ste24p |
Protein accession | YP_002361689 |
Protein GI | 217977542 |
COG category | [R] General function prediction only |
COG ID | [COG4784] Putative Zn-dependent protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.0606393 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTTCA TGAGAATTGT TGGCGCTCGA CATCTTGCCG GCGTCGCTCC CCGCCGCAAA GCCCTGACGA CCCGGCCTCC CGCGTCCTCG CTGCGCGCCG CGATCCTCGT CGCATTCGCC GCATCCCTCG GGCTCGCGGG CTGCGCGTCG ATCGAGCCGC AGGGCGAGCA ATCCTTCAAA CTCGAGACGC CGCCGCTGCC GCCGCGCCCG CCAAAGCCGG AAAGCGAGGC GAGCGCCGAG CACAACCGCA TGGTCGCCCT GTTCGACGGC GAGTACAAGG ACCCCGCCGC CGAGCGCTAT CTCAATGATA TTCTGGCGAA GCTCGCCAAG GCCGACGACC GCCGCAGCGA GCCCTATAAG GTGACGATTC TCGATTCGCC GATCGTCAAC GCTTTCGCGC TGCCGCCGCA CGATCTCTTC ATCACCCGCG GCCTGCTGGC GCTCGCCAAT GACGCGTCGG AAGTCGCGGC CGTCATGGCC CATGAGATCG CCCATCTCAC TGCAAAGCAC GCGGTCCGTC GGGAGGAAGA GGAAAAGCGC GCGGCGGTGA TCAGCCGGGC CGCGAGCGTC GTCCAGAACA AGGAAAAGGG CCAGGAGATC GAAGCCTCCG CGCGGCGCAC GATCGCGACC TTCTCGCGCC AGCAGGAGCT CGACGCCGAC CAGATGGGCA TCAAGGGGAT CGCCAAGGCT GGCTTCGACC CTTACGGCGC CTCGCGTTTC CTTGGCGCGC TCGGACGATC CGCCGTCCTG CGCACCTCGC TGATCGGCCA GAACGCCAGC GCGGACAAGC CGGATATCCT CGCAACCCAC CCTTCGACGC CGGAGCGCGT GACGCAGGCG ATCGCCGTCG CGCGCCAGAT CGCCGCCCCG GGCATCGGCG TCACGGACCG CGACGCCTAT CTTTCCGCCA TCGACGGCAT GGTGTTCGGC GACAGTCCAA CGGAAGGAAC GGTGCGCGGA CGCAACTTCC TTCACGCCCG GCTCGGCTTC GCCTTTGCCG CTCCCGAAGG CTTTGTCCTC GAGAATGCGT CGAAGGCGCT GCTGGGCGTC TCGGCCGACG GCAATCAGGC GCTGCGGCTC GACAGCGTCA AGGCCCCGCC GGGCATGTCG CTTGAGGCCT ATATGGCCTC CGACTGGATC GACGGTCTGG AACAGGGCTC GATCGAGACC GTCGACGTCA ACGGTTCGCC GGCGGCCATC GCCAGAGCCA AAGCCGGCGA CTGGAGTTTT CGGCTCGCCG TGATCCGCTT CGAGGCGGGC GAGTTCTACC GGCTGATCTT CGCGACCCGC ATCGAAAGCC AAGACAGCGA GCGGCAGTTC AAGGACGCGC TCTCGTCGTT CCATCGCGCA AGTCCGGAGG AAATCCGCGC CGTACACCCG CTGCGAATCG AAATCGTCAC GGCAAAGCCC GGCGAGCGCG CGGAAGATCT CGCGGAGAAA ATGGCGACGC CCGACCGCAC GCTCGAATTC TTCCGGCTGA TCAACGGCCT TGAGGCCTCG GCGCCGCTGC AGGCGGGCGA GCGCTACAAG ATCGTCGCCG AGCAGAAATA G
|
Protein sequence | MPFMRIVGAR HLAGVAPRRK ALTTRPPASS LRAAILVAFA ASLGLAGCAS IEPQGEQSFK LETPPLPPRP PKPESEASAE HNRMVALFDG EYKDPAAERY LNDILAKLAK ADDRRSEPYK VTILDSPIVN AFALPPHDLF ITRGLLALAN DASEVAAVMA HEIAHLTAKH AVRREEEEKR AAVISRAASV VQNKEKGQEI EASARRTIAT FSRQQELDAD QMGIKGIAKA GFDPYGASRF LGALGRSAVL RTSLIGQNAS ADKPDILATH PSTPERVTQA IAVARQIAAP GIGVTDRDAY LSAIDGMVFG DSPTEGTVRG RNFLHARLGF AFAAPEGFVL ENASKALLGV SADGNQALRL DSVKAPPGMS LEAYMASDWI DGLEQGSIET VDVNGSPAAI ARAKAGDWSF RLAVIRFEAG EFYRLIFATR IESQDSERQF KDALSSFHRA SPEEIRAVHP LRIEIVTAKP GERAEDLAEK MATPDRTLEF FRLINGLEAS APLQAGERYK IVAEQK
|
| |