Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_2993 |
Symbol | |
ID | 7093488 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 3304239 |
End bp | 3305669 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643466304 |
Product | protease Do |
Protein accession | YP_002363266 |
Protein GI | 217979119 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.700896 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGG AATTTTTCGC AAGGGGCGCG TCTCGCCTGC GCCGCGCTTA CGGCGTCGCG CTAGCGCTTG TCTGCTGCCT TGGCCTTCCC GCCCAGGCGG AGACGGCGCG TCAGGCGCCG CAGGCCCCAG CCGAGGTGAT GCTCTCTTTC GCGCCCGTGG TGAAAAAGGC GCAACCTGCG GTCGTCAACG TCTATGCGTC GCGGACGGAG AAGCGGCCGC GCAGCGCACT CTACGACGAT CCGATTTTCG AGCGGTTTTT TGGCGGCGGC GGCCGTCCCG GCGGCTCGAC CTCGCGCTCG CTTGGGTCCG GCGTGCTGGT CGATTCCTCG GGCCTCGTCG TCACCAATTA CCACGTCATC GAAGGCATGA CGGACGTCAA GATCGCGCTT GCCGACAAGC GCGAGTTTGA CGCGGACATT GTGCTGCGCG ACCAGCGCAC CGATCTTGCT GTGCTGCGGC TGAAGGGCGG CGCCAATTTT CCGGTGATGG AGCTTGGCGA TTCCGACGCG CTCGAAGTCG GCGACTTCGT GCTGGCCATC GGCAATCCGT TTGGAGTCGG CCAGACCGTG ACGCAGGGGA TCGTCTCGGC GCTCGCCCGC ACCCAGGCGG GCATTTCCGA TTCCGGCTTC TTCATCCAGA CCGACGCGGC GATTAATCCC GGCAATTCCG GCGGCGGCCT CGTCGATATG AGGGGCCGTC TCGTCGGCAT CAACTCGGCG ATCTTCTCGC AGACGGGCAA TTCGGTCGGC ATCGGCTTCG CCGTCCCGAG CAATATGGTG CGGGTCGTCA TCGCGGCGGC CAAATCCGGC CAGCGGGTGC GGCGGCCCTG GTTAGGGGCA AGCCTGCAGG CGGTCTCGCG TGAAATCGCC GATTCGCTCG GCCTCGACCG CCCGTCGGGC GCCCTGGTCG CCGAGGTGAC GGACGGAGGT CCGGCGGACA AGGCCGGAGT CAAGCGCGGC GACATCATCG CCGCTGTCGA CGGCCAGACC ATCGACGATC CCGAAAGCTT CGGATATCGC CTGTCGACCA AAGCGCTCGG CGGCGAGACC TCGCTTTCGC TGGTGCGCAA CGGCAAGCCG CTGAACGTCA AGCTGGCGCT ATCCCCCGCG GCTGAAATCC CGGCGCGCGA TCCGGTCAAA CTGAAAGGTC CGTCGCCTTT TTCCGGCGCG ACGGTGATCA ATCTATCCCC GGCTGTCATC GAGGAAATGT CGGTCCACGG CGTCAACGAT GGGGTCGTGA TCGGCGATAT CGAGGACGGA TCGACCGCTG CGGAGGTCAA TTTCCAAAAG GGCGACGTCA TTCTTCTCAT CAATGACGTC AAGATCCAGA CGACCCGCGA TCTTGAGAAG GCGGTCAACG GCCGCCATAC CTATTGGAAG CTGACCATTT TACGCGGCGG ACAGGTGGAG ACGACGGTGC TCGGGGGCTG A
|
Protein sequence | MTTEFFARGA SRLRRAYGVA LALVCCLGLP AQAETARQAP QAPAEVMLSF APVVKKAQPA VVNVYASRTE KRPRSALYDD PIFERFFGGG GRPGGSTSRS LGSGVLVDSS GLVVTNYHVI EGMTDVKIAL ADKREFDADI VLRDQRTDLA VLRLKGGANF PVMELGDSDA LEVGDFVLAI GNPFGVGQTV TQGIVSALAR TQAGISDSGF FIQTDAAINP GNSGGGLVDM RGRLVGINSA IFSQTGNSVG IGFAVPSNMV RVVIAAAKSG QRVRRPWLGA SLQAVSREIA DSLGLDRPSG ALVAEVTDGG PADKAGVKRG DIIAAVDGQT IDDPESFGYR LSTKALGGET SLSLVRNGKP LNVKLALSPA AEIPARDPVK LKGPSPFSGA TVINLSPAVI EEMSVHGVND GVVIGDIEDG STAAEVNFQK GDVILLINDV KIQTTRDLEK AVNGRHTYWK LTILRGGQVE TTVLGG
|
| |