Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_0188 |
Symbol | |
ID | 7090505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 192430 |
End bp | 194004 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643463522 |
Product | protease Do |
Protein accession | YP_002360531 |
Protein GI | 217976384 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.463194 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTTCC CTTTGCTTCC TGGTTCTGAA CAGAATGGCC AATCGCGTCC CGCGCCCCGG CGCGGCCTTC GGGTCGTCCT GCTTGGCGCC GTGGCGACGG CTGCCTTGAC CGGAGCGCTG ACGACCGGCT TCGTTTCGCC GCATTCCGCG GTCGCCGAAA CCGCCGCTCC CATCGCGGCG CAAGCGCCTT CCGCCTCGCC GGTATCCTTC GCCGATGTCG TCGACCATGT TCGCGACGCA GTCGTTTCGG TCAAGGTCAA GATCACTGAG ACCGCCGACG CGTCGGATGA CGACGATGAC AACAGCGATT CGCCGCGCCC CGGCCAGATT CCGCGCCTTC AGCCGGGCGA TCCGCTGGAG CGCTTCTTCA AGCGCTTCGG CCAGCCGGGC ATGCCGCATC CGGGCGGTCC CGGCAAGCCG CATTCGGCGC AGGCGCAGGG CTCGGGCTTC ATCATTTCGT CGGACGGCTA TGTGGTGACC AATAACCACG TCGTCGAGAA GGCGACCGAA GTCACGCTCA CCACCGACGA GGGCAAGACC CTTCATGCGA CGGTCGTCGG CACCGACAAG AAGACCGATC TGGCCTTGTT GAAGATCAAG GAAGACGGCT CCTATCCTCA CGTGAAATTC TCCAGCGCCA CCCCCCGCGT CGGCGACTGG GTGATTGCGG TCGGCAATCC GTTCGGCCTT GGCGGAACGG TGACGGCCGG CATCGTCTCG GCGCGCGGCC GCGACATCGG CGCCGGCCCC TATGACGATT TCCTGCAGAT CGACGCCCCG GTGAACCGCG GCAACTCGGG CGGCCCGACC TTCAACACGC TGGGCGACGT CGTCGGCGTC AACACGGCGA TCTTCTCGCC GTCCGGCGGC AGCGTCGGCA TCGGCTTCGC CATCCCCTCG GAGACGGCGC AGTCGATCAT TGCGAGCCTC AAGGACAAGG GCGCCGTGGC GCGCGGCTGG ATCGGCGTGC AGATTCAGCC GGTCACCGAT GAGATCGCCG ACAGCCTTGG CCTCAAGTCG AGCAAGGGCG CACTCGTCGC CGACGCGCAG GACAATTCGC CCGCCAAGGA AGCGGGCATC AAATCCGGCG ACGTGATCCT CGGCGTCAAT GGCGAGCGCG TCGATGGACC GCGCGATCTC GCCAAGAAAG TGGCGGCGCT TGGTCCGGGC AAGAAGGCCG ATCTGCTCTA TTGGCACGAC GGCGCGGAGA AGACCGTAGC GGTGAAGCTC GGCTCGCTCC CCGACGAGAA AGAGGCGGCA AAGCCGGCGG CGCTGCAGGA TAATTCTGCG CTTGCGGGCC TCGGGCTGAA GCTGGCTCCG GCGTCTTCCG TGCAAGGCGC GGGCAATGAT GGCGTCGTTG TCGCCGACAT CGATCCCGAA GGCTCGGCCG CGCAGAAGGG CCTCAGGGTC GGCGATCTCA TCCTCGAGGC CGGCGGCCGC GCGGTGAGCA AGCCGTCCGA AATTGCGGCG ATTATCGCTG ACGCCAAGAA GGATGGCCGC AAGGCCGTGC TGCTGCGGGT CAAGAGCGGC GAAGGCACGC GCTTCGTCGC CGTGGCGACC AACCCCGCTT CCTAA
|
Protein sequence | MAFPLLPGSE QNGQSRPAPR RGLRVVLLGA VATAALTGAL TTGFVSPHSA VAETAAPIAA QAPSASPVSF ADVVDHVRDA VVSVKVKITE TADASDDDDD NSDSPRPGQI PRLQPGDPLE RFFKRFGQPG MPHPGGPGKP HSAQAQGSGF IISSDGYVVT NNHVVEKATE VTLTTDEGKT LHATVVGTDK KTDLALLKIK EDGSYPHVKF SSATPRVGDW VIAVGNPFGL GGTVTAGIVS ARGRDIGAGP YDDFLQIDAP VNRGNSGGPT FNTLGDVVGV NTAIFSPSGG SVGIGFAIPS ETAQSIIASL KDKGAVARGW IGVQIQPVTD EIADSLGLKS SKGALVADAQ DNSPAKEAGI KSGDVILGVN GERVDGPRDL AKKVAALGPG KKADLLYWHD GAEKTVAVKL GSLPDEKEAA KPAALQDNSA LAGLGLKLAP ASSVQGAGND GVVVADIDPE GSAAQKGLRV GDLILEAGGR AVSKPSEIAA IIADAKKDGR KAVLLRVKSG EGTRFVAVAT NPAS
|
| |