Gene Information |
Locus tag | Msil_0606 |
Symbol | |
ID | 7093684 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 656365 |
End bp | 657936 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643463938 |
Product | protease Do |
Protein accession | YP_002360940 |
Protein GI | 217976793 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
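
The coordinate and length fields in this record are mutually consistent, and the check can be sketched in a few lines of Python. The function names below (`gene_length`, `protein_length`, `gc_percent`) are hypothetical helpers written for illustration, not part of any database API. The sketch assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon. Whether the 64% GC figure in the record was computed over the gene alone is also an assumption.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    s = "".join(seq.split()).upper()
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


if __name__ == "__main__":
    length = gene_length(656365, 657936)
    print(length)                  # 1572, matching "Gene Length"
    print(protein_length(length))  # 523, matching "Protein Length"
```

Passing the full "Gene sequence" field to `gc_percent` gives a value that can be compared with the reported GC content.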
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.0210206 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCCTGT TCAAGACTGC GCGGCGCGGC GCGATTGCGT CTGCCGCCGG CGGGGCCGCT CCGCGAAACG CCTGCTCCCT GCTCGCCATT GCCGCCGTGC TCATGTTTGG AGCCGCGTCC GTCGCGCCCG GGCTCGGCGC GCCGCCGGCC TATGCGAAGG GACCGGACTC CCTAGCGGAT CTTGCCGCTG ACGTCAGCGA CGCAGTCGTC AACATCTCGG CGACGCAGAC GATGGACGAA AAGCGCTCGG GCGGGGCTCC GCAGCTTGAG CCGGGCACGC CCTTCGATGA TCTGTTCGAG GAGTTCTTCC GCCGCCGCCA GCAAGGCCAA GGCGGGCCGG ACCAACCCAC GCCGCGCGCG CCGCGCGAGC GCAAGTCGAA TTCGCTCGGC TCAGGCTTCG TCGTCGATCC GTCAGGCATT ATCATTACCA ATAATCACGT CATCGCCGAC GCCAATGACG TCACGGTGAT TTTTACCGAT GGACAGAAAC TGAAGGCCGA AGTCCTCGGC AAGGATTCGA AAGTCGACGT CGCCGTCCTG AAAGTGAAGC CCGACAAGCC GCTGAAGGCG GTCAAATTCG GCGACAGCGA CAAGATGCGC GTTGGCGATT GGGTCATCGC CGTCGGCAAT CCGTTCGGCC TTGGCGGCAC TGTCACCGCC GGAATCATTT CCGCCCTGAA ACGGAACATC GATTCCGGCC CCTATGACAA TTATTTCCAG ACGGACGCCG CGATCAACAA GGGCAATTCG GGCGGACCGC TCTTCAACAT GGCCGGCGAG GTCGTCGGCA TCAACACCGC GATCCTCTCG CCCTCGGGCG GATCGATCGG TATCGGCTTC TCGACGCCTG CTGCGACTGT GACGCCGGTC ATCGATCAGC TGCAGAAATT TGGCGAGACG CGGCGCGGAT GGCTCGGCGT CCGGATCCAG AACGTCGACG ACACGATCGC AGAAACGCTG AACCTTGGCT CGGTGCGGGG CGCGCTGGTC GCCGGCGCGG ACGACAAGGG ACCGGCTAAG GCAGCCGGGA TCGAGGCTGG CGACGTCATC TTGAAATTCG ACGGCGTGCC GATCAAGGAA TCCCACGACC TGCCCAAGAT CGTCGCCTCC GCGCCAGTCG GCAAGGATGT CGAAGTGGTG CTGCTGCGGC AAGGCAAGGA GATCACTAAG ACGATCAAGC TCGGCCGGCT CGAGGACAAT GAAAAGCAAA AGGCCGCCTT GACCGTCAGG CCCGGCGACG ACGACAAGCC GCCGGCGGCC AACGCCTCGA TGGAGCGCGC GCTCGGCATG GCCTTCTCAG GGCTTAATGA CGGCGCGCGC CGGAAATATT CGATCAAACA GAGCGTCGCC GCCGGCGTCA TTGTGACCGA TGTCGAGCCG GATTCGGGCG CCGCGGAAAA GCACATCCAA CCCGGCGACG TGATCATGGA GATCAATCAG GAGCCGGTGA AGGAGCCGGC CGACGTCGCC AAGAAAGTCG CCAAGCTGAA GGACGACGGC AAGAAGTCGG CGCTGCTTCT CGTCGCCAAT GGCCAGGGAG AAATGCGCTT TGTCGCGCTT CCCTTCCCAT AA
|
Protein sequence | MGLFKTARRG AIASAAGGAA PRNACSLLAI AAVLMFGAAS VAPGLGAPPA YAKGPDSLAD LAADVSDAVV NISATQTMDE KRSGGAPQLE PGTPFDDLFE EFFRRRQQGQ GGPDQPTPRA PRERKSNSLG SGFVVDPSGI IITNNHVIAD ANDVTVIFTD GQKLKAEVLG KDSKVDVAVL KVKPDKPLKA VKFGDSDKMR VGDWVIAVGN PFGLGGTVTA GIISALKRNI DSGPYDNYFQ TDAAINKGNS GGPLFNMAGE VVGINTAILS PSGGSIGIGF STPAATVTPV IDQLQKFGET RRGWLGVRIQ NVDDTIAETL NLGSVRGALV AGADDKGPAK AAGIEAGDVI LKFDGVPIKE SHDLPKIVAS APVGKDVEVV LLRQGKEITK TIKLGRLEDN EKQKAALTVR PGDDDKPPAA NASMERALGM AFSGLNDGAR RKYSIKQSVA AGVIVTDVEP DSGAAEKHIQ PGDVIMEINQ EPVKEPADVA KKVAKLKDDG KKSALLLVAN GQGEMRFVAL PFP
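
The protein sequence above can be reproduced from the gene sequence by joining the 10-character blocks and translating codon by codon. The following is a minimal sketch, not the database's own pipeline. `join_blocks` and `translate_cds` are hypothetical names. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the set of permitted start codons. This gene starts with ATG, so the sketch does not handle alternative start codons, and it does not check for internal stop codons.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11 uses
# the same codon-to-amino-acid assignments).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(text.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS in frame 1 and drop one trailing stop codon if present."""
    seq = join_blocks(dna)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    print(translate_cds("ATGGGCCTGT TCAAGACTGC GCGGCGCGGC"))  # MGLFKTARRG
```

Applied to the full "Gene sequence" field, the result can be compared with the "Protein sequence" field after the same block-joining step.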