Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_3227 |
Symbol | |
ID | 7090642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 3540566 |
End bp | 3541906 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643466535 |
Product | protease-like protein |
Protein accession | YP_002363496 |
Protein GI | 217979349 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.000167214 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAGAGGG CGACAATAGC ATCCGTTGCG GCGTTCGCCG TCGCGCTTTC TTATGGGGCG CAGGCCTTGG CGCAAGAGGT CGAGGCGCCA GCCACAGGGC GCTTCATTCA TGCGCCGAAA GCGTCAGTGA CGACCCCTTC CTCGAGCGTC GCCAAACCCG CCGATGCAGG CAAGGCGGCG CATACGAATA CAAAATTCAT AGGCCCGAAT GGTCTTGCTC CGCCGAACGC GGCCGGGCCG CGATCCGGGG CTGCGCCGGC CGGAAGCCCG CCTTACGCGG ACTATGGGTA TGAAACGCCT GCTTCGCTCG CCTGTCTCTA CGGGCTGGTC GCCGCTTCCC CCGGATGCAA TCCGAACGTC GCCTCCGCCG TGCCGACCGC CAAGGGCTCC AAGGCGATAG CTTTGGTGGA CGCCTATGAT TACCCGACCG CCCTCAGCGA TCTGCAAACC TTCAGCGTTC AGTTTGGCCT TCCCTTGCCC AATCTGATCG TGAAATACGC TACGGCGGGA GGCGCGTGCA ACGGACCAAA GCCGGCGAAT GATCCGGGAT GGGAGGGCGA GGAAGCGCTC GACGTTCAAA TGGCGCACGC CATGGCGCCG CAGGCGACCC TTTATCTCGT CGAGGCGCAA GACAACTCCA ACGCCAATTT GGCGGGGGCG ATCGTTTGCG CCAACAGCCT GCTTCAGGCG AGCGGGGGCG GTGAAGTCTC GATGAGCTGG GGCGGAAGCG AAGTCTCAAC CTACGAAAGC GCCTTCAGCG CAAACAACGT CGTCTATTTC GCGTCGAGCG GCGACGCCCC GGGGCCAAGC TGGCCATCGA CCTCGCCGAA TGTCGTGTCG GTTGGCGGAA CGAGCATCGC CCGCGACCCG CAAACCTATA AATTCCTGCA TTATGCAAGC TGGAGCGAGG CGGGCGGCGG CGCGAGCTTG ATTTTTCCGC GCCCGGCCTA CCAAAGCGGC CCCGGAATAG CGGGAACCGC CCGGCTGACG CCTGACATCT CGGCTGTCGC CAACCCCGCC ACCGGCGTGT GGGTCTATGA CAGCAATCCC TCTTTCGGCG CAGGCTGGTA TGTGTTCGGC GGCACCAGCG TCGCTGCGCC GCTGGTGGCG GCGATCACCA ACGCGCATAA TAATTTCCGC GCCAACACGG CGACCGAGCT GACCGCGATC TACAAGGCGA AAAAGGCGAG CGCCAAAGCC TTCGCGACGG CCACCATCGG CTATTGCGGC CCTTATGCGG CGTCTCAACC TACCGCGAAG TGGAACATCT GCCTCGGGGT CGGGACGATC AAGGGAACGG GAACCGCAAA TGTCCTGCCG GTGGTGGATG CGGAGCAATA G
|
Protein sequence | MKRATIASVA AFAVALSYGA QALAQEVEAP ATGRFIHAPK ASVTTPSSSV AKPADAGKAA HTNTKFIGPN GLAPPNAAGP RSGAAPAGSP PYADYGYETP ASLACLYGLV AASPGCNPNV ASAVPTAKGS KAIALVDAYD YPTALSDLQT FSVQFGLPLP NLIVKYATAG GACNGPKPAN DPGWEGEEAL DVQMAHAMAP QATLYLVEAQ DNSNANLAGA IVCANSLLQA SGGGEVSMSW GGSEVSTYES AFSANNVVYF ASSGDAPGPS WPSTSPNVVS VGGTSIARDP QTYKFLHYAS WSEAGGGASL IFPRPAYQSG PGIAGTARLT PDISAVANPA TGVWVYDSNP SFGAGWYVFG GTSVAAPLVA AITNAHNNFR ANTATELTAI YKAKKASAKA FATATIGYCG PYAASQPTAK WNICLGVGTI KGTGTANVLP VVDAEQ
|
| |