Gene Information |
Locus tag | Msil_1336 |
Symbol | |
ID | 7091674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 1439605 |
End bp | 1441206 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643464674 |
Product | peptidase S10 serine carboxypeptidase |
Protein accession | YP_002361663 |
Protein GI | 217977516 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
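The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: the function name `cds_lengths` is hypothetical, and it assumes 1-based inclusive coordinates, an unspliced CDS, and a terminal stop codon that is not counted in the protein length.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1          # inclusive span on the replicon
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1           # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(1439605, 1441206)
print(gene_len, protein_len)
```

For this record the result is 1602 bp and 533 aa, which matches the Gene Length and Protein Length fields.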
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATCCGCA CCGTTTCGGC AGCGCTTCGG CGAGCCGCCG CGCCTGCCGC ACTGGCTATC GCCCTTGTGA CAGCCTCGCT CACCGCGGCA CGCCTTTTCA CAGCGCCGTT CCTGGCGAGC GCCGCGGCTG AAGAAGCCGC GCCTGCGCCG AGCGCGGAGG CGGAATCCGC CCCGGCCGCT GCGTCAAAAC CCGCAACCGC GGCGCCGACG ACGCCGCCTC ACCCCCTCCC GCCGCCGGCG ACGACGCGGC AAAGCGTCGA CCTGCCGGGC CGCGCCCTGC GCTTCAAGGC GGTCGCCGGT GCGATCCGCC TCAGCGATGC GCAGAGCGGC GAGGCGCAGG CCGACGTCGC CACTGTCGCC TACAGCCTCG AAGGTCAGGA CGCGGCCAAA CGTCCCTTGG TCTTCGTCGT CAACGGCGGA CCCGGCGCGG CCTCCGCCTG GCTCAACCTC GGCGCCCTCG GTCCATGGCG GCTCGATCTC GGCAAGGATC CGGTCTCGCC CTCCGCCCCG GCCGCGCTCG TCGGCAATGC CGAGACCTGG CTCGATTTCG CCGATCTGGT GTTCATCGAT CCGCCGCTCA CTGGCTATAG CCGCATCCTC GCCAAGGGCG ACGGCGCGCG GCGTCAGTTA CTGTCGGTCG ACGGCGACAT CGAGGCGCTC GCCGTTGTCA TCCGCAAATG GCTGACGTCC AATCAAAGGC TCGAAAGCCC AAAATTCATC GTCGGCGAAA GCTACGGCGG TTTTCGCGCG CCAAAACTCA CCCGGCGCCT TCAGGAAAAC GAAGGCGTCG GGATCAAGGG GATTATTCTG ATCTCTCCGG TTATCGATTT CAGCTGGTTC GAAGCGGCCA ACAGCCCTTT GCCGGTGATG ACCCAATTGC CCTCGCTTAC TGCCGCGGCG CGGGGCCTGA GGCCCGCGGA CGCCAAGGCG CTTGACGAGG TCGAGAGTTT TGCCTCCGGG CCTTATCTGA CCGACCTCTT GCGCTCGGAA CGCGACCCCG CCGCCCTGAC CCGCATCGTC GATGGGGTCG CGCGCCTGAC CGGCCTCGAA CCCGCCTTTG TCCGCCGCCT GGGCGGCCGG GTGGACCCGT CGAGCTTTGC GCGCGAGGCG GGCCGCGCGG AAGCGAAAAT CTTCAGCCGC TACGACATGA CGATAGCCGG CTTCGATCCC TCGCCGCACG CCGCCGACAC GAATTTTTCC GATCCGGTGC TCGATGCGAC GAAAACGCCT TTCGCCAGCG CTATGGCCAA CCTCACGGCG ACAAAGCTCG GCTGGCCGGT CGATGCGCGC TACGAGATCC TGAACGAATC CGTGAGCCGC CAGTGGAATT GGGACGGCGG ACGCGGCAAG AACCAGTCCT TGAGCGACCT CAGCCAGGCG CTCGCGATCG ATGCGGAGTT TCGCGTGCTG ATTGTGCATG GGCTGACCGA TCTGGTGACG CCCTATTTCG CCTCAAAGCT GCTGATCGGC CAGATCCCGC CCTTCGGCGA TCCCGGCCGC GTCGCGCTCA AAATCTATGA AGGCGGCCAC ATGCCCTGGC TCCGCGAAAG CGGCCGGGCC GCCCTGCGCG ACGACGCGCG CAAATTGATC GAGGGGAAAT AG
|
Protein sequence | MIRTVSAALR RAAAPAALAI ALVTASLTAA RLFTAPFLAS AAAEEAAPAP SAEAESAPAA ASKPATAAPT TPPHPLPPPA TTRQSVDLPG RALRFKAVAG AIRLSDAQSG EAQADVATVA YSLEGQDAAK RPLVFVVNGG PGAASAWLNL GALGPWRLDL GKDPVSPSAP AALVGNAETW LDFADLVFID PPLTGYSRIL AKGDGARRQL LSVDGDIEAL AVVIRKWLTS NQRLESPKFI VGESYGGFRA PKLTRRLQEN EGVGIKGIIL ISPVIDFSWF EAANSPLPVM TQLPSLTAAA RGLRPADAKA LDEVESFASG PYLTDLLRSE RDPAALTRIV DGVARLTGLE PAFVRRLGGR VDPSSFAREA GRAEAKIFSR YDMTIAGFDP SPHAADTNFS DPVLDATKTP FASAMANLTA TKLGWPVDAR YEILNESVSR QWNWDGGRGK NQSLSDLSQA LAIDAEFRVL IVHGLTDLVT PYFASKLLIG QIPPFGDPGR VALKIYEGGH MPWLRESGRA ALRDDARKLI EGK
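The gene and protein sequences above are related through translation table 11, under which an initiating TTG is read as methionine. The sketch below shows one way to reproduce that relationship and to compute a GC percentage from the space-separated sequence blocks. It is a minimal illustration, not the database's own pipeline: the helper names (`clean`, `gc_percent`, `translate_cds`) are hypothetical, and the start-codon set is the one NCBI lists for table 11.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G);
# table 11 uses the same assignments and differs only in its start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}  # table 11


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")               # initiator codon is read as Met
            continue
        aa = CODON_TABLE.get(codon, "X")      # X for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("TTGATCCGCA CCGTTTCGGC AGCGCTTCGG"))  # MIRTVSAALR
```

Applied to the first 30 bases of the gene sequence, the sketch returns MIRTVSAALR, the first block of the protein sequence. Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the GC content field.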