Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_6088 |
Symbol | |
ID | 5320390 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 1022631 |
End bp | 1023818 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640777731 |
Product | peptidase C1A papain |
Protein accession | YP_001314663 |
Protein GI | 150378068 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4870] Cysteine protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCCTA CAATGCCGTT CTATAACTTT CTGCCGGAGG ATTTCAAGGT CTATGTCGGC GCGGATGGAG GTGTGTCTTA CATGCCCGGC CCGCAGCGGC AGGAGCGCAC TTTGCCGACG ATCAATCGTT ACGGGGGGCC TGACGGTGGC TATGTCGCGG TCTGTTCGCG TGTGGCGGAT CATGCTGTCT ATTCCGTTGG CGACGGGATC TATGTGGTTG GCCAGATCAG GCTGCAAGGT GCCTATGAGG GGCGCTTTTT CGTGCCGAAG GGCTATGGAG GAAAGTGCAT CAGCGCTGCT CCGGACATAA AGGCGATATG CGATCAAGCC TTTCCCGGTA GCGCACCCAA CTGGGCGAGC GGCGATGCCG GCGGTTGGTT TGGTCTGCTG ATCGATAAGC GGGACGTTCG CGATAAGATC CTCACGTTGG ATTTGCCCCA GCAATTGCCG GAAAAGGTTG ATTTGCGACG ATGGTGCTCG CAGGTCCAAA ATCAGGGCAC CTTGAACGCC TGCACCGCCT ACGCCGCCTC GGCCATCCTG GAATATTTCG AAAATCGCGC CGGCGGAAAC GCAGTGTCGC TGTCGGAAAT TTTCCTTTAC AAGGTCACGC GCAACCTGAT GCACAAGACG GGCGACACGG GAGCCAATAC GCGATCCGTC ATGAAGGCTT TGGCAGCCTT AGGCACGGTG CCGGAAGAAT ATTGGCCGGA TGACGCATCC CAATTCGATG CTGAACCGTC TGCCTTTGCC TATGCGCTGG CAGGTCGTTA TCGCTCTCTC AAGTACAGCA GGATTGATGC GAAGGACCGT TCAAAGGACG TGGTGTTGCG CCAGGCCAAG ACATTGCTCA GTCGGAACCG TCCGGTCATG TTTGGCGTCA TGGCGTATTT CGGCACTTGG CAGCAATTCG TGACGTCCGA TCGCCTGCCT TACCCGAGTG AGGATGACAC GCTTTTTGGC GCCCATAACA TTGCCGTCAT GGGCTATGAC GATGGCATCA CAACGGAAAA TGCCAAGAAC CCCGGCATCA AGACGAGGGG CGCCTTCCTC ATTAAAAACT CCTATGGCGA GGAGTGGGGT GACAAGGGCT ACGGCTGGAT TCCTTACGAT TACCTACTGA AACATCAGTC GATTGACTGG TGGACGATCA CCAAGCAGGA ATGGCTCGAC ATGAGCGTGT TCAGCTAA
|
Protein sequence | MQPTMPFYNF LPEDFKVYVG ADGGVSYMPG PQRQERTLPT INRYGGPDGG YVAVCSRVAD HAVYSVGDGI YVVGQIRLQG AYEGRFFVPK GYGGKCISAA PDIKAICDQA FPGSAPNWAS GDAGGWFGLL IDKRDVRDKI LTLDLPQQLP EKVDLRRWCS QVQNQGTLNA CTAYAASAIL EYFENRAGGN AVSLSEIFLY KVTRNLMHKT GDTGANTRSV MKALAALGTV PEEYWPDDAS QFDAEPSAFA YALAGRYRSL KYSRIDAKDR SKDVVLRQAK TLLSRNRPVM FGVMAYFGTW QQFVTSDRLP YPSEDDTLFG AHNIAVMGYD DGITTENAKN PGIKTRGAFL IKNSYGEEWG DKGYGWIPYD YLLKHQSIDW WTITKQEWLD MSVFS
|
| |