Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0842 |
Symbol | |
ID | 4076017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 888359 |
End bp | 889966 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638006140 |
Product | peptidase C1A, papain |
Protein accession | YP_612837 |
Protein GI | 99080683 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4870] Cysteine protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.503051 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCAAAGT CCGCACCGCT TGCGCGGGGC GATTATCTGA GCGCCCCACC TCAGCTGTCG CTCAAACGAT TTGTACCCCC GGTGGGCGAT CAGGGCCAGC AGGGCAGCTG TGTGGGCTGG GCCACCGCTT ATGCGGCGCG CACCTTGCTC AAGGCCAAGG ATCTCGAGGT TGAGAACACT GATCGGCTGC GCGACCTGGT GCTGTCGCCG TCTTATGTGT TCAACCAGAT CCACCAGCCT GGCTGCAACG GGTCTTATGT CGCAGAGGCG TTGACGCTGA TGCAGCGGCA GGGCGTGTCG CTCTTGCGCG ACTTTCCCTA TGATCAATAT TCCTGCACGG CACAGCCTTC GGCGAGCCTG CGCGACAAAG CGAGCCAGTT CCGGATCAAA GGCTATTCAC GCCTCTGGGG GGGCTATGGC CGCAACAAGC ACGTCGCAAC CCGGCGCGCG CTGGCCAATG GCAATCCGGT TGTGATCGTC ATGGGGGTGG GCGACGGGTT CATGCGCCAC AGCGGCAGTG GCATCTGGGA GCCAACCTCT ACTGAATGGA GTGAGCTGCG CAGCAACACG CTGGGGGCGC ATGCGATGAC CGTGGTGGGG TATGATGACA CCCGCGGTGG CGGCGCTTTC GAAGTGGTCA ATTCCTGGAG TGCGGGATGG GGCAACCGCG GCTATTTCTG GATTTCATAC GAGGATTTCA ACGCTTTCGT CTATGAGGGC TATGAGGTGC TGCCGCCCGA TCCACCTCCG CCGCCTCGGG TGGTCGATAT GGCGGGCAGC GCACGGGTGC TGCATCTCTC GGGCAATGAG CTGGACGTGA CGCGCAGCGA AGGCGGTTAC AAAATCCGCA AACCCCTGCC ATCGGGCACG CGCTTTCGGG TCGAGGCCTC AAGCAAATTC AACGGCGCAC TATATGTGAT CGGGGGCGAT TCCAGCGGCG ACTATGTGAG CCTGTTTCCC CGCGGCGACC GGGTGACGCC CTATACCCAT GGCGGAACCA CGATGCTGTT GCCGGGGCCG ACAGAGCAGC ATTTCACCCG GCTCAATGAC ACGGTTGGAA CCGATTACTA TGTCTTGCTT TACGCGCAGG AGCCGCTCGA TCCGGAAACC ATTGCGATGC GTATGGCGCG CGGATCCGGC AGCGTGGAGG CACGGCTGCG CAGTGCGCTT GGCAACCGTC TCGTGCCACA AGATGAGATG GAGTTGCTCG CGTCAGGCAT CGGTTTCGAA GCCGCAAGTG GAGAGGCGGA TGTGGCGGCT CTGGTCCTGA GCATCGACCA CATTGCCCCG GATCCCGCGC AAGCCGATCG CGAGGCGCCA CTGATTGTGC TCACCAACCC GGCGCCCGAG GCCTTTGACA GCGCGGATGC GGTGATCCCC GTGCAAAGCC GTCTTTTCCG CCTCGAGGGC ATGGCGCAGG ATGAAAGCGA GATTGCTTCG CTGCGTGTCA ACGGATCTCT GAGCAGCCGC TATTCCTCGC GCGGACCGTT TCGCGCCGAG ATCGAGCTCC CCGAAGGGCC CGGTCCCCAT TCCATCGAAA TTGAAACACG CGACGCAGCG GGCAATGCCG CGCGCCGGAG TTTTCAATTC AGCCTGACTT TCAACTGA
|
Protein sequence | MPKSAPLARG DYLSAPPQLS LKRFVPPVGD QGQQGSCVGW ATAYAARTLL KAKDLEVENT DRLRDLVLSP SYVFNQIHQP GCNGSYVAEA LTLMQRQGVS LLRDFPYDQY SCTAQPSASL RDKASQFRIK GYSRLWGGYG RNKHVATRRA LANGNPVVIV MGVGDGFMRH SGSGIWEPTS TEWSELRSNT LGAHAMTVVG YDDTRGGGAF EVVNSWSAGW GNRGYFWISY EDFNAFVYEG YEVLPPDPPP PPRVVDMAGS ARVLHLSGNE LDVTRSEGGY KIRKPLPSGT RFRVEASSKF NGALYVIGGD SSGDYVSLFP RGDRVTPYTH GGTTMLLPGP TEQHFTRLND TVGTDYYVLL YAQEPLDPET IAMRMARGSG SVEARLRSAL GNRLVPQDEM ELLASGIGFE AASGEADVAA LVLSIDHIAP DPAQADREAP LIVLTNPAPE AFDSADAVIP VQSRLFRLEG MAQDESEIAS LRVNGSLSSR YSSRGPFRAE IELPEGPGPH SIEIETRDAA GNAARRSFQF SLTFN
|
| |