Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Oter_2994 |
Symbol | |
ID | 6204291 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Opitutus terrae PB90-1 |
Kingdom | Bacteria |
Replicon accession | NC_010571 |
Strand | - |
Start bp | 3845305 |
End bp | 3846522 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641692659 |
Product | peptidase C1A papain |
Protein accession | YP_001819875 |
Protein GI | 182414809 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4870] Cysteine protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.766742 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.41345 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCCCG TGCCGAGTGA TCCAACAGAT GGAGCCCGCC GTCCCCGACG GGCTGCGCAC CCGCGGCTTG GCCGCGGCGC CTGCCTCGCC CTCGCACTCG CCTTGGCCGC CACGCCCCCG GCTTTGGGCG CGCCCTCCTC CATTCCGGCA GGCACGCACT TCGACACGTT CACCGTCGGC GCGGCGACCT ATCGCGACGT TAAGGTCCGC TCCGTGAACG CCCGCACCGT AATGATCACG CATTCGCGCG GGATGACGTC GATCAAGCTG CGTGATCTTT CGCCGGAGTG GCAGCAGAAG TTCGGCTACG ATCCCGCCAC GGAGGCCGCC GCCGAGCAGG CCATCGCCAC GCCGCCCAGC CCGCCGCCAT CAAAACCGAC CGCGCGGCCG GCCGCGGGCG CGACGGGTGC GGTGACCGCC AAGCTCGAGC GGCTGCTGCG GCAATTCGGC GAACCCGCGA CGATCAACGC GGAAGTCGAT CTCCGCCCGA AGTATTTTCA GATGGAACTG GCGGTGAGGA GCCAGGGCCG CCGGCCGAGT TGCGCCGTGT TCGCCATCGT CAGCGCGCTC GAATTCCAAG CCGCCGAGCT GACGGGTGAA CCGAGTAAAC TTTCCGAGGA GTATCTCAGC TGGGCCACGC GCAAGACCGT CCAGCGGGTC GTCACGCCGA TCGCGGCCAA CGCCGAGGGC GCGGAGAATT CCACCGACGC CGCCGGCAAT GCCGACGAGG GGTTTTCGTT GAACGAGGTG GTGCTCGCGC TGCGCACCTA CGGCGTGCCG CTGCAATCGT CGATGCCGAA CCGGTTCGGC CGCGCCATCT CCGAGATCGA AGACCCGGCG CCCGCTATCG TGGATGAGGC GCGCACGCAT CAGCGCGTGT TCGTGCTGCC GATTCCCGGC CGCAACACCG GCACGGCCGT CAACAATATC GTCCATGCGC TCAACGCCGG GATCCCGATT CCGATCGGCG TGGAGTGGCC GCACTACCGC TCGATCCGCA CCGGCAGCCT GATCGACCAG AAACCACTCG AGGACGGCGG GCATGCGGTG ACGCTCGTCG GTTATCGCTG CACGACCAAT CGGCTCGAAG ATGTCGTCTT CATTTTCAAG AATTCGTGGG GTCCCGACTG GGGCCAGGGC GGCTACGGGA CGGTGACCTA CGGTTATCTG AAGAAGCACC TGCACAGTGC GGTCCTGCTG GAGGTGCAGC GCGGGTGA
|
Protein sequence | MEPVPSDPTD GARRPRRAAH PRLGRGACLA LALALAATPP ALGAPSSIPA GTHFDTFTVG AATYRDVKVR SVNARTVMIT HSRGMTSIKL RDLSPEWQQK FGYDPATEAA AEQAIATPPS PPPSKPTARP AAGATGAVTA KLERLLRQFG EPATINAEVD LRPKYFQMEL AVRSQGRRPS CAVFAIVSAL EFQAAELTGE PSKLSEEYLS WATRKTVQRV VTPIAANAEG AENSTDAAGN ADEGFSLNEV VLALRTYGVP LQSSMPNRFG RAISEIEDPA PAIVDEARTH QRVFVLPIPG RNTGTAVNNI VHALNAGIPI PIGVEWPHYR SIRTGSLIDQ KPLEDGGHAV TLVGYRCTTN RLEDVVFIFK NSWGPDWGQG GYGTVTYGYL KKHLHSAVLL EVQRG
|
| |