Gene Oter_2994 details


Gene Information

Locus tag: Oter_2994
Symbol:
ID: 6204291
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Opitutus terrae PB90-1
Kingdom: Bacteria
Replicon accession: NC_010571
Strand:
Start bp: 3845305
End bp: 3846522
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 67%
IMG OID: 641692659
Product: peptidase C1A papain
Protein accession: YP_001819875
Protein GI: 182414809
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4870] Cysteine protease
TIGRFAM ID:
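
The coordinates and lengths in this record are mutually consistent, which a few lines of arithmetic can confirm. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3845305
end_bp = 3846522
gene_length_bp = 1218
protein_length_aa = 405

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length_bp // 3
residues = codons - 1

consistent = (
    span == gene_length_bp
    and gene_length_bp % 3 == 0
    and residues == protein_length_aa
)
print(span, codons, residues, consistent)
```

Under these assumptions the span is 1218 bp, which is 406 codons including the stop and therefore 405 residues, matching the listed protein length.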


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.766742
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.41345
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCCCG TGCCGAGTGA TCCAACAGAT GGAGCCCGCC GTCCCCGACG GGCTGCGCAC 
CCGCGGCTTG GCCGCGGCGC CTGCCTCGCC CTCGCACTCG CCTTGGCCGC CACGCCCCCG
GCTTTGGGCG CGCCCTCCTC CATTCCGGCA GGCACGCACT TCGACACGTT CACCGTCGGC
GCGGCGACCT ATCGCGACGT TAAGGTCCGC TCCGTGAACG CCCGCACCGT AATGATCACG
CATTCGCGCG GGATGACGTC GATCAAGCTG CGTGATCTTT CGCCGGAGTG GCAGCAGAAG
TTCGGCTACG ATCCCGCCAC GGAGGCCGCC GCCGAGCAGG CCATCGCCAC GCCGCCCAGC
CCGCCGCCAT CAAAACCGAC CGCGCGGCCG GCCGCGGGCG CGACGGGTGC GGTGACCGCC
AAGCTCGAGC GGCTGCTGCG GCAATTCGGC GAACCCGCGA CGATCAACGC GGAAGTCGAT
CTCCGCCCGA AGTATTTTCA GATGGAACTG GCGGTGAGGA GCCAGGGCCG CCGGCCGAGT
TGCGCCGTGT TCGCCATCGT CAGCGCGCTC GAATTCCAAG CCGCCGAGCT GACGGGTGAA
CCGAGTAAAC TTTCCGAGGA GTATCTCAGC TGGGCCACGC GCAAGACCGT CCAGCGGGTC
GTCACGCCGA TCGCGGCCAA CGCCGAGGGC GCGGAGAATT CCACCGACGC CGCCGGCAAT
GCCGACGAGG GGTTTTCGTT GAACGAGGTG GTGCTCGCGC TGCGCACCTA CGGCGTGCCG
CTGCAATCGT CGATGCCGAA CCGGTTCGGC CGCGCCATCT CCGAGATCGA AGACCCGGCG
CCCGCTATCG TGGATGAGGC GCGCACGCAT CAGCGCGTGT TCGTGCTGCC GATTCCCGGC
CGCAACACCG GCACGGCCGT CAACAATATC GTCCATGCGC TCAACGCCGG GATCCCGATT
CCGATCGGCG TGGAGTGGCC GCACTACCGC TCGATCCGCA CCGGCAGCCT GATCGACCAG
AAACCACTCG AGGACGGCGG GCATGCGGTG ACGCTCGTCG GTTATCGCTG CACGACCAAT
CGGCTCGAAG ATGTCGTCTT CATTTTCAAG AATTCGTGGG GTCCCGACTG GGGCCAGGGC
GGCTACGGGA CGGTGACCTA CGGTTATCTG AAGAAGCACC TGCACAGTGC GGTCCTGCTG
GAGGTGCAGC GCGGGTGA
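
The GC content field can be recomputed from a sequence displayed in space-separated blocks of ten, as above. The following is a minimal sketch in Python; the function name `gc_content` is hypothetical, and the example runs on the first display line only, so its result is not expected to equal the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks used only for display grouping.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First display line of the gene sequence in this record.
first_line = "ATGGAGCCCG TGCCGAGTGA TCCAACAGAT GGAGCCCGCC GTCCCCGACG GGCTGCGCAC"
print(f"{gc_content(first_line):.2%}")
```

Passing the full gene sequence, with all display lines concatenated, gives the value to compare against the GC content field.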
 
Protein sequence
MEPVPSDPTD GARRPRRAAH PRLGRGACLA LALALAATPP ALGAPSSIPA GTHFDTFTVG 
AATYRDVKVR SVNARTVMIT HSRGMTSIKL RDLSPEWQQK FGYDPATEAA AEQAIATPPS
PPPSKPTARP AAGATGAVTA KLERLLRQFG EPATINAEVD LRPKYFQMEL AVRSQGRRPS
CAVFAIVSAL EFQAAELTGE PSKLSEEYLS WATRKTVQRV VTPIAANAEG AENSTDAAGN
ADEGFSLNEV VLALRTYGVP LQSSMPNRFG RAISEIEDPA PAIVDEARTH QRVFVLPIPG
RNTGTAVNNI VHALNAGIPI PIGVEWPHYR SIRTGSLIDQ KPLEDGGHAV TLVGYRCTTN
RLEDVVFIFK NSWGPDWGQG GYGTVTYGYL KKHLHSAVLL EVQRG
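
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch in Python, not the database's own method. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code; the two differ in which codons may act as start codons, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and
# translation table 11, listed in the conventional TCAG base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Drop the spaces and line breaks used only for display grouping.
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        # Codons containing ambiguous bases are reported as X.
        aa = CODON_TABLE.get(bases[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence in this record.
print(translate("ATGGAGCCCG TGCCGAGTGA TCCAACAGAT"))
print(translate("GAGGTGCAGC GCGGGTGA"))
```

The two calls reproduce the first ten residues (MEPVPSDPTD) and the last five residues (EVQRG) of the protein sequence above. This sketch does not apply the table 11 rule that reads alternative start codons such as GTG or TTG as methionine at the initiation site.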