Gene Information |
Locus tag | Cpha266_2670 |
Symbol | |
ID | 4568774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 3063175 |
End bp | 3064098 |
Gene Length | 924 bp |
Protein Length | 307 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639767236 |
Product | peptidase C1A, papain |
Protein accession | YP_913078 |
Protein GI | 119358434 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4870] Cysteine protease |
TIGRFAM ID | |
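The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3063175
end_bp = 3064098
gene_length_bp = 924
protein_length_aa = 307


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # True True
```

Under those assumptions the three fields agree: 3064098 - 3063175 + 1 = 924 bp, and 924 / 3 - 1 = 307 aa.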
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00279233 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTCCCA TGCGTTCATT TCTTGAACTC AAATGCATTA ATCGCCCTGT AGGCACCGGC TGGCTTCCCC CTCTGCCGGA TCTCAGAGAC TATAACGCCG AAACGCCTGA AATTATCGAA CTGGCTTCGA AACTCGGCAT TCCTCAAACC GCAAAAACGC TCAAATCGGC ATTGCCGGCC CAGGTCGATC TGCGCCAGTG GTGTTCTCCG ATCGAAAACC AGGGGCTTCT GGGTTCCTGC ACGGCCCAGT CGGCCGTCGG GGTCATCGAA TATTTTCAGT GCCGCGCATT TGGCAAGTAT CTCGACGCAT CGGCGCTCTT TCTCTACAAA GCGACGCGCA ACCTCATGGG CGTAACCGGC GACACCGGAG CATGGTTGCG CAATACCGTC GGCGCCGTTG CGCTGTGCGG TGTTCCGCCT GAAAAATACT GGAAATACAC CGACCAGGAT CCTGATTTCG ATAATGAACC CGGCGGATTC ATCTACGCCG TTGCCGACAA TTTCGAAGCG CTCAAATACT TCTGTCACGA CCCTCTCGGC GCAAAAAAAC CGGCGAACCT CGTGCTTCAA AGCGTAAAAA AATACCTTGC GGCCGGAATA CCCTCCATGT GCGGATTTTA CGGATTCAAC TCCTTTGAAC AATCGGATAA CAAAGGAGCC ATACCCTACC CCTGCCCCGA CGAGCATGCG TCCTGGGGAC ACGCTATTAT GGTTGTCGGC TACGACGACG AAAAAAAAGT GGTCAACACT GCCTGCGGAA AAGCCACGAC GGGAGCTCTG CGCATCCGAA ACTCGTGGGG AACCGGATGG GGGGAGGAGG GGTACGGATG GCTTCCCTAC GAATACGTCC TCAACGGCCT TGCTCTCGAT TTCTGGTCGA TCATCAACAT GGAGTGGGTC GATACCCGGC AGTTCGGATA TTAA
|
Protein sequence | MFPMRSFLEL KCINRPVGTG WLPPLPDLRD YNAETPEIIE LASKLGIPQT AKTLKSALPA QVDLRQWCSP IENQGLLGSC TAQSAVGVIE YFQCRAFGKY LDASALFLYK ATRNLMGVTG DTGAWLRNTV GAVALCGVPP EKYWKYTDQD PDFDNEPGGF IYAVADNFEA LKYFCHDPLG AKKPANLVLQ SVKKYLAAGI PSMCGFYGFN SFEQSDNKGA IPYPCPDEHA SWGHAIMVVG YDDEKKVVNT ACGKATTGAL RIRNSWGTGW GEEGYGWLPY EYVLNGLALD FWSIINMEWV DTRQFGY
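The sequences above are displayed in space-separated blocks of ten characters, so they need normalising before any downstream use. The following is a minimal sketch, with hypothetical helper names, that strips the block spacing, computes a GC fraction, and splits a CDS into codons. It is not the method the database used to derive the GC content field, and only the first two blocks of the gene sequence are used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def codons(seq: str) -> list[str]:
    """Split a cleaned CDS into complete codons, dropping any trailing bases."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# First two blocks of the gene sequence, copied from the record above.
prefix = clean_sequence("ATGTTTCCCA TGCGTTCATT")
print(len(prefix), gc_fraction(prefix))
print(codons(prefix))
```

Applying `clean_sequence` and `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content field; the record reports a rounded percentage, so an exact match should not be expected.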
|
| |