Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0109 |
Symbol | |
ID | 6373753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 102503 |
End bp | 103429 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642682626 |
Product | peptidase C1A papain |
Protein accession | YP_001958573 |
Protein GI | 189499103 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4870] Cysteine protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.886044 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATACCGA AACATGTATG CCTGCAACTC TGCGGCTCAA CGCAATCCTT TGGAACGGGA TGGCTTTCTC CGATGCCTGA TTTACGAGAC TATACTGTAG ATACTCCGGA AATTGTCGAT ATGACAAAAA AACTGAAGCT GAGGCAGAGC GAAAAAGCGT TGAAATCTGC TGTGCCGTAT TCCGTGGATT TGAGAGGGTG GTGTTCAGAA GTCGAGAATC AGGGGAAAAT CGGTTCCTGT ACCGCGCATG CCGCCATGGG CGTCGTTGAG TATCTGCAAC GCAGAGCCTT TGATGAGCAT ATCGATGGGT CAAGGCTTTT TGTGTATAAA GCTACCCGCA ACCTTATGCA TGCAACCGGA GATACCGGAG CCTGGCTGAG AAATACCATG GGTGCGCTTG TATTGTGCGG TGTCCCGCAT GAGCAGTACT GGGAGTATAC GGATGTCGAT CCCGATTATG ACAAGGAGCC GACGGGCTTT GTCTATGCAG TGGCTGATAA TTTCGAGGCG TTACGATATT TTTGTCATGA TCCTCAGAGT GGAAATATGG ACAAAAGAGC GGTGCTTGAA AGCGTGAAAA GGTTTCTGGC TGCGGGTATC CCGTCCATGT TCGGCTTTTT CGGTTTTCCT TCGTTCAACA GCTCGGATGA CAAAGGATGT ATTCCTTTTC CCTGTGGCAA TGAGAAGGCT GAGTGGGGAC ACGCCATCGT CGCGGTCGGG TTTGATGATA AAAAAGAGAT AATCAACACG TCCTGTAAGA AGAGCAAGAC AAAAGGGGCT CTGTTGATCA GGAACTCATG GGGGACGGAC TGGGGAGACA ATGGATATGG CTGGCTGCCT TACGAGTATA TCCTGCAGGG GCTCGCTGTC GATTTCTGGT CGCTTCTGAG TATGGATATG GTTGATACCA AGCAGTTTGG ACTGTAA
|
Protein sequence | MIPKHVCLQL CGSTQSFGTG WLSPMPDLRD YTVDTPEIVD MTKKLKLRQS EKALKSAVPY SVDLRGWCSE VENQGKIGSC TAHAAMGVVE YLQRRAFDEH IDGSRLFVYK ATRNLMHATG DTGAWLRNTM GALVLCGVPH EQYWEYTDVD PDYDKEPTGF VYAVADNFEA LRYFCHDPQS GNMDKRAVLE SVKRFLAAGI PSMFGFFGFP SFNSSDDKGC IPFPCGNEKA EWGHAIVAVG FDDKKEIINT SCKKSKTKGA LLIRNSWGTD WGDNGYGWLP YEYILQGLAV DFWSLLSMDM VDTKQFGL
|
| |