Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dbac_1250 |
Symbol | |
ID | 8376915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfomicrobium baculatum DSM 4028 |
Kingdom | Bacteria |
Replicon accession | NC_013173 |
Strand | + |
Start bp | 1378391 |
End bp | 1379824 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 645000483 |
Product | peptidase C1A papain |
Protein accession | YP_003157769 |
Protein GI | 256829041 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4870] Cysteine protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0216738 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGATA TTCGCTCACT ACCGGGTGTT CCCAGTGATA TCTTGATGGG CTTGGGGCTG CTTGGTTTAA AGACCGTGGA GGGGTTTCTT TCGCTCATGC AGATCCATGA AAGCCGCGAG GCTTTCAAGA AAATGGTCGG TTTGAGCGAA AGTGCACTGG GGCGCAGCAT TCAGCAGCTT CGCGAGCATC TTCCGAAAAG CTGCGGGGCC ATGCACCCAT CGCCTGTAAT GCCGGTTCAG CTTGGTTTGA GCTGGGGCCA TGTTGAGGCG TCATATCCTG CAAACTATTT TGGATCGATG GCCCATGATT CTTCCGAGGA AGAAGTCGAT CTCATTCCTC AGTTGCCACC TGTTCGTAAT CAGGGGCGGC GCGGAACTTG TGTCGCTTTT GCCGCTACGG CCGCCTTCGA ACATGAAATG CGCAGGAATG GCCTGTTCGG AATAGGAAAT GTCCTCGCGT TCACGCGTAC GAGAAGCGGG CGCAAAATTT TCCGTAGATT TGATGATGAA ACCAAGATGC TTTCGCCGCA GTTTTTGTAC TGGGCCTGCA AAAGCGCGGA CGGGGTTGAT GGCCCAGGGA CGATGATCTC AACAGCCATG GAATGCTTGC ATGATCGTGG ATGTTGTCCG GAAAAAGACT GGCCATATTC CCCTGATTCG CAGTGGGGAA ATGAAGGACA GGGGCCACCG CCGTTTGGCG CCGAGGAATC TGCGAGGAAG CGACGTATTG CGCAATATGA TGAACTGACG AGCTTCGGCT CCGTCCCCCT TCAGCATATG AAGCGATTGC TGGCACAAGG TCATTTGCTG GTATTTGGCG TGCCCGTGTT TCCAAGCTGG GGAAATATCG AAACCCAGCA GACAGGACGA GTGATCATGC CCATCTCCGG AGAGCAATCT CTCGGTGGCC ATGCGCTGTG TCTCTGCGGG TATAGAGACG ATGGTTCTTG CCCGGGAGGT GGGGTATTTT ACTTCAGAAA TTCATGGGGC GAAGAATGGG CTTCCCAGAA TATCAGAGGA AGAGGGTACG GCGAAATGCC ATACGCATAC ATGCAGATGT ATGCCAGGGA TGTTTATTCG ATGAGGTGCG AGGCAACAGG TCTGCTGAAT TTCCCTGTAG CTGCCGTCCG AGCGGCGATT GAGCCTGGAG TCGCTTTTAT TAAGTCCGCG GCTATGGTCA TGCTGACAGT GGCAATCATG AGCATGGGCT CCCTTGTTGC CTGGCAATTC GGTCGGGCGG TTCAGTTGCC GACGGCCGAA CCTCAGATGG ATACGCAGCA GCCCGTTGAG CAGGTATATT CAGCTGCTAT GGAGCCAAAG GAAGAAGGCA CAAGTTCGGA AGATAGAGTG TTAGAAAACG AAGAAATGTT GTCAACGAAC AATGTTTATA TCCTTAATAT TCTTGAAGAC GTAAAGAATA TATTGAATAG GTGA
|
Protein sequence | MIDIRSLPGV PSDILMGLGL LGLKTVEGFL SLMQIHESRE AFKKMVGLSE SALGRSIQQL REHLPKSCGA MHPSPVMPVQ LGLSWGHVEA SYPANYFGSM AHDSSEEEVD LIPQLPPVRN QGRRGTCVAF AATAAFEHEM RRNGLFGIGN VLAFTRTRSG RKIFRRFDDE TKMLSPQFLY WACKSADGVD GPGTMISTAM ECLHDRGCCP EKDWPYSPDS QWGNEGQGPP PFGAEESARK RRIAQYDELT SFGSVPLQHM KRLLAQGHLL VFGVPVFPSW GNIETQQTGR VIMPISGEQS LGGHALCLCG YRDDGSCPGG GVFYFRNSWG EEWASQNIRG RGYGEMPYAY MQMYARDVYS MRCEATGLLN FPVAAVRAAI EPGVAFIKSA AMVMLTVAIM SMGSLVAWQF GRAVQLPTAE PQMDTQQPVE QVYSAAMEPK EEGTSSEDRV LENEEMLSTN NVYILNILED VKNILNR
|
| |