Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dfer_0574 |
Symbol | |
ID | 8224143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | - |
Start bp | 651188 |
End bp | 652651 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 644928450 |
Product | peptidase C1A papain |
Protein accession | YP_003085004 |
Protein GI | 255034383 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4870] Cysteine protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.17095 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCCCT CTTTCCTATT TGCCGTCGTG TATGTGCTGG CCTTGCTGCT TCCGGCGTCG GAATCGCTGG CGCAAAAACG CGGCATGGGC CTGAAAATGG ACGACGCGCG GTACCTGCGC CTCCCGCGCA AGTCACCCAA TATTGTATTC AAAGGCGTAT TACCGGCCTC ATTCAGCCTG CGAAGCCTGA TGCCGGAAAT CGGCGACCAG GGCATGGACG GGACGTGTGT GGGCTGGTCG GCGGCATACT ACATGCGCAC GGTCATGGAA GCTGGCAAAC AGGGACTCAC AGGTCAGCCG GCCAGGATCA CCGCTACGGC ATTCTCGCCC GGCTGGCTCT ACGGGCAGAT CCGTTCGGGA CAGGATAACG GCTGTGCGGA TGGCGTTTAC CTTGAAGACG CGCTGGAAGT GATGAAAACG AAAGGATCGG CGTTCCTGAG CTGCGCGCCG GGGAATTGCA CGGTCCGTTA CGACCAATGC GACGACAAGG CTACCAACTA CAAAATCGCC GATTACGCCA CACTTTTCAG TCCGGGCGAC AGCAGCTTTA CCGCGGCGCA GCGCATTCAT GCCATCAAAT CAGCGCTGGT GGAAAGCAAA AGTGCCGTGC TCACCGGCAT GCTCGTACCG CCATCCTTCA TAGACGCTAC CGGCGAAAAC TGGCAGCCGG CACCGGGCGA ATCGGCCGCC AATGCCATCG GAGGCCATTC GCTGGCCATC ATCGGCTACG ACGATCATGT GAACGAAGGC TCGTTCCTGA TCGCCAACAG CTGGGGCACG GCGTGGGGAA GCGGCGGTTA CATTTGGGCT AAATACGGCG ACCTGGCCCG CTTCATCAAG AACGCCTACC AGATCTACAC CGAACCGCCC GCCAGGCCGC AACCGCAGAC GATAGCAATG CAGGGAGACA TTGATTTTCT CACCGGAAGC GGCGCTATGG CCGTCCATTC GACCGTTTCG AAAGGTGCCC AGGTAACCAT GGGACAACCA AAAACCGAAA TGCTTACCTA CACCATGAGC CAGCCCCATG CTTCGGGCAC GCGATTCAAG ATGGTGATCA GCAACAGCCG GCAATCGTAT GTTTACATTC TCGGGTCCGA CAAGGAGAAC CGGGTTTCGG CGCTGTTTCC CGACAATTCG GAAGAAATGA TCACAAGCGC CGTGGTCCCC GCCAACAGCC AGATGCTGAT GCCTTCCCCG CATAGTTCCT TCACGCTCGA CGACATGACG GGCGAAGATT ATTTCATTGT ATTTATTTCT CAAAACGAGC TGAACCTCGA AGAACTGGCC GCAAGAATTA AAAATGCCGA CGGTACCATC GTGCAAAAAG CCTTTTCCGC ATTGGGAGAA GAATTTATCT CTCCAAAAGC CATTTCCTAC GCGCCCGGAA AAATTTCCTA CGAAGTGAAA GGCGCTCCGA AAGGCAGCGT TGTTCCGGTA CTAGTTAAGA TCATACACCA ATAA
|
Protein sequence | MKPSFLFAVV YVLALLLPAS ESLAQKRGMG LKMDDARYLR LPRKSPNIVF KGVLPASFSL RSLMPEIGDQ GMDGTCVGWS AAYYMRTVME AGKQGLTGQP ARITATAFSP GWLYGQIRSG QDNGCADGVY LEDALEVMKT KGSAFLSCAP GNCTVRYDQC DDKATNYKIA DYATLFSPGD SSFTAAQRIH AIKSALVESK SAVLTGMLVP PSFIDATGEN WQPAPGESAA NAIGGHSLAI IGYDDHVNEG SFLIANSWGT AWGSGGYIWA KYGDLARFIK NAYQIYTEPP ARPQPQTIAM QGDIDFLTGS GAMAVHSTVS KGAQVTMGQP KTEMLTYTMS QPHASGTRFK MVISNSRQSY VYILGSDKEN RVSALFPDNS EEMITSAVVP ANSQMLMPSP HSSFTLDDMT GEDYFIVFIS QNELNLEELA ARIKNADGTI VQKAFSALGE EFISPKAISY APGKISYEVK GAPKGSVVPV LVKIIHQ
|
| |