Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dfer_1628 |
Symbol | |
ID | 8225199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | - |
Start bp | 1972730 |
End bp | 1975192 |
Gene Length | 2463 bp |
Protein Length | 820 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644929483 |
Product | peptidase C1A papain |
Protein accession | YP_003086035 |
Protein GI | 255035414 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4870] Cysteine protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.305041 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCGGG TTTCATCCGG TAGCCGCGCG GATACATTTT TATTTGTCGC AGTTTTGAAA TACAAAGCCG CTCCGCGTTC GGCCGAAGAT TTTCGTAATA ACATCCTCTA TCAGTTCTCG ATGAAATCAC GCTTACTGAA ACCGCTCCGG AAGATTTTTA CAATGCTTCT GGTGTTTCTC ACTGTCATTG CCTGTAAAAA GGACGTCGTT CCCACGCAGC CTGCACCCGA GCCCGAACCG GAAGTCGTGA TTCCCGGCAA AACGGTCGGG TTCGGCGTGG AATTGGTCCC GGCGGAAGAA TATGAGAAAT TGCCTCTGAT CGCCGAGCCG GTCATGGCCA ACGGCCGCAC CATGAAGGAC CCGACGCTCA CCCCAACGTA CGATTTGACG GCCAAAATGC CGCCCGTGGC CAGTCAGGGA AACCAAAACT CCTGCACGGC ATGGGCAACG GCATTTGCCG CGAGAAGCTA CCTCCATTCC CGCCTGACAG GCGCGAGCTA CGTGGGCAGC GACGGCAACC GCGACAATGC CCGCGTATTC AGCCCGGCAT TTGTGTACAA TCAGATCAAC GGAGGCAACG ATAAAGGCTC ATTTACCTAC AATGCGCTGG ACCTGATGAA GAACACAGGC GTGTGTTCCT GGCAGGATAT GCCCTATAAA GACACCGATT TTCTCACCAA GCCAACCAAC GAGCAAACGC AAAAAGCGGG CAATTTTAAG ATTAAAGATT GGGGTAGAAT CAATATTTCG GAGAGCGTTT TTAAGAAATT CATTTACTAC GATTACCCTG TTATCATCAG TGCCTACCTC GATAACAGCT TCCTTGAACT TACGCACAAA GACCCGCAAA ACGAATTTGT GTGGAAAGAA AATACCGGCG CCAAAAGTGC GCACGCCATG GTTGTTGTTG GATACGACGA TAGCCGCAAA GCATTCAAAG TTCAGAATTC CTGGGGGAAA AACTGGGCTA ACCGCGGGTA TATCTGGCTG TCGTACGAGC TGGTCGAAGA AGTGATCAGG GAAGCGTATA TTATGATCGT CGATGACAAA TCCATTGTGG CACCACCGAA GGTCGAAACC GTGGGGGCGA ACCTGGAAGA GGACGGCGAA GTGGTGTTCT CTGCCCGCGT TACCGACCGC GGCGACGCGC CGATCCTTGG CGTCGGATTT TGCATTGCCA GTACTAGAAG TCTGCCCGAA GTGAAATCGA GCGTGCGCAT TGAAGGGATC TCGATGGTGC CCTACAACTT TACCTATTCG CAACGCCTCG CGGGCGATAC GCTCTGGTAC CGTGCCTATG CCGAAACCGT TTCGGGAACG GTTTATGGCG ATACCGCCCA CGTGGTGCTC AAAAACACCA GTAACCCCGT CGGCTCACTG GCCCAGAATA CGTTGTTTTT CAACGACGGA CGCCAGGCAT TCTCGGTCGA CGTCGATAAT GGAACGGTAC TATGGACCTC GCCCAAGGAT GGATCTTCGA ACGATAAGGG AAGCGTTTAT GCCAATGGCA TGTATGTTTT CGGCGAACAT CGCCTTGTGG GTGTTGATGC CGTCACCGGG AAAACGAAGT GGATGTATTC CGATCCCCGG TACCACACCT CATTTTATTC CCAACCGGTG GCAATCGGCG GAACAGTGTT CATGATCGGT GAGTATCGCC TCACTGCGAT TGACGTCCAG TCGGGCCTGA AACTGTGGTC GCTGGACGCG GCCGAGTTCG CGAATGATGC CAATACGAGC TTCCACGGCG GATTGAGCGT AACCAAAGAC AATAAGCTGT TTTTCGTAAC CTGGGGAAGG AATTCTAAAT ACACGGGTTA CATCGTGAGC GATCCGAGAA ACGGCAAATC GATCACCGAA TTCGATAACC AGGGCGAATC CTACTCCGGT AACCCGTTCT GGGACGATAA CCAGCTGGTG ATGGCTACCG GCGCGAGAGA CCTGAAATCA TTCGCACTGA AACCCGCACC GAAGGAGATG TGGCGCTCGC GCGAATTCCT GTCCAGCTCC GTGATCAGCA ACAATGTGGT GGTAGGGCAC GACGTTGCCG GCAAGGCCCT CAAAGGGCTC GATAAAGCAA CGGGTACCCG GTTGTGGCAA TACACGCCGG CGGTGGGAGA AATCTATACG AGGGCATGGA GCGTCTCAGG CAAATATGCG GCCATGACGG TCATGGAGCG AACCAGCGCA TTTGGCGGCA ACCGCTTCAT CCATGTGATT GACATTACCA ATGGAAAACT GGTGTGGGAG AAAAAGCTCG CCACCACAGT GGCCGAAAGC GTTCTGGCGG CAGGCAACAA GGTTGTGACC TGGGATGGCG AAGCGGTGGC ATTCGACATT GCCACCGGCA ACCAGCTTTG GAAAACCAAT GTAAACACCT CCAAACTCAT CTTCCCACGC GATATGGTGG TTGTCCAGAA AAATGGAACG AGCTACTATA TGGTTGAATC AGGCATGAAA TAG
|
Protein sequence | MNRVSSGSRA DTFLFVAVLK YKAAPRSAED FRNNILYQFS MKSRLLKPLR KIFTMLLVFL TVIACKKDVV PTQPAPEPEP EVVIPGKTVG FGVELVPAEE YEKLPLIAEP VMANGRTMKD PTLTPTYDLT AKMPPVASQG NQNSCTAWAT AFAARSYLHS RLTGASYVGS DGNRDNARVF SPAFVYNQIN GGNDKGSFTY NALDLMKNTG VCSWQDMPYK DTDFLTKPTN EQTQKAGNFK IKDWGRINIS ESVFKKFIYY DYPVIISAYL DNSFLELTHK DPQNEFVWKE NTGAKSAHAM VVVGYDDSRK AFKVQNSWGK NWANRGYIWL SYELVEEVIR EAYIMIVDDK SIVAPPKVET VGANLEEDGE VVFSARVTDR GDAPILGVGF CIASTRSLPE VKSSVRIEGI SMVPYNFTYS QRLAGDTLWY RAYAETVSGT VYGDTAHVVL KNTSNPVGSL AQNTLFFNDG RQAFSVDVDN GTVLWTSPKD GSSNDKGSVY ANGMYVFGEH RLVGVDAVTG KTKWMYSDPR YHTSFYSQPV AIGGTVFMIG EYRLTAIDVQ SGLKLWSLDA AEFANDANTS FHGGLSVTKD NKLFFVTWGR NSKYTGYIVS DPRNGKSITE FDNQGESYSG NPFWDDNQLV MATGARDLKS FALKPAPKEM WRSREFLSSS VISNNVVVGH DVAGKALKGL DKATGTRLWQ YTPAVGEIYT RAWSVSGKYA AMTVMERTSA FGGNRFIHVI DITNGKLVWE KKLATTVAES VLAAGNKVVT WDGEAVAFDI ATGNQLWKTN VNTSKLIFPR DMVVVQKNGT SYYMVESGMK
|
| |