Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dfer_4188 |
Symbol | |
ID | 8227789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | + |
Start bp | 5065551 |
End bp | 5067350 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644932034 |
Product | peptidase C1A papain |
Protein accession | YP_003088556 |
Protein GI | 255037935 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTCA AAGCTAAAAT TTCACTATAC CTTACATTTC TGCTAGGCCT TTTGCTCATT GTTTCTTGCA AAAAGGACAC GAAGGAAGAG CAGGTGACTC CGGTGCAACC CATCGAATCC ACCGCCCAGC GAAGCGGTTT GCTGCTGGCG GAACCGGGCG CTTATCAGCG CATTCCATTG ATCGATGAGC CGTTGGTTAA TGCCCGTACC TCTGAAAAAA CCACACTTCC CGATGAATAT ATCATCCCGG GCCTCGTACC GGTGGGAAAC CAGGGTGATA TGGAAAGCTG TGTAGGCTGG GCGACGGCTT ATGCGGCCCG GACCTTGCTT TACCAACGCG GCAAGCCTGT TAATTATCTC AGTAATGACG GCCAATTGCT TACCGAACTT GTTTTCAGCC CGGAATTCGT ATGGACGTCT ATTAATCAGG GTCAAAACAA AGGCGTGCTC ATGGCGGATG CGATGAATTT GATCCAGCAA GTTGGTGTAG CGAGGTATAG GGATTTTCCT ACGCGAAATC AGCCGGCCCC GAAACCGACC GGCGCTCAAA ACCAAAGTGC GTCGACATTT AAGGTAAAAA ACTGGGGACG TATAGCCCAT AATGCGGCCA CCTTCCGAAA GTTTTTATAT TATGACAACC CGATCATCAT TGCAATCAAA CCGGATGAAA ATTTCAAGAA ACCTGCGCAA ATGCTGCCGG ACGGTAGCCT CGTTTGGGAT AACTATGGAA CACCGGGCGC CGGGCATGCG TGCACGATTG TAGGTTACAG CAAGGAGAGA AATGCGTTTT TGGTCCAAAA CTCCTGGGGA AAGCGCTGGG GGGACAGGCT TACGGAGGAA GACGCGAAAA AGCCAATGGC GGGGTTCATC TGGCTCGATT ACAACCTGTT GGAAAAGGTT GTCACCGAAG CCTATGTGAT GATTGCCAAT GATCCGGGCT ATGAAAAGCC TGTGGTGAAC ACGTGGGCGG CCGGTTCCGA ATCGGAAAAT GAAATCACGC TGAACGGTTC AGTGGAAAAA TTCGAAGACA AGGCCTTGTC GGAATATGGA TTTTTGGTTT CTACGGATTC TACGGGGTTG GAATTGGGTG GGAAGGCGGA AAGCATCCAA TTCCATACGA TACTCGCGCC GCCGTATTCG TTTGCCAAAA CCTACACTAG CAAAGGTGCA GGGAAGCGAT TTTACAGAGC TTACGCCAAA TCTTACTCGG AAGTATACTA TGGAAAAATA GTTGCATTCA CTGTCCAGCC CGAAATTGAG GATTCCGGAA GACTTTCTGG GAATATTGTG GTGAGCACGA ACTTCGAAAA CCGCTACACC AAAGCGGACA ATGGGTATAC GGAATACAAT AGCATCAGGG AATACAAATT CGAAATTACA TTTTCCGAGA AGCACCCGCG ATACAGGTAC CGGGCGCTCT ACTTGTTTAC CGCAAACGGT TCTTACTTCG CGGTGCGTGG AATTTCCGAA GGCCCCGAAA AGGATGCGGT AGCCTACCTC GACCTGAGTT ACCGATCGTG GGAATTCCCG TTTGCCAATG GGCAAATCGG TCAGGGCAGC GCATTACCCG ACGTGCCACT ATACCGGATA GCCGATCTGG AAGAAAGCCC TAAAAACTGG ACAATCGGCT CGAAGCTTAT CAATGATTTG ATCAGCGGGT TTATGGTAGT CATGTCGGAT GACCCGGACT TCGAGAAGAA TGTAATCGCA ACCGACCAGA TAGCCTTTCC CATGAACCTG AATTCAGATT CGTTCAAACC CACCGAAATC CCTCCGCAAC CGCTTCATGT GGCCTTTTAG
|
Protein sequence | MDFKAKISLY LTFLLGLLLI VSCKKDTKEE QVTPVQPIES TAQRSGLLLA EPGAYQRIPL IDEPLVNART SEKTTLPDEY IIPGLVPVGN QGDMESCVGW ATAYAARTLL YQRGKPVNYL SNDGQLLTEL VFSPEFVWTS INQGQNKGVL MADAMNLIQQ VGVARYRDFP TRNQPAPKPT GAQNQSASTF KVKNWGRIAH NAATFRKFLY YDNPIIIAIK PDENFKKPAQ MLPDGSLVWD NYGTPGAGHA CTIVGYSKER NAFLVQNSWG KRWGDRLTEE DAKKPMAGFI WLDYNLLEKV VTEAYVMIAN DPGYEKPVVN TWAAGSESEN EITLNGSVEK FEDKALSEYG FLVSTDSTGL ELGGKAESIQ FHTILAPPYS FAKTYTSKGA GKRFYRAYAK SYSEVYYGKI VAFTVQPEIE DSGRLSGNIV VSTNFENRYT KADNGYTEYN SIREYKFEIT FSEKHPRYRY RALYLFTANG SYFAVRGISE GPEKDAVAYL DLSYRSWEFP FANGQIGQGS ALPDVPLYRI ADLEESPKNW TIGSKLINDL ISGFMVVMSD DPDFEKNVIA TDQIAFPMNL NSDSFKPTEI PPQPLHVAF
|
| |