Gene Dfer_0574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_0574 
Symbol 
ID8224143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp651188 
End bp652651 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content56% 
IMG OID644928450 
Productpeptidase C1A papain 
Protein accessionYP_003085004 
Protein GI255034383 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4870] Cysteine protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.17095 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCCCT CTTTCCTATT TGCCGTCGTG TATGTGCTGG CCTTGCTGCT TCCGGCGTCG 
GAATCGCTGG CGCAAAAACG CGGCATGGGC CTGAAAATGG ACGACGCGCG GTACCTGCGC
CTCCCGCGCA AGTCACCCAA TATTGTATTC AAAGGCGTAT TACCGGCCTC ATTCAGCCTG
CGAAGCCTGA TGCCGGAAAT CGGCGACCAG GGCATGGACG GGACGTGTGT GGGCTGGTCG
GCGGCATACT ACATGCGCAC GGTCATGGAA GCTGGCAAAC AGGGACTCAC AGGTCAGCCG
GCCAGGATCA CCGCTACGGC ATTCTCGCCC GGCTGGCTCT ACGGGCAGAT CCGTTCGGGA
CAGGATAACG GCTGTGCGGA TGGCGTTTAC CTTGAAGACG CGCTGGAAGT GATGAAAACG
AAAGGATCGG CGTTCCTGAG CTGCGCGCCG GGGAATTGCA CGGTCCGTTA CGACCAATGC
GACGACAAGG CTACCAACTA CAAAATCGCC GATTACGCCA CACTTTTCAG TCCGGGCGAC
AGCAGCTTTA CCGCGGCGCA GCGCATTCAT GCCATCAAAT CAGCGCTGGT GGAAAGCAAA
AGTGCCGTGC TCACCGGCAT GCTCGTACCG CCATCCTTCA TAGACGCTAC CGGCGAAAAC
TGGCAGCCGG CACCGGGCGA ATCGGCCGCC AATGCCATCG GAGGCCATTC GCTGGCCATC
ATCGGCTACG ACGATCATGT GAACGAAGGC TCGTTCCTGA TCGCCAACAG CTGGGGCACG
GCGTGGGGAA GCGGCGGTTA CATTTGGGCT AAATACGGCG ACCTGGCCCG CTTCATCAAG
AACGCCTACC AGATCTACAC CGAACCGCCC GCCAGGCCGC AACCGCAGAC GATAGCAATG
CAGGGAGACA TTGATTTTCT CACCGGAAGC GGCGCTATGG CCGTCCATTC GACCGTTTCG
AAAGGTGCCC AGGTAACCAT GGGACAACCA AAAACCGAAA TGCTTACCTA CACCATGAGC
CAGCCCCATG CTTCGGGCAC GCGATTCAAG ATGGTGATCA GCAACAGCCG GCAATCGTAT
GTTTACATTC TCGGGTCCGA CAAGGAGAAC CGGGTTTCGG CGCTGTTTCC CGACAATTCG
GAAGAAATGA TCACAAGCGC CGTGGTCCCC GCCAACAGCC AGATGCTGAT GCCTTCCCCG
CATAGTTCCT TCACGCTCGA CGACATGACG GGCGAAGATT ATTTCATTGT ATTTATTTCT
CAAAACGAGC TGAACCTCGA AGAACTGGCC GCAAGAATTA AAAATGCCGA CGGTACCATC
GTGCAAAAAG CCTTTTCCGC ATTGGGAGAA GAATTTATCT CTCCAAAAGC CATTTCCTAC
GCGCCCGGAA AAATTTCCTA CGAAGTGAAA GGCGCTCCGA AAGGCAGCGT TGTTCCGGTA
CTAGTTAAGA TCATACACCA ATAA
 
Protein sequence
MKPSFLFAVV YVLALLLPAS ESLAQKRGMG LKMDDARYLR LPRKSPNIVF KGVLPASFSL 
RSLMPEIGDQ GMDGTCVGWS AAYYMRTVME AGKQGLTGQP ARITATAFSP GWLYGQIRSG
QDNGCADGVY LEDALEVMKT KGSAFLSCAP GNCTVRYDQC DDKATNYKIA DYATLFSPGD
SSFTAAQRIH AIKSALVESK SAVLTGMLVP PSFIDATGEN WQPAPGESAA NAIGGHSLAI
IGYDDHVNEG SFLIANSWGT AWGSGGYIWA KYGDLARFIK NAYQIYTEPP ARPQPQTIAM
QGDIDFLTGS GAMAVHSTVS KGAQVTMGQP KTEMLTYTMS QPHASGTRFK MVISNSRQSY
VYILGSDKEN RVSALFPDNS EEMITSAVVP ANSQMLMPSP HSSFTLDDMT GEDYFIVFIS
QNELNLEELA ARIKNADGTI VQKAFSALGE EFISPKAISY APGKISYEVK GAPKGSVVPV
LVKIIHQ