Gene Dfer_1726 details


Gene Information

Locus tag: Dfer_1726
Symbol:
ID: 8225297
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dyadobacter fermentans DSM 18053
Kingdom: Bacteria
Replicon accession: NC_013037
Strand:
Start bp: 2106866
End bp: 2109235
Gene Length: 2370 bp
Protein Length: 789 aa
Translation table: 11
GC content: 52%
IMG OID: 644929580
Product: peptidase C1A papain
Protein accession: YP_003086132
Protein GI: 255035511
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4870] Cysteine protease
TIGRFAM ID:
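
The start and end coordinates, the gene length, and the protein length listed above agree with one another, which can be checked with simple arithmetic. The following is a minimal Python sketch. It assumes 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2106866, 2109235)
print(length)                  # 2370
print(protein_length(length))  # 789
```

Both values match the Gene Length and Protein Length fields of the record.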


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGCGCG CTCTACGCCT CCACGCGAGC ATTTACTGCC TCCTTCTATT TGTCATTATC 
ACTGCCTGCA GCAAGAAGCC AAGCGAACTT ACACCGCCCG ATCCAAAACC GGACGAGGAG
GTGATTCCCG GCGAAACGGT GGGCTATGGT GCCGAAGTGC TCGACCCGGC GGAATACGAA
AAAATCAAAA TAATCACCGA ACCGCTGATC CCCGGCGGCC GTACCCAGAA AGACCCGAAG
CTTGCGCCGC AATATGATTT GTCGCCGAAC ATGCCTCCGG TGCAAAGTCA GGGCCGGCAG
GGATCGTGTG TTGCCTGGGC GGTTGCCTAT GCAGCGCGCA GCTATTTTAG TGGTTTGGGA
TTCAATTCAA ATTTCAAACT TGCGGACGGC TCGGTCAACA ACGAAACGGT CCTCAGCCCT
GCCTTCGTTT ATAACAAGAT CAAAGCCGGA GATTGTTTGA AAGGATCCTA TATCGAAGAT
GCACTTAACC TTTTGAAAAA TACGGGCGTG TGTACCTGGA AAGACATGCC TTACACTGAC
CAGGAATGCG ACACGCAGCC CGGAACGCAG CAAGTTGCCG GTGCAGCGAA GTACAAGATA
GCCGACTGGG GACGGATATT GATCACCACC GATGCCATCA AGAAATTCCT GTATTACGAT
ATGCCCGTAG TAATTGCGGG CAGGCTCAAT GCTGATTTCA AAAAGCCTAC GCGTTTCCCC
GATGGTCAGT TCGTGTGGAA GGACGCCACC CCGAATGATG TTTCCCATCA CGCCATGATC
ATTGTGGGCT ACGATGATGC CCGTAAGGCT TTCAAAATCC AGAATTCATG GAGCAAGAAT
TGGTGTAACG ATGGCTATAT CTGGATGAGC TACGACATTG TTAAAGACGT TATAAGGGAA
GCTTATGTAA TGACTTCTGA CGAATTGCTG CTGCGATCGC AGGCCAGCGT GGAGACCGGC
GCGGCCGTAA AATACGAGCA GAATGTGGTC GGCTACACGG GACGAATCAT GCAATTGGGT
GATTTGCCGG TAATCGGATA CGGCATGTGC CTGGCTACAA CGCCGAGCCT GCCGACACGT
AAATATTACG CGAAAGCGCA GAACATTCCC GCTACTCCTT TCGAATTTTC CGTCACCGAG
AGCATCAACA CCAGCAAAAT ATGGTATCGG GCATTTGTGG AAACGGTCGC CGGAGTGGTA
TATGGCGATA CGGCCAGTAT CGTCCTGAAC AGCGGCCCGG GCGAGACGCG TGACAAAGAC
CTGCTCCTCT TTCACAGCGG CTTCTTTGGA TATGCCGTCG ACGCCGGCAC CGGGGAGCGA
CTTTGGGAGG CGCCTTTACG AAGCGGCGCT GCCTCCGACC AGCTCCTGGG CGGGTTGATC
GTCAATGAGC AATATATTGT ACCTGGCAAT ATGACGGGCT CACTCAGTGC CGTATCGCTG
TCGACCGGCA AGGTGAACTG GGAACGGCGG GAAATGATGC GGAGCAAACA AACGTATCCG
GTCATTATCG ATAACCAGAT CATCACGATT GGCCGTGAAG AGCTGATCGC CGTGGACCAG
GCGTCGGGAA ACTTGTCGTG GACCCAAAAG CAGGCTGCGT TCGGAGGCGA GGATTTTGTT
GAATTCGGAT TGGCTGTGAC AAAGGATAGT AAGCTCTCCG TGAATGTCAA CAAGAGTTTT
TACGACGATG AGATGTATTT TGTTTCCAAT CCTTCCAACG GGACCGGGAT TACCGGAATG
GGCATTTTCC CAACCACGGT GAAGGGCCAT CCGGATTTTG CGGACAACTA TATGGTGCGT
TCGGAGGCAA TGAGTGGCCT GGCGGCTTAC AGCCTGTCGC CTTTCAAGAA ACTGTGGAGC
ACGCCCTCTA CGACAACGGC AGCGCACGAT AGCCCGCTCA TCGTGGGATC GCGTGTGATC
GGGGTTATAA TGGGTAATTT GGTTCAGGGA TTGCAGGCGG TGGACAAGGC CACAGGTAAA
AAGCTCTGGG AGTATTATCC GCAGCAAGGC GAAGTACTGT CGAAAAAGTG GAGTGCCTCG
GCCGATTTCA TCGCTATGAT CGTGCGGGTA AACATGGGTG GCGCTTCTGA GTACGAGTAT
TCGTTGCAGG TACTGGATAT TTCCACAGGG AAAGTTTCCT GGACAAAAAA GCTGTCTACG
AGTCCCTATC CGGTAATTCC AATGTATCCT CTGATTGCCG GGGACAAGGT GTACGTATCT
CCCGACAATT TGGAGATCGC GGCCTTCAAG CTGCAAACGG GAGCGGTTAT CTGGCAAAAG
AAACTCTTTA ACAATCCTGG GACAATTAGT CCTTTCAGCC TGATCACTAA AGAAGGTAAG
ATCTTCTACA TGCCCGAGTC GGGGATGTAG
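
The gene sequence above is printed in blocks of ten bases, so the spacing has to be removed before the sequence can be measured or compared with the Gene Length and GC content fields. A minimal Python sketch of that cleanup follows; the helper names are hypothetical, and the demo uses only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence above, used only as a small demo.
demo = clean_sequence("ATGTTGCGCG CTCTACGCCT")
print(len(demo))          # 20
print(gc_fraction(demo))  # 0.6
```

Applying the same two functions to the complete sequence gives values that can be compared with the length and GC content reported in the Gene Information section.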
 
Protein sequence
MLRALRLHAS IYCLLLFVII TACSKKPSEL TPPDPKPDEE VIPGETVGYG AEVLDPAEYE 
KIKIITEPLI PGGRTQKDPK LAPQYDLSPN MPPVQSQGRQ GSCVAWAVAY AARSYFSGLG
FNSNFKLADG SVNNETVLSP AFVYNKIKAG DCLKGSYIED ALNLLKNTGV CTWKDMPYTD
QECDTQPGTQ QVAGAAKYKI ADWGRILITT DAIKKFLYYD MPVVIAGRLN ADFKKPTRFP
DGQFVWKDAT PNDVSHHAMI IVGYDDARKA FKIQNSWSKN WCNDGYIWMS YDIVKDVIRE
AYVMTSDELL LRSQASVETG AAVKYEQNVV GYTGRIMQLG DLPVIGYGMC LATTPSLPTR
KYYAKAQNIP ATPFEFSVTE SINTSKIWYR AFVETVAGVV YGDTASIVLN SGPGETRDKD
LLLFHSGFFG YAVDAGTGER LWEAPLRSGA ASDQLLGGLI VNEQYIVPGN MTGSLSAVSL
STGKVNWERR EMMRSKQTYP VIIDNQIITI GREELIAVDQ ASGNLSWTQK QAAFGGEDFV
EFGLAVTKDS KLSVNVNKSF YDDEMYFVSN PSNGTGITGM GIFPTTVKGH PDFADNYMVR
SEAMSGLAAY SLSPFKKLWS TPSTTTAAHD SPLIVGSRVI GVIMGNLVQG LQAVDKATGK
KLWEYYPQQG EVLSKKWSAS ADFIAMIVRV NMGGASEYEY SLQVLDISTG KVSWTKKLST
SPYPVIPMYP LIAGDKVYVS PDNLEIAAFK LQTGAVIWQK KLFNNPGTIS PFSLITKEGK
IFYMPESGM
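
The protein sequence corresponds to the gene sequence read codon by codon with translation table 11. As a hedged illustration, the Python sketch below builds the codon table from the standard amino acid string (the codon-to-amino-acid assignments of table 11 are the same as those of the standard table) and translates a spaced sequence. It assumes the input is in frame, and it does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate_cds` are illustrative, not part of any library.

```python
from itertools import product

# Codons in TCAG order paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate an in-frame coding sequence, dropping one trailing stop codon."""
    seq = "".join(dna.split()).upper()  # strip block spacing and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    if protein.endswith("*"):
        protein = protein[:-1]
    return protein


# First three and last three blocks of the gene sequence in this record.
print(translate_cds("ATGTTGCGCG CTCTACGCCT CCACGCGAGC"))  # MLRALRLHAS
print(translate_cds("ATCTTCTACA TGCCCGAGTC GGGGATGTAG"))  # IFYMPESGM
```

The two outputs match the first and last blocks of the protein sequence listed above.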