Gene Dbac_1250 details

Gene Information

Locus tag: Dbac_1250
Symbol:
ID: 8376915
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfomicrobium baculatum DSM 4028
Kingdom: Bacteria
Replicon accession: NC_013173
Strand:
Start bp: 1378391
End bp: 1379824
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 52%
IMG OID: 645000483
Product: peptidase C1A papain
Protein accession: YP_003157769
Protein GI: 256829041
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4870] Cysteine protease
TIGRFAM ID:
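
The coordinate and length fields in this record agree with each other if the start and end positions are read as 1-based and inclusive, and if the protein length excludes the stop codon. A minimal Python sketch of that arithmetic follows; the function names are illustrative and not part of the record, and the inclusive-coordinate convention is an assumption.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1378391, 1379824)
print(length, protein_length(length))  # prints: 1434 477
```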


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0216738
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTGATA TTCGCTCACT ACCGGGTGTT CCCAGTGATA TCTTGATGGG CTTGGGGCTG 
CTTGGTTTAA AGACCGTGGA GGGGTTTCTT TCGCTCATGC AGATCCATGA AAGCCGCGAG
GCTTTCAAGA AAATGGTCGG TTTGAGCGAA AGTGCACTGG GGCGCAGCAT TCAGCAGCTT
CGCGAGCATC TTCCGAAAAG CTGCGGGGCC ATGCACCCAT CGCCTGTAAT GCCGGTTCAG
CTTGGTTTGA GCTGGGGCCA TGTTGAGGCG TCATATCCTG CAAACTATTT TGGATCGATG
GCCCATGATT CTTCCGAGGA AGAAGTCGAT CTCATTCCTC AGTTGCCACC TGTTCGTAAT
CAGGGGCGGC GCGGAACTTG TGTCGCTTTT GCCGCTACGG CCGCCTTCGA ACATGAAATG
CGCAGGAATG GCCTGTTCGG AATAGGAAAT GTCCTCGCGT TCACGCGTAC GAGAAGCGGG
CGCAAAATTT TCCGTAGATT TGATGATGAA ACCAAGATGC TTTCGCCGCA GTTTTTGTAC
TGGGCCTGCA AAAGCGCGGA CGGGGTTGAT GGCCCAGGGA CGATGATCTC AACAGCCATG
GAATGCTTGC ATGATCGTGG ATGTTGTCCG GAAAAAGACT GGCCATATTC CCCTGATTCG
CAGTGGGGAA ATGAAGGACA GGGGCCACCG CCGTTTGGCG CCGAGGAATC TGCGAGGAAG
CGACGTATTG CGCAATATGA TGAACTGACG AGCTTCGGCT CCGTCCCCCT TCAGCATATG
AAGCGATTGC TGGCACAAGG TCATTTGCTG GTATTTGGCG TGCCCGTGTT TCCAAGCTGG
GGAAATATCG AAACCCAGCA GACAGGACGA GTGATCATGC CCATCTCCGG AGAGCAATCT
CTCGGTGGCC ATGCGCTGTG TCTCTGCGGG TATAGAGACG ATGGTTCTTG CCCGGGAGGT
GGGGTATTTT ACTTCAGAAA TTCATGGGGC GAAGAATGGG CTTCCCAGAA TATCAGAGGA
AGAGGGTACG GCGAAATGCC ATACGCATAC ATGCAGATGT ATGCCAGGGA TGTTTATTCG
ATGAGGTGCG AGGCAACAGG TCTGCTGAAT TTCCCTGTAG CTGCCGTCCG AGCGGCGATT
GAGCCTGGAG TCGCTTTTAT TAAGTCCGCG GCTATGGTCA TGCTGACAGT GGCAATCATG
AGCATGGGCT CCCTTGTTGC CTGGCAATTC GGTCGGGCGG TTCAGTTGCC GACGGCCGAA
CCTCAGATGG ATACGCAGCA GCCCGTTGAG CAGGTATATT CAGCTGCTAT GGAGCCAAAG
GAAGAAGGCA CAAGTTCGGA AGATAGAGTG TTAGAAAACG AAGAAATGTT GTCAACGAAC
AATGTTTATA TCCTTAATAT TCTTGAAGAC GTAAAGAATA TATTGAATAG GTGA
 
Protein sequence
MIDIRSLPGV PSDILMGLGL LGLKTVEGFL SLMQIHESRE AFKKMVGLSE SALGRSIQQL 
REHLPKSCGA MHPSPVMPVQ LGLSWGHVEA SYPANYFGSM AHDSSEEEVD LIPQLPPVRN
QGRRGTCVAF AATAAFEHEM RRNGLFGIGN VLAFTRTRSG RKIFRRFDDE TKMLSPQFLY
WACKSADGVD GPGTMISTAM ECLHDRGCCP EKDWPYSPDS QWGNEGQGPP PFGAEESARK
RRIAQYDELT SFGSVPLQHM KRLLAQGHLL VFGVPVFPSW GNIETQQTGR VIMPISGEQS
LGGHALCLCG YRDDGSCPGG GVFYFRNSWG EEWASQNIRG RGYGEMPYAY MQMYARDVYS
MRCEATGLLN FPVAAVRAAI EPGVAFIKSA AMVMLTVAIM SMGSLVAWQF GRAVQLPTAE
PQMDTQQPVE QVYSAAMEPK EEGTSSEDRV LENEEMLSTN NVYILNILED VKNILNR
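
The relationship between the two sequence blocks can be illustrated by translating the first line of the gene sequence and comparing it with the start of the protein sequence. The sketch below is a minimal example, not the database's own pipeline. It assumes the NCBI standard codon ordering, whose amino acid assignments translation table 11 shares, and it does not handle table 11's alternative start codons (this gene starts with ATG, so that does not matter here). The helper names are hypothetical.

```python
# Codon table built from the NCBI ordering (bases T, C, A, G).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the record's layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above (60 bases).
first_line = "ATGATTGATA TTCGCTCACT ACCGGGTGTT CCCAGTGATA TCTTGATGGG CTTGGGGCTG"
print(translate(first_line))   # prints: MIDIRSLPGVPSDILMGLGL
print(gc_percent(first_line))  # prints: 50.0
```

The translated fragment matches the first 20 residues of the protein sequence. The 50.0 figure applies only to these 60 bases; the 52% GC content reported in the record refers to the whole gene and is not recomputed here.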