Gene Dbac_3248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDbac_3248 
Symbol 
ID8378944 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfomicrobium baculatum DSM 4028 
KingdomBacteria 
Replicon accessionNC_013173 
Strand
Start bp3679326 
End bp3681290 
Gene Length1965 bp 
Protein Length654 aa 
Translation table11 
GC content63% 
IMG OID645002484 
Productpeptidase U32 
Protein accessionYP_003159739 
Protein GI256831011 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAATCA AGAAAAAAAT CGAAATTCTC GCTCCGGCCG GTGACATAGA CAGCTTTTTG 
GCCGCCATCG CCGCCGGAGC CGACGCCATC TACTGCGGCC TCAAAAATTT TTCCGCGCGC
ATGGAGGCGG AGAATTTTTC CCTGACCGAG CTCGCGGCCC TGACCGAGCT GGCTCATGCC
AAGGGCATTC GCGTGCATGT GGCCATGAAT AATCTGCTTA AAACTCCGGA ACTGGATCAG
GCCGGGCGCC TCATCCATCG GCTGCAGACG CAGGTCGGGC CGGACGCGCT CATTGTGCAG
GATCTGGGCC TGCCCGTCCT GGCCAGGCAG GCCGGGTTCA CGGGCGAGCT GCATCTCTCC
ACCCTGGCCA ACGGCGGCAC CATTGCCGGA CTGCCGCAGA TTCTGGCCTT GGGCGTGGAT
CGTCTGGTGT TGCCCCGGGA GCTGTCCATC GACGAGATCA AAGCCGTCGC CAGCCGCTGT
CCGGAGGGCC TTGGGCTGGA AGTGTTCGTG CATGGCGCGC TCTGCTACGC GGTCTCCGGC
CGGTGCTACT GGTCGAGCTA TCTGGGGGGC AAAAGCGGTC TGCGCGGCCG TTGCGTGCAG
CCTTGCCGGC GTCAGTACCA GCAGAAGAGC CAGAAGACGT CGTTCTTTTC CTGCGACGAT
CTGTCCCTGG ACGTGTTGGT GCGTCCTTTG AGCGGCGTGG ACAAGGTAGA TTCCTGGAAG
ATCGAGGGGC GCAAAAAAGG GCCGCATTAC GTCTACTATA CGGTCACGGC CTATCGCATG
CTGCGTGACG CGGGCGATGA TCCGGCCCAG CGCAAGGCGG CTCTGGGGCT TTTGGACATG
GCGCTGGGGC GGCCGTCCTC GCACTACAAT TTTCTTGGGC ACCGGCCCAG CAATCCCATC
GCCGACCGGG AACAGACCGC CTCGGGGCAC ATGGTCGGCA AGGTCCAGGG CGGGTTCAAG
GCCGCCTATG TTTCCCCGCG CGAACCGCTC AAAAGCGGAG ACCTGCTGCG CGTCGGCTAC
GAGGATCAGG CCGGGCATCA GACCGTGAAA ATCCGGCGCG ACATCCCCAA GGGCGGGCGC
CTGGAGCTGG CCAAAGGTCA GGGGCGTCCC GCACCCCAGG GCGCCCCGGT GTTTCTGGTG
GACCGGCGGG AGCGGGAGCT GCAGGCCTTG ATCGCGGACC TGAAAAAGGA TCTGTCTTTC
GACCGGCCGA CCAAGGAGTC CACCTTTACC CCGACCCTGC CGCGAACCGT GCGGCGCAAG
GGCAAGCCGC GCGAGATGGA CGTCTGGCGC AGCCTGCCGG CCAGACTTGG CAATCAGAGC
GAGCAGGCAG TGTGGCTGAC GCCGGGCGTG GAGCGGAACA TTTCGCGCAA TATTTTCGGG
CGTATCTGGT GGTGGTTGCC GCCCGTGATC TGGCCGGCCG AAGAAAAAGC CTGGACCGAG
TGCCTGGAGA ACATGACCCG CCTCGGTGCC AGGCAGTTCG TGCTGAACGC GCCCTGGCAG
GCAGGTCTCT TTGCCAAGCC CGAACGCCAG ACCTTCTGGG CCGGGCCCAT GTGCAACACG
GCCAACCCCC TGGCTCTGGC CGAACTTTTC CGAATGGGCT TTGCCGGGGC CTTTGTCAGT
CCGGAACTAG ACCGCCAGGG TTTTCTGGAA TTGCCCGCCC TTTCGCCGCT GCCGCTGGGC
ATTCTGGTCA AGGGCATACA TCCCCTGTGC GTCTCGCGCA TCCTTTCGGA CAACCTGCGC
GAGCGCGAGC CCTTCCAGAG TCCCAAGGGC GAGCAGTTTT GGTCAAGAAC CTACGGCGGT
CTGGTGTGGA CTTTCCCCAA CTGGGAAATC GACCTGAGCG AGAAATGGAC GGAACTGGAA
AAGGCCGGAT ACGCCCTCCT GGCGCGTCTG CATGAACCCG TGCCTGACAA GGTGTCCATC
AAGGAGCGCC AGGGGCTCTG GAACTGGGAC CTGAAGATGC TGTAA
 
Protein sequence
MEIKKKIEIL APAGDIDSFL AAIAAGADAI YCGLKNFSAR MEAENFSLTE LAALTELAHA 
KGIRVHVAMN NLLKTPELDQ AGRLIHRLQT QVGPDALIVQ DLGLPVLARQ AGFTGELHLS
TLANGGTIAG LPQILALGVD RLVLPRELSI DEIKAVASRC PEGLGLEVFV HGALCYAVSG
RCYWSSYLGG KSGLRGRCVQ PCRRQYQQKS QKTSFFSCDD LSLDVLVRPL SGVDKVDSWK
IEGRKKGPHY VYYTVTAYRM LRDAGDDPAQ RKAALGLLDM ALGRPSSHYN FLGHRPSNPI
ADREQTASGH MVGKVQGGFK AAYVSPREPL KSGDLLRVGY EDQAGHQTVK IRRDIPKGGR
LELAKGQGRP APQGAPVFLV DRRERELQAL IADLKKDLSF DRPTKESTFT PTLPRTVRRK
GKPREMDVWR SLPARLGNQS EQAVWLTPGV ERNISRNIFG RIWWWLPPVI WPAEEKAWTE
CLENMTRLGA RQFVLNAPWQ AGLFAKPERQ TFWAGPMCNT ANPLALAELF RMGFAGAFVS
PELDRQGFLE LPALSPLPLG ILVKGIHPLC VSRILSDNLR EREPFQSPKG EQFWSRTYGG
LVWTFPNWEI DLSEKWTELE KAGYALLARL HEPVPDKVSI KERQGLWNWD LKML