Gene Dbac_3079 details


Gene Information

Locus tag: Dbac_3079
Symbol:
ID: 8378774
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfomicrobium baculatum DSM 4028
Kingdom: Bacteria
Replicon accession: NC_013173
Strand:
Start bp: 3481938
End bp: 3484778
Gene Length: 2841 bp
Protein Length: 946 aa
Translation table: 11
GC content: 64%
IMG OID: 645002314
Product: Pitrilysin
Protein accession: YP_003159570
Protein GI: 256830842
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1025] Secreted/periplasmic Zn-dependent peptidases, insulinase-like
TIGRFAM ID:
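
The Start bp, End bp, Gene Length and Protein Length fields are mutually consistent, which a little arithmetic confirms. The Python sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends in exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(3481938, 3484778)
print(length_bp)                  # 2841
print(protein_length(length_bp))  # 946
```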


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCATAT TGCGTTTTGT TCTTTTCTTC CTGTTCCTCG CCCTGCAGGC CCAGGCCGCG 
CAAGTGCGGA TCTCGGACAC GGACTACCGC GCCTATCGGG CCTTGACTCT TGAAACCGGC
CTGGAAGTGC TGCTGGTGCA TGACCAGCGC GCCTCCAAGG CCGCCGCGGC CCTGGCTCTG
CCCGTGGGCA GCCTCGACAA CCCGGACTCC CAGCCCGGCC TGGCCCATTA TCTGGAGCAC
ATGCTCTTCC TGGGTTCCAC GTCCTATCCC GGCCCCGAAG AGTACCAATC CTTCATCACC
AGAAACGGCG GACAGACCAA CGCGGCCACG GGGTACACCT CCACTACCTA CATGATCGAG
GTCGATCCTC CGGCTTTCCC CGAGGCGCTG CGGCGCATGG CCGACACCCT GGCCCGGCCG
CTGCTCGACC CGGTCTACGC GGACAAGGAG CGAAACGCCG TCAATGCGGA GATGGAGTCC
AAAAAGCACA GCGACGGACG CCGCCTAGCC ATGCTGATGC TCTCGACCCT GAACCCGGAT
CATCCGGCCA CCCGCTTCAC TGGCGGCAAC CTGGAGACCC TGTCCGACAA GCCCGGCAGC
CGCCTGCACG ACGAACTGGT CCGCTTCCAC CAGACATGGT ACTCGGCGAA CCTGATGAAA
GGCGTGCTCT ACGGCCCCCA GAGCCTTGAC GAGTTGGAAG CCCTGGCCCG GAGCGAACTT
GCCGTCATCC CCGACCGCCA GGCCAAAATA GAGGTTCCCG TCGCGCCGCC GGCCACGGAC
GCCGAAAAAG GCGTGATCGT CGGCGTGCGC CCCGTGCGCG AGACACGGAG CATGAGCATC
GAATTCGTCC TGCCCCAGGC CCTGGACGAC TCCCGCACCA AGCCCCTGCA GGTCGTATCC
GCCGTGCTCG GCACGGAGAC CGGGCACTCC CTGGTCGAAA TGCTGCGAGA CAAGGGGCTG
GCGCTGGGTC TTTCGGCCGG AGGAGACACC ACTTCCTTGC GCAACGGCGT GACCCTGTCC
CTCTTCGTGC AACTCACCGA AGAAGGCGAC AGGAAACGCG ACGAGGTGCT GGCCACGATA
TTCGCCTACT TCGATCTGCT GCGCGCCCAG GGTCTGGGAG AGACGTATTT CGAGCAGCTG
CGGCGCATGC TGGACATGGA ATTCCGCTTC GCCCCCCTGG CCAGCGGCTT CGACTATGTC
GCCTCCGCCG CGACGCAGAT GCTGCGGCAT CCCGTGGAAG ACGTGAATTA CGGCCCCTAC
CGCCTGGACT CCTTTGACCG CGAGGCCGTA AACAGCGTGA TCGAAGCGCT CAGGCCCGAA
AACGCGCGCA TCTTCCAGGT CGGCCCGGAT CAGCCTGTGG ACAGGGAGGC CTTCTTCTAC
CAGACCCCGT ACAGCGCAAG GCCCATCGAA GACGGCGACA TCACCCGTTG GGGCAAGCTT
TCCGCAGGGA TGGAATTGCG CCTGCCGGAC CTGAACCCCT TCCTGCCGGA TGATTTTTCG
CTGGTCGCAG CCAAGGGAAA CGCAGAGCCT CGCAAACTCA CGGACAAACC CGGTCTCTCC
CTTTGGCACG CCGGATCGGC TTTTAGGCAG GAGCCCAAGG CCATCCTCAT GACCCGGCTG
CAATCCGCCC ATTTCGCCGC CACCCGCGAG CAAACGGCCC TGCAGGGCGT GCTGCTCGAA
CTGTGGGATC AGCAGCAGGC GGGCCTGCGC TATCAGGCCA TGGAGGCGGG GCTTGGATTG
TCCGTGTCCG GCGATGAAGG CATCGTCATC CGAATTGACG GATTCAGTCA GCATCAGGCT
GACCTCCTGC CCCGCGTGCT CGACTTTCTC GAGCAGGACG TCACGCCGGA AGATTTCATG
CAGGCCAAGG CGGAACAGCT GCGCAGCCTG GCCAACATGG AAAAGCAGGG CCTGTTTGGT
CAGGCCATGG GCGCCATGCG CAACCTGCTC AAAGTCCCGT CATGGGACCA CCGGGCCATC
GAAGAGACCA CCAGGGGGCT GACCCTGCAG GACCTTGGCG AATACCTGCG CACAGTGCGG
CGGGATCTGC GCTTCACGGT CTTCGGGTTC GGCAATATTA CTCCGGACGA CCTGCGCAAG
CTGGAGGGGG ACCTTCGGCC GTTCATCGGA CCTGAAGCAG GAGCGCCGCC GATTGCGACA
CGCATCGCAC CCAGACAGGG CGTCGTGGCC GATTACCGCA AAGCGAGCGT GCTGGAAGAC
AGCGCCCTGG TCGAAATGTT CCTGGCCCCT GAGACCGGCT CAGGTTCCAA GGCGCGCATG
CTCCTGCTGG AAGGACTTCT GTCCAACCGC TTCTTCAGCC GTCTGCGCAC CGAAGAGCAA
TTGGGCTACG TGGCGACCAG CTTTCCGGTC ATGTTCGCCC ACGCCGCCGG GATCGGATTC
GGCGTGCAGA GCCCGGTGCA GGGAACGGCA GGCCTGGCCG ACCGCTTCGA GTCCTTCTAC
TACACGGCCC TCTCGCAGCT GCGCGGCGTG ACCGGGGAAG AATTCGAATC CGTGCGCCAG
GGCGTGCTGG CCTCACTGAC CAAAAGCCCG GATACCCTGG AAGAGGAATT CGGCTGGCTG
GAAACGGACC TGCGCCTGGG CAACCAGGCC TTCGACGGCC GCGACAAGCT GGTCGATTCT
TTGAGGAAAG CCACCCTGCC CGAGATCGTG CGCGCCTACG AGACCATGGT CATGGGCCCC
GGCGGCACGC GGGCGCTCAT CCAGATCCAA GGCTCGCGCT TCGACGATTT CGGCTGGGCC
CGCAAAAGCG GGGCGGAACA CGTCGCGGAG CCTACGGATT TTCACAGGCT GATGGGCGTC
CAACGCTATC AGGGACTGTG A
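
The gene sequence above is printed in blocks of ten bases, so the spacing has to be stripped before any analysis. The following is a minimal Python sketch of that clean-up plus a GC-content calculation. The helper names are hypothetical, and the example string is only the first printed line of the sequence, so its GC fraction need not equal the 64% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First printed line of the gene sequence only (not the full gene).
first_line = "ATGCCCATAT TGCGTTTTGT TCTTTTCTTC CTGTTCCTCG CCCTGCAGGC CCAGGCCGCG"
print(len(clean_sequence(first_line)))  # 60
print(f"{gc_content(first_line):.0%}")
```

Passing all printed lines of the gene sequence joined together to `gc_content` would give the figure to compare against the GC content field.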
 
Protein sequence
MPILRFVLFF LFLALQAQAA QVRISDTDYR AYRALTLETG LEVLLVHDQR ASKAAAALAL 
PVGSLDNPDS QPGLAHYLEH MLFLGSTSYP GPEEYQSFIT RNGGQTNAAT GYTSTTYMIE
VDPPAFPEAL RRMADTLARP LLDPVYADKE RNAVNAEMES KKHSDGRRLA MLMLSTLNPD
HPATRFTGGN LETLSDKPGS RLHDELVRFH QTWYSANLMK GVLYGPQSLD ELEALARSEL
AVIPDRQAKI EVPVAPPATD AEKGVIVGVR PVRETRSMSI EFVLPQALDD SRTKPLQVVS
AVLGTETGHS LVEMLRDKGL ALGLSAGGDT TSLRNGVTLS LFVQLTEEGD RKRDEVLATI
FAYFDLLRAQ GLGETYFEQL RRMLDMEFRF APLASGFDYV ASAATQMLRH PVEDVNYGPY
RLDSFDREAV NSVIEALRPE NARIFQVGPD QPVDREAFFY QTPYSARPIE DGDITRWGKL
SAGMELRLPD LNPFLPDDFS LVAAKGNAEP RKLTDKPGLS LWHAGSAFRQ EPKAILMTRL
QSAHFAATRE QTALQGVLLE LWDQQQAGLR YQAMEAGLGL SVSGDEGIVI RIDGFSQHQA
DLLPRVLDFL EQDVTPEDFM QAKAEQLRSL ANMEKQGLFG QAMGAMRNLL KVPSWDHRAI
EETTRGLTLQ DLGEYLRTVR RDLRFTVFGF GNITPDDLRK LEGDLRPFIG PEAGAPPIAT
RIAPRQGVVA DYRKASVLED SALVEMFLAP ETGSGSKARM LLLEGLLSNR FFSRLRTEEQ
LGYVATSFPV MFAHAAGIGF GVQSPVQGTA GLADRFESFY YTALSQLRGV TGEEFESVRQ
GVLASLTKSP DTLEEEFGWL ETDLRLGNQA FDGRDKLVDS LRKATLPEIV RAYETMVMGP
GGTRALIQIQ GSRFDDFGWA RKSGAEHVAE PTDFHRLMGV QRYQGL
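
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a rough Python sketch, assuming the amino-acid assignments of NCBI translation table 11 (these match the standard code; the tables differ only in the permitted start codons), the first ten codons and the last four can be translated as follows. The `translate` helper is written for illustration only.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G,
# with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGCCCATAT TGCGTTTTGT TCTTTTCTTC"))  # MPILRFVLFF
print(translate("CAGGGACTGTGA"))                      # QGL
```

Both outputs agree with the start and end of the protein sequence listed above.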