Gene Information |
Locus tag | Dbac_3079 |
Symbol | |
ID | 8378774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfomicrobium baculatum DSM 4028 |
Kingdom | Bacteria |
Replicon accession | NC_013173 |
Strand | + |
Start bp | 3481938 |
End bp | 3484778 |
Gene Length | 2841 bp |
Protein Length | 946 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645002314 |
Product | Pitrilysin |
Protein accession | YP_003159570 |
Protein GI | 256830842 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1025] Secreted/periplasmic Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
|
|
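The coordinate and length fields above agree with one another, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates, and that the unspliced CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of the record.
length = gene_length_bp(3481938, 3484778)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values match the Gene Length (2841 bp) and Protein Length (946 aa) rows.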
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCATAT TGCGTTTTGT TCTTTTCTTC CTGTTCCTCG CCCTGCAGGC CCAGGCCGCG CAAGTGCGGA TCTCGGACAC GGACTACCGC GCCTATCGGG CCTTGACTCT TGAAACCGGC CTGGAAGTGC TGCTGGTGCA TGACCAGCGC GCCTCCAAGG CCGCCGCGGC CCTGGCTCTG CCCGTGGGCA GCCTCGACAA CCCGGACTCC CAGCCCGGCC TGGCCCATTA TCTGGAGCAC ATGCTCTTCC TGGGTTCCAC GTCCTATCCC GGCCCCGAAG AGTACCAATC CTTCATCACC AGAAACGGCG GACAGACCAA CGCGGCCACG GGGTACACCT CCACTACCTA CATGATCGAG GTCGATCCTC CGGCTTTCCC CGAGGCGCTG CGGCGCATGG CCGACACCCT GGCCCGGCCG CTGCTCGACC CGGTCTACGC GGACAAGGAG CGAAACGCCG TCAATGCGGA GATGGAGTCC AAAAAGCACA GCGACGGACG CCGCCTAGCC ATGCTGATGC TCTCGACCCT GAACCCGGAT CATCCGGCCA CCCGCTTCAC TGGCGGCAAC CTGGAGACCC TGTCCGACAA GCCCGGCAGC CGCCTGCACG ACGAACTGGT CCGCTTCCAC CAGACATGGT ACTCGGCGAA CCTGATGAAA GGCGTGCTCT ACGGCCCCCA GAGCCTTGAC GAGTTGGAAG CCCTGGCCCG GAGCGAACTT GCCGTCATCC CCGACCGCCA GGCCAAAATA GAGGTTCCCG TCGCGCCGCC GGCCACGGAC GCCGAAAAAG GCGTGATCGT CGGCGTGCGC CCCGTGCGCG AGACACGGAG CATGAGCATC GAATTCGTCC TGCCCCAGGC CCTGGACGAC TCCCGCACCA AGCCCCTGCA GGTCGTATCC GCCGTGCTCG GCACGGAGAC CGGGCACTCC CTGGTCGAAA TGCTGCGAGA CAAGGGGCTG GCGCTGGGTC TTTCGGCCGG AGGAGACACC ACTTCCTTGC GCAACGGCGT GACCCTGTCC CTCTTCGTGC AACTCACCGA AGAAGGCGAC AGGAAACGCG ACGAGGTGCT GGCCACGATA TTCGCCTACT TCGATCTGCT GCGCGCCCAG GGTCTGGGAG AGACGTATTT CGAGCAGCTG CGGCGCATGC TGGACATGGA ATTCCGCTTC GCCCCCCTGG CCAGCGGCTT CGACTATGTC GCCTCCGCCG CGACGCAGAT GCTGCGGCAT CCCGTGGAAG ACGTGAATTA CGGCCCCTAC CGCCTGGACT CCTTTGACCG CGAGGCCGTA AACAGCGTGA TCGAAGCGCT CAGGCCCGAA AACGCGCGCA TCTTCCAGGT CGGCCCGGAT CAGCCTGTGG ACAGGGAGGC CTTCTTCTAC CAGACCCCGT ACAGCGCAAG GCCCATCGAA GACGGCGACA TCACCCGTTG GGGCAAGCTT TCCGCAGGGA TGGAATTGCG CCTGCCGGAC CTGAACCCCT TCCTGCCGGA TGATTTTTCG CTGGTCGCAG CCAAGGGAAA CGCAGAGCCT CGCAAACTCA CGGACAAACC CGGTCTCTCC CTTTGGCACG CCGGATCGGC TTTTAGGCAG GAGCCCAAGG CCATCCTCAT GACCCGGCTG CAATCCGCCC ATTTCGCCGC CACCCGCGAG CAAACGGCCC TGCAGGGCGT GCTGCTCGAA CTGTGGGATC AGCAGCAGGC GGGCCTGCGC TATCAGGCCA TGGAGGCGGG GCTTGGATTG TCCGTGTCCG GCGATGAAGG CATCGTCATC CGAATTGACG GATTCAGTCA GCATCAGGCT 
GACCTCCTGC CCCGCGTGCT CGACTTTCTC GAGCAGGACG TCACGCCGGA AGATTTCATG CAGGCCAAGG CGGAACAGCT GCGCAGCCTG GCCAACATGG AAAAGCAGGG CCTGTTTGGT CAGGCCATGG GCGCCATGCG CAACCTGCTC AAAGTCCCGT CATGGGACCA CCGGGCCATC GAAGAGACCA CCAGGGGGCT GACCCTGCAG GACCTTGGCG AATACCTGCG CACAGTGCGG CGGGATCTGC GCTTCACGGT CTTCGGGTTC GGCAATATTA CTCCGGACGA CCTGCGCAAG CTGGAGGGGG ACCTTCGGCC GTTCATCGGA CCTGAAGCAG GAGCGCCGCC GATTGCGACA CGCATCGCAC CCAGACAGGG CGTCGTGGCC GATTACCGCA AAGCGAGCGT GCTGGAAGAC AGCGCCCTGG TCGAAATGTT CCTGGCCCCT GAGACCGGCT CAGGTTCCAA GGCGCGCATG CTCCTGCTGG AAGGACTTCT GTCCAACCGC TTCTTCAGCC GTCTGCGCAC CGAAGAGCAA TTGGGCTACG TGGCGACCAG CTTTCCGGTC ATGTTCGCCC ACGCCGCCGG GATCGGATTC GGCGTGCAGA GCCCGGTGCA GGGAACGGCA GGCCTGGCCG ACCGCTTCGA GTCCTTCTAC TACACGGCCC TCTCGCAGCT GCGCGGCGTG ACCGGGGAAG AATTCGAATC CGTGCGCCAG GGCGTGCTGG CCTCACTGAC CAAAAGCCCG GATACCCTGG AAGAGGAATT CGGCTGGCTG GAAACGGACC TGCGCCTGGG CAACCAGGCC TTCGACGGCC GCGACAAGCT GGTCGATTCT TTGAGGAAAG CCACCCTGCC CGAGATCGTG CGCGCCTACG AGACCATGGT CATGGGCCCC GGCGGCACGC GGGCGCTCAT CCAGATCCAA GGCTCGCGCT TCGACGATTT CGGCTGGGCC CGCAAAAGCG GGGCGGAACA CGTCGCGGAG CCTACGGATT TTCACAGGCT GATGGGCGTC CAACGCTATC AGGGACTGTG A
|
Protein sequence | MPILRFVLFF LFLALQAQAA QVRISDTDYR AYRALTLETG LEVLLVHDQR ASKAAAALAL PVGSLDNPDS QPGLAHYLEH MLFLGSTSYP GPEEYQSFIT RNGGQTNAAT GYTSTTYMIE VDPPAFPEAL RRMADTLARP LLDPVYADKE RNAVNAEMES KKHSDGRRLA MLMLSTLNPD HPATRFTGGN LETLSDKPGS RLHDELVRFH QTWYSANLMK GVLYGPQSLD ELEALARSEL AVIPDRQAKI EVPVAPPATD AEKGVIVGVR PVRETRSMSI EFVLPQALDD SRTKPLQVVS AVLGTETGHS LVEMLRDKGL ALGLSAGGDT TSLRNGVTLS LFVQLTEEGD RKRDEVLATI FAYFDLLRAQ GLGETYFEQL RRMLDMEFRF APLASGFDYV ASAATQMLRH PVEDVNYGPY RLDSFDREAV NSVIEALRPE NARIFQVGPD QPVDREAFFY QTPYSARPIE DGDITRWGKL SAGMELRLPD LNPFLPDDFS LVAAKGNAEP RKLTDKPGLS LWHAGSAFRQ EPKAILMTRL QSAHFAATRE QTALQGVLLE LWDQQQAGLR YQAMEAGLGL SVSGDEGIVI RIDGFSQHQA DLLPRVLDFL EQDVTPEDFM QAKAEQLRSL ANMEKQGLFG QAMGAMRNLL KVPSWDHRAI EETTRGLTLQ DLGEYLRTVR RDLRFTVFGF GNITPDDLRK LEGDLRPFIG PEAGAPPIAT RIAPRQGVVA DYRKASVLED SALVEMFLAP ETGSGSKARM LLLEGLLSNR FFSRLRTEEQ LGYVATSFPV MFAHAAGIGF GVQSPVQGTA GLADRFESFY YTALSQLRGV TGEEFESVRQ GVLASLTKSP DTLEEEFGWL ETDLRLGNQA FDGRDKLVDS LRKATLPEIV RAYETMVMGP GGTRALIQIQ GSRFDDFGWA RKSGAEHVAE PTDFHRLMGV QRYQGL
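The sequences above are displayed in space-separated blocks of ten characters, so the spaces need to be removed before any length or composition check. The following is a minimal sketch in plain Python, with no bioinformatics library assumed. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs only on the first two blocks of the gene sequence, not on the full record.

```python
def clean_sequence(raw: str) -> str:
    """Drop the display whitespace between blocks and normalise case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, copied from the record.
first_blocks = "ATGCCCATAT TGCGTTTTGT"
print(len(clean_sequence(first_blocks)), gc_fraction(first_blocks))
```

Applying the same helpers to the complete gene sequence would allow the Gene Length and GC content rows to be checked against the sequence itself.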