Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dbac_0671 |
Symbol | |
ID | 8376324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfomicrobium baculatum DSM 4028 |
Kingdom | Bacteria |
Replicon accession | NC_013173 |
Strand | + |
Start bp | 745616 |
End bp | 747037 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644999913 |
Product | protease Do |
Protein accession | YP_003157210 |
Protein GI | 256828482 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0470073 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATATG CACTTCGACT CTTTACCATT ATGTTCGTGT TTCTGGCCTC GGCCTCCATG GCTGCGCAGT TGCCGGATTT CACGGAGCTC GCGGAAAAGT CGGGCCAGGC GGTGGTTAAC ATCAGTACGG TCAAGCTCGT GAAAAATCAG GGCAACATGC AGCAGTTTTT CCCGAGGGGG CCACAGGGAC AGCACCCGTT CGGAGATTTT TTCGACCAGT TCGAGCGTTT TTTCGGAGAG CAGGGGCAAG GAACTCCGCG TGAACAGCGT TCTCTGGGAT CGGGTTTCGT CTTCTCCGCC GACGGGTACA TCGTCACCAA CAATCACGTC ATCGAGGGCG CGGATTCCAT CAAGGTCAAC CTCCAGGTCG ACAAGAACGG AGACCGTTCC TACGACGCCG AGGTCATCGG GACGGACAAG GAGACGGATC TGGCGCTGCT GAAGATCAAG GCCGACAAGC CGTTGCCGTA CCTCGCCTTT GGTGACTCCG ACGTGCTCAA GGTCGGGCAA TGGGTCATGG CCATCGGCAA CCCCTTTGGC CTTGATCATA CCGTCACGGC CGGAATTGTC AGCGCCAAGG GGCGCACCAT CGGCGCCGGT CCCTACGACA ACTTCATCCA GACCGACGCC TCCATCAACC CCGGCAACAG CGGTGGTCCG CTCATCGACC TGGACGGAAA GGTCATCGGC ATCAATACGG CCATCGTTGC TTCGGGTCAG GGCATCGGTT TTGCCATCCC CAGCGATCTG GCCAGACAGG TCATTGAGCA GCTCAAGGAA TACAAGAGCG TGAAGCGCGG CTGGCTCGGC GTGTCCATCC AGAATGTGGA CGAGAACTCC GCCAAGGCCT TGGGCCTTGA CCAGGCCAGC GGCGCCCTGG TCTCGTCCGT GACTGTCGGA GACCCGGCGG AAAAGGCCGG AATCAAGGCA GGAGACGTCA TTGTCGCGGT GGATGGAGTG TCGGTGGCCG ACGCCGGCGA TCTGACCCGC AAGATCGGCG ACCTCTTGCC CGGCGTGAAG ATCACGCTTT CGGTCTGGCG CGAAGGCAAG ACCGTCACGA TCCCTCTGGT TCTGGGTGAG CGCAGCGCGG AGAAGGTCGC TCAGGGCCGG CCCGGCGCTC CTGGCAGCCA GGGCGAGGAT GTCCTGGGCC TGAGCGTTCG GCCCGTGGCC GAGGCCGAGG CGAAGGCGCT GGAACTCGAC CGGGCCCAGG GGCTTCTGGT GGTTGAAGTG AGCGAGGGAT CCCCGGCTGC GCAAAACGAC TTGAGCGCAG GGGATGTCAT CCTTGAAGCC AACGGCAAGG CCGTGAACAC GGTCAAGGCC CTCAAGGACG TGATCGAAGG CGATGGCAAG GAAAAGGGCG TCGTCATGCT GCTGGTCAAG CGCCAGGGTC GCAACGTGTT CCGCACCGTG CCCCTTTCCT AG
|
Protein sequence | MKYALRLFTI MFVFLASASM AAQLPDFTEL AEKSGQAVVN ISTVKLVKNQ GNMQQFFPRG PQGQHPFGDF FDQFERFFGE QGQGTPREQR SLGSGFVFSA DGYIVTNNHV IEGADSIKVN LQVDKNGDRS YDAEVIGTDK ETDLALLKIK ADKPLPYLAF GDSDVLKVGQ WVMAIGNPFG LDHTVTAGIV SAKGRTIGAG PYDNFIQTDA SINPGNSGGP LIDLDGKVIG INTAIVASGQ GIGFAIPSDL ARQVIEQLKE YKSVKRGWLG VSIQNVDENS AKALGLDQAS GALVSSVTVG DPAEKAGIKA GDVIVAVDGV SVADAGDLTR KIGDLLPGVK ITLSVWREGK TVTIPLVLGE RSAEKVAQGR PGAPGSQGED VLGLSVRPVA EAEAKALELD RAQGLLVVEV SEGSPAAQND LSAGDVILEA NGKAVNTVKA LKDVIEGDGK EKGVVMLLVK RQGRNVFRTV PLS
|
| |