Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dbac_0490 |
Symbol | |
ID | 8376139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfomicrobium baculatum DSM 4028 |
Kingdom | Bacteria |
Replicon accession | NC_013173 |
Strand | + |
Start bp | 561507 |
End bp | 564416 |
Gene Length | 2910 bp |
Protein Length | 969 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644999730 |
Product | Peptidase M16C associated domain protein |
Protein accession | YP_003157031 |
Protein GI | 256828303 |
COG category | [R] General function prediction only |
COG ID | [COG1026] Predicted Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCACCG GATTCACCCG CGTTTCAACC ACGTATGTCC AGGAAATTTC ATCACGGGCC GATATCTTCG TGCACGACAG GACAGGGGCG CGCATCCTGT CCGTGGTCAA CGATGACGAG AACAAGGTCT TCGGCATCAG CTTTCGCACT CCCCCGTCCG ATTCGACCGG TGTGGCGCAC ATCCTGGAGC ATTCGGTGCT GTGCGGGTCG CGCAAGTACC CGGTCAAGGA ACCCTTTGTG GACCTCTTGA AGGGATCCTT GCAAACATTT CTGAACGCCA TGACCTACCC GGACAAGACC TGTTATCCCG TGGCCAGTCA GAACCTGAAG GATTTCTACA ATCTGGTCGA CGTCTACCTG GACGCGGTCT TCTTCCCGCG CATCACCCCC GAGATTTTCG AGCAGGAGGG CTGGCATCTG GACCTGCCGG GCAAAGGCGG GGAGCTGTCC ATCAAGGGCG TGGTCTATAA TGAAATGAAG GGCGTGTACT CTTCGCCGGA TTCGCAGCTC GCCGAGCATT CCCAGCAATC CCTTTTTCCG GACACGACCT ACGGCCTGGA TTCCGGCGGC AACCCTCAGG CCATCACGGA ACTGACCTAC GAGGGCTTTC TGGAGTTTCA CCGCACCCTG TACCATCCTT CAAACGCCTG GATCTTCTTT TATGGCGATG ACGACCCGGA GGTCAGATTT GAACTCCTTG GTGAATATCT GGATCAGTTT TCCGCTCTGG AGGTGAACTC GACCATCGCC CGGCAGCAGG CTTTCGACGC GCCGCGCGAA GTGCGGCGCG GTTTCGAGGC CATGGGCGAG GATGACGACG CCCTGGGCAT GCTGACCATG AACTGGCTTT TGCCGGGCAA GGAAGACTCC GGGACCGTGC TGGCCTGCAA GATTCTGGAC GGCCTGTTGA CCGGCATGAA CGCCTCGCCC CTGCGCAAGG CGCTCATCGA GTCGGGCCTG GGCGAGGACC TGACCGGGGC GGGCGTGGAG CACGAGATGG CCCAGATGTA CTATTCCGTG GGTATGAAGG GCGTGCAGCC GGAAAATTTT CAGGCGGTGC GCGATCTCAT CGTCACGACT CTCGAAGACA TCGTGGCGCA CGGCTTTGAG GCCGACCTGA TTGAGGCCGG AATCAATTCC GCCGAGTTCG ATCTGCGCGA GAACAACACC GGCTCCTATC CGCGCGGCCT CATTGTCATG CTGCGTGCAC TGGGGTCCTG GCTGTATGAT CTCGATCCTC TGGAGCTGGT GGCCTTCGAG GCTCCCATGG CGGCCTTGAA GGAGCGCCTG GCCAAAGGCG AGAGGGTTTT CGAAGACCTC ATCGAACGCC ATATTCTCAG GAATCCCCAC GCAAGCGTGG TCATTCTCGA GCCCGAAGAA GGTCATGCGG AGCGGGTGGA ACAGGAGGAA GAGGCGCTCA TAGCCAAGCT GCGCGTGGAT CAGGCGGCCC TGTCCGACGA TGAGCTGGTG CGGCGCACGG AGCATCTGCG GCGCATGCAG GAGGCCCCGG ATTCGCCCGA GGCCCTGGCT CTTTTGCCCA GTCTCTCCCG GGAGGACATC GACCCCGCGG TGCGGGTCGT GCCCACGGAA GTGCGTGAGT GGGAGAGCGC CACGGCGCTC ATGCACGACC TGCCGACCAA CAATATCTGC TACATGGATC TGGCTCTGGA CCTTGGCGCG GTACCCGACC GGCTCATTCC CCTGGTCCCA CTCTTCGGAC GCGCCCTGAC CGAGATGGGC ACCATGAAGG AGGACTACGT GTCCTTCTCC AAGCGCATCA ACAGCAAGAC CGGCGGGGTC TACGCCAGAA GCCTTCTGTC CCAGCGCGAG GACGGACCCG GCCCCGTGGC CCGGCTGGTG GTCCGCGCCA AGGCCGTGGG CGAGCGGGTC GGCGACATGA TCGATATCGT GCGCGACGCC CTGACCCAGG CCCGCTTCGA CGACCGCGAA CGCTTCCGGC AGATGGTGCT GGAGGAAAAG GCGGGGCTTG AGCACGCCCT GGTGCCTTCG GGTCATCATT TCGTGGGCCT GCGCCTGCGC GCGCGCTTCA ACCTGGCGGA CAGCCTGCAG GAACGCATGG GCGGGCTGGA GAATCTCTTC TTCCTGCGCG AACTGGCCGA GAGGATGGAC ACAGATTTCA GCGGGGTGCT GTCCGATCTG GAAGAACTGC GCCGGGCGCT GGTGCGCAAG GACGGGGCGG TGTTGAATCT GACCATGGAC GAGGCCATGC TCACCGCCCA TGGCGATCGT TTTCGGGAAT TCGTGGCGGG TCTGCCGGGC GGGACCCTGG CGGACGCGGT CTGGCAGGTG GCGGGCACGG ATGGGCACGA GGGGCTCATC ATCCCGGCTC AGGTCAATTA TGTCGGCAAG ATCTGCGACC TGCACAGTGC CGGCTACTCC TTTCACGGTT CAAGCCTGGT CGCGGTCAAA TACCTGCGCA CGACCTGGCT GTGGGAGCAG GTCCGGGTCC TGGGCGGAGC GTATGGCGGT TTTTGCAACT TCGGCCGTCT CTCGGGGCTC ATGAGTTTCG GGTCCTACCG TGATCCCAAC GTCACGTCCA CGCTGGCCGC TTTCGACGGC TGCGGGAAAT TCCTGGAGAC GGTGAGCCTG GACCGGGGCG AGCTCTTAAA GGCCATCATC GGCACCTCGG GCGATCTCGA TCCGTATCAG CTCCCCGATT CCAAGGGCTT CACGGCCTTG AGCCAGCACC TGGCCGGGGT GACTACCGAG ACCCGCCAGC GTATCCGCGA CGAGGTCCTG GCCACGGACG AGCACCATTT CCGCGAGTTT GGCACGCTGC TGCGCGACGC CGCGCCCGCG GGGCTGATCT GCGTGCTCGG CGGTGAAGAT TCCCTGACCA GGGCCGCCTC CCAGGGCCTT GAGCTTCAAC GCAAGATCCG TCTGCTTTAA
|
Protein sequence | MATGFTRVST TYVQEISSRA DIFVHDRTGA RILSVVNDDE NKVFGISFRT PPSDSTGVAH ILEHSVLCGS RKYPVKEPFV DLLKGSLQTF LNAMTYPDKT CYPVASQNLK DFYNLVDVYL DAVFFPRITP EIFEQEGWHL DLPGKGGELS IKGVVYNEMK GVYSSPDSQL AEHSQQSLFP DTTYGLDSGG NPQAITELTY EGFLEFHRTL YHPSNAWIFF YGDDDPEVRF ELLGEYLDQF SALEVNSTIA RQQAFDAPRE VRRGFEAMGE DDDALGMLTM NWLLPGKEDS GTVLACKILD GLLTGMNASP LRKALIESGL GEDLTGAGVE HEMAQMYYSV GMKGVQPENF QAVRDLIVTT LEDIVAHGFE ADLIEAGINS AEFDLRENNT GSYPRGLIVM LRALGSWLYD LDPLELVAFE APMAALKERL AKGERVFEDL IERHILRNPH ASVVILEPEE GHAERVEQEE EALIAKLRVD QAALSDDELV RRTEHLRRMQ EAPDSPEALA LLPSLSREDI DPAVRVVPTE VREWESATAL MHDLPTNNIC YMDLALDLGA VPDRLIPLVP LFGRALTEMG TMKEDYVSFS KRINSKTGGV YARSLLSQRE DGPGPVARLV VRAKAVGERV GDMIDIVRDA LTQARFDDRE RFRQMVLEEK AGLEHALVPS GHHFVGLRLR ARFNLADSLQ ERMGGLENLF FLRELAERMD TDFSGVLSDL EELRRALVRK DGAVLNLTMD EAMLTAHGDR FREFVAGLPG GTLADAVWQV AGTDGHEGLI IPAQVNYVGK ICDLHSAGYS FHGSSLVAVK YLRTTWLWEQ VRVLGGAYGG FCNFGRLSGL MSFGSYRDPN VTSTLAAFDG CGKFLETVSL DRGELLKAII GTSGDLDPYQ LPDSKGFTAL SQHLAGVTTE TRQRIRDEVL ATDEHHFREF GTLLRDAAPA GLICVLGGED SLTRAASQGL ELQRKIRLL
|
| |