Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DvMF_2141 |
Symbol | |
ID | 7174063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | - |
Start bp | 2664868 |
End bp | 2666997 |
Gene Length | 2130 bp |
Protein Length | 709 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643540661 |
Product | peptidase U32 |
Protein accession | YP_002436552 |
Protein GI | 218887231 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 101 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACAACC GTGCCCCGGA ACGCGAGCAC CACCGCACCC GCCTGAGCGA TGCGGAACTG GCGGAACTGC TGCCCCCCAG CGTGACCGCC GGGGCCGCCG CATCCGGCAG CGCCGCGCCT GACCAGGCGG ACAACCTGCG CACGCCCGAA ATCCTTGCCC CGGCGGGCGA CATGCAGGCC GCGCTGGCCG CATTCGCCGC TGGTGCCGAC GCCGTGTACC TGGGCCTGAA GCACTTTTCG GCCCGCATGC AGGCGGAAAA CTTCTCCACC GGCGAACTGG CCGCGCTGAC GGACCTAGCC CATTCCGAGG GCCGCCGCAT CTACGTGGCC ATGAACTCCA TGCTGAAACC GGGCGACACC GGCGCAGCAG GCCGCCTGGT GGCCCGCCTG GCCCGTGACG TGCGGCCCGA CGCGCTCATC GTGCAGGACC TCGGCATGCT GGACATCGCC CGGCAGGCCG GTTTCGCGGG TGAACTGCAT CTTTCCACGC TGGCCAACGT CACCCACCCC GCCGCGCTCA CCGTGGCGCG CGACCTGGGC GCCAGCCGGG TGATCCTGCC GCGCGAGCTG AACATCGACG AAATCCGCGC CTGCGGAAAC GCCTGCCCGG ACGATCTGGA CCTTGAATTC TTCGTGCACG GGGCGCTGTG CTACTGCGTG TCGGGCCGCT GCTACTGGTC CAGCTACATG GGCGGCAAAA GCGGCCTGCG GGGCCGCTGC GTGCAGCCGT GCCGCCGGGT GTACCGCCAG AAGGGCCGCG AGGGGCGCTT CTTCTCCTGT CTCGACCTTT CGCTGGACGT GCTGGCAAAG ACCCTGCTGT CGGTGCCGCA CATGGCCTCG TGGAAGATCG AGGGCCGCAA GAAGGGGCCG CACTACGTGT ACTACGCGGT GTCGGCCTAC CGCCTGCTGC GCGACAACCC GGACGACCCC AAGGCCCGCA AGCAGGCCGA GGACATCCTT GAAATGGCCC TGGGCCGCCC CGGCACCCAT TCGCGCTTTC TGCCCCAGCG CGGCGACGAC CCCACCGCCC CCGGCGAGCA GACCAGTTCC GGCCTGCTGG CGGGCAAGGT GGGGCAGACG CCGGAAGGGG GCATCTTCTT CAAGCCGCGC TTCCAACTGC TGCCGCAGGA TCTTCTGCGC ATCGGCTACG AGGACGAATC GTGGCACACC ACCCTGCCGG TGGCCCGCCA CATCCCCAAG GCAGGCACCT TCAACCTGCG CCTGCCGCGC CACAAGACGC CCAAGCTGGG CACGCCGGTG TTCCTCATCG ACCGTCGCGA GCCGGAACTG ACGCGCGAGG TGCGCACTTG GCAGGCCAAA CTGGAACGCC ACCGCAAGGG CGGCGGCGAA GGCGGCGCGG TGGACTTCAC CCCCACCTAT CCCGCCCCTG GCAAGTCCGC CCGCCCGCTG GACGTGATCC TGCGTGGCTC GCTGCCGCAT GGCCGCGAGG GCAAGGCCGG GGTGCGCCCC GGCACGGTGA TGGGCCTGTG GCTGTCGCCC AAGGCCCTGC GCGAGGTGTC GCGCACGCTG TACCCGCGCA TCTCGTGGTG GCTGCCGCCA GTGATCTGGC CGGACGAAGA AGCCCAGTGG CTGCGCATGG CCCGCGAATC CGTCCGCGAC GGCGCGCGCC ACTTCGTGCT GAACGCGCCG TGGCAGGTGG GCCTGTTCGG GGACGCGCAC GCCCGGCGCG ACCTGGCGCT GACCGCCGGT CCCTTCTGCA ACACCTCCAA CCCGGCGGCG CTGGCCGTAC TGCGCAAACT GGGCTTTGCC GCCGCCATCG TCAGCCCCGA ACTGCCCGGA GAAGACATCC TGGCCCTGCC GCGCCAGAGC TACCTGCCGC TGGGCGTGGT GCTGTCCGGC TACTGGCCCA TGGGCGTCAC CCGCCACCAC CTGGAGGGGC TGAAGCCGGG CGAGCCGTTC AACAGCCCCA AGAACGAGGT GTTCTGGGCC CGGCGCTACG GCCAGAATAC CTGGATATAC CCCGGCTGGC CGCTGGACAT CGAGGACCGC CGGGGCCAGC TGGAAGCCGC CGGGTACACC CTGTTCGTGC GCATCGACGA GCACCCGCCC CTGACCGTGC CCGAGCCGCG CAGGACCAGT CCGTTCAACT GGGACATTCC GCTGCTGTAA
|
Protein sequence | MDNRAPEREH HRTRLSDAEL AELLPPSVTA GAAASGSAAP DQADNLRTPE ILAPAGDMQA ALAAFAAGAD AVYLGLKHFS ARMQAENFST GELAALTDLA HSEGRRIYVA MNSMLKPGDT GAAGRLVARL ARDVRPDALI VQDLGMLDIA RQAGFAGELH LSTLANVTHP AALTVARDLG ASRVILPREL NIDEIRACGN ACPDDLDLEF FVHGALCYCV SGRCYWSSYM GGKSGLRGRC VQPCRRVYRQ KGREGRFFSC LDLSLDVLAK TLLSVPHMAS WKIEGRKKGP HYVYYAVSAY RLLRDNPDDP KARKQAEDIL EMALGRPGTH SRFLPQRGDD PTAPGEQTSS GLLAGKVGQT PEGGIFFKPR FQLLPQDLLR IGYEDESWHT TLPVARHIPK AGTFNLRLPR HKTPKLGTPV FLIDRREPEL TREVRTWQAK LERHRKGGGE GGAVDFTPTY PAPGKSARPL DVILRGSLPH GREGKAGVRP GTVMGLWLSP KALREVSRTL YPRISWWLPP VIWPDEEAQW LRMARESVRD GARHFVLNAP WQVGLFGDAH ARRDLALTAG PFCNTSNPAA LAVLRKLGFA AAIVSPELPG EDILALPRQS YLPLGVVLSG YWPMGVTRHH LEGLKPGEPF NSPKNEVFWA RRYGQNTWIY PGWPLDIEDR RGQLEAAGYT LFVRIDEHPP LTVPEPRRTS PFNWDIPLL
|
| |