Gene Information |
Locus tag | Dde_2677 |
Symbol | |
ID | 3757700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | - |
Start bp | 2686067 |
End bp | 2689207 |
Gene Length | 3141 bp |
Protein Length | 1046 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637783579 |
Product | PreP peptidase |
Protein accession | YP_389168 |
Protein GI | 78357719 |
COG category | [R] General function prediction only |
COG ID | [COG1026] Predicted Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
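
As a consistency check on the fields above, the following minimal Python sketch recomputes Gene Length and Protein Length from Start bp and End bp. It assumes the coordinates are 1-based and inclusive, that the CDS is unspliced (as the record states), and that the final codon is a stop codon not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
# Values copied from the record above.
START_BP = 2686067
END_BP = 2689207


def gene_length(start: int, end: int) -> int:
    """Length in bp of a closed, 1-based interval [start, end]."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS.

    Assumes the gene length is a whole number of codons and that the
    last codon is a stop codon, which is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(f"Gene length: {length_bp} bp, protein length: {length_aa} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.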
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCGGAAC CGCCGTGCAC GGTGCTGGTC TGCGCGCGGT ACGGCGGGCT GCGCGGTGTT GCATTTTGCA GTGATCCTTG CTTTCCCCGC AGGGGTGCCT ATATTCCGCT GGCGCGGTTG TGTCCGGCTG CCGGACATAC GCCGCATGCG CTGCGCCGCA GGTATCTGCA GCTATTGTCA GTAGTCTTGA ACTGCCGCCC AGAGCGGAGT TTTTTTCGAG TATCCGGATA CGGGAGGTTC ACCTTGCACA AGCATGGTTT TACACTGGTA GAAGAACGCG AGATAAAAGA GCTTTCCAGC AGAGCGCGGT TGTGGCGGCA CGATGCCACC GGTGCCGCGC TGCTTTCCAT GTCCAACGAC GACGAAAACA AGGTCTTCGG GGTGAGCTTC CGCACACCGC CGCACGATTC CACGGGTGTC GCGCATATTC TGGAGCACTC GGTCTTGTGC GGGTCAGAAA AATACCCTGT GAAAGAACCT TTTGTGGAAT TGCTCAAAGG TTCTCTGCAG ACGTTCCTCA ATGCGTTCAC CTATCCCGAC AAAACGTGCT ATCCGGTTGC CAGTACCAAT CTGGCCGATT TTTACAACCT TGTGGACGTG TATCTGGACG CCGCGTTTTT CCCGCGGATC ACTCCCGAAA TATTTCAGCA GGAAGGCTGG CATTACGAGA CGGCGCAGGA CGGCACGCTG ACCTACAAAG GCGTGGTCTT CAACGAGATG AAGGGCGTGT ATTCGTCTCC TGAAAGCATT CTTGCCGAAC GTTCGCAGCA GGCGCTTTTT CCTGACATAA CCTACGGGCT CGATTCCGGC GGTAATCCGG AACATATTCC GGATCTGACG TATGAGGCGT TCAAGGCGTT TCATGAAACG TACTACCATC CTGCCAACGC ACGTTTTTAT TTCTGGGGCG ATGATCCGGA AGACCGTCGT CTGGCCGTGC TGTCGCAGTT GCTGGACCGT TTCGGTCCAC TGGATGTTCA GTCCGAAGTG CCGCTGCAGC AGACATTTGA TACGCCGAAA CGTCTGGAAG TACCGTACGC CGCCGGTGCG GACAGCGACA GGCGGGGCAT GATGACCGTC AACTGGCTGC TGCCGGAAAC ATCCGACGCG GAGACCAACT TTGCCCTGCA GATGCTTGAC CACATTCTGG TGGGGTTGCC TGCCTCGCCG CTGCGCAAGG CGCTCATAGA GTCGGGGCTG GGTGAAGATC TTGCCGGAGG CGGACTGGAA AATGAACTGC GGCAGCTGTA TTTTTCCACC GGACTCAAAG GGATAGACCC CGCAGATGAG GAACAGATCA GCAGTCTGAT TTTTTCGGTG CTGCGCGGGC TGGCCGAAGA CGGCGTGCCT GCCGCCAGCA TCGAGGCTGC CATGAACACG GTGGAATTCG ATCTGCGCGA AAACAATACG GGCCGTTTTC CCGTGGGGCT GGCGGTGATG ACCCGCGCGC TGACCACGTG GCTTTATGAC GGCGATCCTT TTTCGCAGCT GGCATTTGAA AAGCCCGTCA ACGCCATCAG GGCGCGTGTG GCCGTCGGTG AGCCCGTGTT TGAAAATCTT ATCCGCCGCT GGCTGCTGGA CAACACGCAC AGGGCCACGG TGGTGCTGGT TCCTGCTGAA AACAGCGACA AGGCCCGTGC GGAAAGAGAA AAGTCGCGGC TGGCCGCTGT GCGTGCCTCG CTGGATGAGC AGGGGCTTGA GGCCGTGCGC GCCAATGCCG AAAAACTGCG CCGCATGCAG GAAGAACCTG ATACTCCCGA GGCGCTGTCC ACCATACCGC GGTTGGGGCT GCACGACCTT 
GCCGCCGTCA ATACGCCCAT ACCGGCACAG GAATCGCCGC TGGACGGTAT GCGTCTGATA ACGCACGACA TTGACGCCAA AGGTATTCTG TATGTCGATG CAGGGTTCAG CCTGAACAGG ATTCCCGCGG GACTGGTGCC GCTGGTGCCG CTGCTGGGCA GGGCCATGGT GGAAATGGGC ACATCGCGGT ACGATTTTGT AGAGATGGGC ATGCGCATCG CCTGCAAGAC AGGCGGTATA GATGCCGACA GCGTGGTGCT GACCCGTGTG GACGACCGTA CTCCCGATGC CCGTCTGTTT GTGCAGGGCA AGGCGACGCA GGCTAACGCC GCAGCGTTGT TTGAGCTGAT GCGCGATGTG CTGCTGGACG CGCAGCTTGA CCAGAAAGAG CGCTTCCGGT CCATTCTGCT GGAAGAAAAA GCGCGTATGG AGCACCGTCT GGTACCTGCC GGACATATGG TGGTCATGTC GCGTCTGCGG TCGCATTTCG GCAAAAGCGG CTGGCTGGGT GAACAGATGG ACGGTCTGGC TGCGCTGGAA TACCTGCGCG AACTTGTGCG GCGTGTTGAT GAAGACTGGA ACGGCGTGCT GGCCGACCTG ACGGCCGTCC GTCAGGCGCT GGTGGGCAGA GCCGGTGCGG TGCTCAACAT GACGGGCAGC GGATCAACAC TGGCAGCGGC CATGCCGCAT GCATCGTCTT TTGCCGCTTC GCTGCCCGAA GGACAGGATG TACCCCCGGC CTGGTTTGCC GATATCCGTC CTGTGCATGA GGCACTGTGC GTCCCGTCGC AGGTTAACTA CGTGGGCAAG GCGGCCGACC TTTACAGTCT GGGGTACCGG TATCACGGTT CGGCCAATGT GATTTTCAAG CATCTGCGCA TGGCCTGGCT GTGGGACAAG GTGAGGGTGC AGGGCGGGGC CTACGGTGCT TTCTGCGCCT TTGACCGTGC AAGCGGTGTT CTGGCACAGG TGTCGTACCG CGACCCCAAT CTGGAAGCCA CGCTTGACGT GTATGACCGC AGCGCTGAAT ATCTGCGTTC GCTGTCTCTG ACAAAGGATG AGCTGGTCAC CTCGGTGGTG GGAGCCATCG GCGAACTGGA CGCCCATATG CTGCCCCATG ACCGCGGCAT GGCATCGCTG GCGCGCACAC TGACGGGTGA TACGGAAGAA CGCCGCCAGC AGATGCGTGA TGAGATTCTG TCCACCACCC CCGAAGACTT TGTCCGCTTT GCCGATGTGC TGGCCGAGGC GGCCCGGACA GGCACTGTGT GCGTTCTGGG CGGGGCCGGA GTGGAAGAGG CTGCGGAGCG GAACGGCTGG GCAGTGAAGA GCATTATGTA A
|
Protein sequence | MPEPPCTVLV CARYGGLRGV AFCSDPCFPR RGAYIPLARL CPAAGHTPHA LRRRYLQLLS VVLNCRPERS FFRVSGYGRF TLHKHGFTLV EEREIKELSS RARLWRHDAT GAALLSMSND DENKVFGVSF RTPPHDSTGV AHILEHSVLC GSEKYPVKEP FVELLKGSLQ TFLNAFTYPD KTCYPVASTN LADFYNLVDV YLDAAFFPRI TPEIFQQEGW HYETAQDGTL TYKGVVFNEM KGVYSSPESI LAERSQQALF PDITYGLDSG GNPEHIPDLT YEAFKAFHET YYHPANARFY FWGDDPEDRR LAVLSQLLDR FGPLDVQSEV PLQQTFDTPK RLEVPYAAGA DSDRRGMMTV NWLLPETSDA ETNFALQMLD HILVGLPASP LRKALIESGL GEDLAGGGLE NELRQLYFST GLKGIDPADE EQISSLIFSV LRGLAEDGVP AASIEAAMNT VEFDLRENNT GRFPVGLAVM TRALTTWLYD GDPFSQLAFE KPVNAIRARV AVGEPVFENL IRRWLLDNTH RATVVLVPAE NSDKARAERE KSRLAAVRAS LDEQGLEAVR ANAEKLRRMQ EEPDTPEALS TIPRLGLHDL AAVNTPIPAQ ESPLDGMRLI THDIDAKGIL YVDAGFSLNR IPAGLVPLVP LLGRAMVEMG TSRYDFVEMG MRIACKTGGI DADSVVLTRV DDRTPDARLF VQGKATQANA AALFELMRDV LLDAQLDQKE RFRSILLEEK ARMEHRLVPA GHMVVMSRLR SHFGKSGWLG EQMDGLAALE YLRELVRRVD EDWNGVLADL TAVRQALVGR AGAVLNMTGS GSTLAAAMPH ASSFAASLPE GQDVPPAWFA DIRPVHEALC VPSQVNYVGK AADLYSLGYR YHGSANVIFK HLRMAWLWDK VRVQGGAYGA FCAFDRASGV LAQVSYRDPN LEATLDVYDR SAEYLRSLSL TKDELVTSVV GAIGELDAHM LPHDRGMASL ARTLTGDTEE RRQQMRDEIL STTPEDFVRF ADVLAEAART GTVCVLGGAG VEEAAERNGW AVKSIM
|
| |