Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_2044 |
Symbol | |
ID | 4662334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | - |
Start bp | 2381957 |
End bp | 2384851 |
Gene Length | 2895 bp |
Protein Length | 964 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639820287 |
Product | peptidase M16C associated domain-containing protein |
Protein accession | YP_967487 |
Protein GI | 120603087 |
COG category | [R] General function prediction only |
COG ID | [COG1026] Predicted Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCTTC ACGGATTCGA ACTCATAGAC GAGACCAACC TCGAAGAACT CTCAAGCCGC GTACGCCGGT GGCGACATGT CGTCACGGGT GCGCAGCTTC TTTCCTTCTG CAATGCCGAC GAGAACAAGG TGTTCGGTGT CAGCTTCCGC ACTCCCCCCG GCGACTCCAC AGGCGTCGCG CATATCCTCG AACATTCGGT TCTTTGCGGT TCCGAGCGTT ATCCGGTCAA AGAACCGTTC GTGGAACTGC TCAAGGGGTC GCTCCAGACA TTCCTGAACG CCTTCACCTA CCCTGATAAG ACGTGCTACC CCGTGGCAAG CACCAACCTT CAGGATTTTC GTAATCTTGT TGATGTGTAC CTCGACGCGG TGTTCTTTCC GCGTATCGAT GAGAACATCT TCCGGCAGGA AGGCTGGCAC ATCGATGCCG AGACGGCTGA CGGCCCATGG AACTACAAGG GCGTCGTCTA TAACGAGATG AAGGGCGTCT ATTCTTCGCC CGAGTCGGTA CTTTCGGAAC AGTCGCAGCA GGCGATCTTT CCGGAGCATG TCTACGGGCT GGATTCGGGC GGCAACCCCG AGCGCATCCT TGAACTGACG TATGAGCAGT TCCGCGACTT CCATCGTCGT TTCTATCACC CCGGCAACGG CCGTTTCTTC TTCTGGGGCG ACGACCCAGA AGAAGCCCGC CTTGAGCATA TCGGGCGGGT GCTGTCGCGT TTCGATAGGC TTGATGTGGA TTCCGCTGTG CCTCTCATGG GGCACCGTGA CACGCCTCGC CTTCTTGAGG TGCCCTTCGC CGCCGGGGAA GGCGATACTC GTGGCATGGT CACGGTGAAC TGGCTTCTTG ACGAGACCGT CGATGCAGAG CGCAACTTCG CTTTGCACAT GCTCGAACAC ATCCTCCTGG GGATGCCCGG TTCGCCGCTG CGTCGTGCGC TCATCGAGTC GGGTCTGGGT GAGGATGTCG CCGGTGTAGG TCTGGAAGCC GAACTGCGCC AGATGTACTT CTCCGTGGGG CTCAAGGGCA TCGACCCGGC GGATGCCGAG CGGGTCGAGG TGCTTGTCAT GGATACGCTT GCCTCCCTCG CAGAGGAGGG GGTTCCGTCC GACGCCATCG AAGCCGCTTT CAACAGCGTG GAGTTCTCTT TGCGCGAGAA CAACACCGGA CGCTACCCGC GTGGTCTCGC CGTCATGGTG CGTTCTCTCA CGACGTGGCT CTATGACGGA GACCCCCTGG CCCTGCTGGC GTTCGAGAAG CCCCTTGCCG CAATCCGTGA CGCCATAGCC GCCGGAGGCT ACTTCGAGTC GCTCATCAGG CGCTGCTTCC TCGACAATGC CCACCGCGCA ACGGTCTCGC TAGTGCCCGA CATGACGCTG GAGGCGCGTC GTGAAGAGGC CGAGAACAAG CGCATCGAGA AGGTGCAGTC CGCACTTTCG CCCTCCGATA GAGAAGCCGT GGTCTCTCTT GCGGCGACGT TGCGCGCCCT GCAGGAGGCT CCTGATTCGC CGGAGGCGCT CGCCACCATA CCGCGTCTGG GCCTTGAAGA CCTTGCGCGC GAGAACCGCC CCATTCCCAT CGAAGAACGC ACATCGGGCG ATGTCCCCGT CCTGTTCCAT GACATCGACA CGTCCGGCAT CGTCTACTCC GAACTGCTGT TCGACTTGTC TGCCGTTCCT GCAAGGCTTC TGCCGCTTGT TCCGCTCTTC GGGCGGGCCC TGCTCGAGAT GGGAACGGCG CGTCACGACT TCGTGGCACT GGGGATGCGT ATCGCCGCGA AGACCGGCGG TATCGAGGCG GATACGCTCT TCGCCACCAC CCGTGCCGGA CGCAAGCCTG TCGCTCACAT GGTGGTGAGC GGCAAGGCCA CCCGTGACAA TGCCGCGGCC CTGGTCGACA TCATGCACGA AGTGCTGCAC GAGGCGCTGT TCGATGACGC GGAGCGCTTC GGGCGCATGG TGCTGGAAGA AAAGGCCCGT CAGGAGCATT CGCTGGTGCC TTCGGGCCAC GGCGTTGTAT CCAGCCGTTT GCGGGCCAGT TTCTCCATGG CTGGCTGGCT TGACGAGGTG ACAGGTGGCA TCACCTACCT CATGGCCCTG CGCGAACTCG CCGAACGTGT GCGTGACGAT TGGCAAGGGG TGCGCGACGA CCTTGAAACG CTTCGCACAC TTGTCTTGCG TCGAAGCGGG GCTCTCTGCA ACCTGACGGC GGACAGCGCC ACGGCGGCAG TGGCCATGCC CCTTTTCGAC GGGCTTGTCG CGGGACTGCC CGATACCGCA GCCGATGCCG TGGTCTGGGC ACCGGATGCA CTGCCTGCTG CAGAGGCGCT GGTCGTTCCC GCGCAGGTGA ACTACGTGGG CAAGGGCGCC AACCTCTATG ACCTTGGCTA CGCCTATCAT GGTTCGGTGA GCGTGGTGCT CAAGCACCTC CGCATGGCGT TCCTGTGGGA CCGGGTGCGC GTGCAGGGCG GGGCCTACGG CGCGTTCTGC GCGTTCGACC GCATGAGCGG GGCCTTCACG CAGGTTTCAT ACCGCGACCC CAATGTGGAG CGCACCCTTG ATGTCTACGA CAAGTGCGCC GAGTATCTGC GCACGGTGGA ACTCGATGAC GCCGCACTCA CAAGCGCCAT CGTCGGTGCC ATCGGCGACC TTGACATGCA CATGCTGCCC GACGCGCGCG GTGAGGCCTC GATGCTACGC CATCTTACCG GTGATACGGA AGATGTCAGG CAGACGATGC GCGAGCAGAT GCTGGCGACC ACGCAGCGGC ACTTCCGGGA ATTCGCCGAT GTGCTGGATG CGGTGGCACG CACGGGCAGG GTGTGCGTCC TCGGTGGAGG CAGTCTCGAC GCCGTTGCGG CAACGCGCGG CTGGCAGGCT TTGCGGGTGC TGTAG
|
Protein sequence | MQLHGFELID ETNLEELSSR VRRWRHVVTG AQLLSFCNAD ENKVFGVSFR TPPGDSTGVA HILEHSVLCG SERYPVKEPF VELLKGSLQT FLNAFTYPDK TCYPVASTNL QDFRNLVDVY LDAVFFPRID ENIFRQEGWH IDAETADGPW NYKGVVYNEM KGVYSSPESV LSEQSQQAIF PEHVYGLDSG GNPERILELT YEQFRDFHRR FYHPGNGRFF FWGDDPEEAR LEHIGRVLSR FDRLDVDSAV PLMGHRDTPR LLEVPFAAGE GDTRGMVTVN WLLDETVDAE RNFALHMLEH ILLGMPGSPL RRALIESGLG EDVAGVGLEA ELRQMYFSVG LKGIDPADAE RVEVLVMDTL ASLAEEGVPS DAIEAAFNSV EFSLRENNTG RYPRGLAVMV RSLTTWLYDG DPLALLAFEK PLAAIRDAIA AGGYFESLIR RCFLDNAHRA TVSLVPDMTL EARREEAENK RIEKVQSALS PSDREAVVSL AATLRALQEA PDSPEALATI PRLGLEDLAR ENRPIPIEER TSGDVPVLFH DIDTSGIVYS ELLFDLSAVP ARLLPLVPLF GRALLEMGTA RHDFVALGMR IAAKTGGIEA DTLFATTRAG RKPVAHMVVS GKATRDNAAA LVDIMHEVLH EALFDDAERF GRMVLEEKAR QEHSLVPSGH GVVSSRLRAS FSMAGWLDEV TGGITYLMAL RELAERVRDD WQGVRDDLET LRTLVLRRSG ALCNLTADSA TAAVAMPLFD GLVAGLPDTA ADAVVWAPDA LPAAEALVVP AQVNYVGKGA NLYDLGYAYH GSVSVVLKHL RMAFLWDRVR VQGGAYGAFC AFDRMSGAFT QVSYRDPNVE RTLDVYDKCA EYLRTVELDD AALTSAIVGA IGDLDMHMLP DARGEASMLR HLTGDTEDVR QTMREQMLAT TQRHFREFAD VLDAVARTGR VCVLGGGSLD AVAATRGWQA LRVL
|
| |