Gene Dvul_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_2044 
Symbol 
ID4662334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp2381957 
End bp2384851 
Gene Length2895 bp 
Protein Length964 aa 
Translation table11 
GC content63% 
IMG OID639820287 
Productpeptidase M16C associated domain-containing protein 
Protein accessionYP_967487 
Protein GI120603087 
COG category[R] General function prediction only 
COG ID[COG1026] Predicted Zn-dependent peptidases, insulinase-like 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCTTC ACGGATTCGA ACTCATAGAC GAGACCAACC TCGAAGAACT CTCAAGCCGC 
GTACGCCGGT GGCGACATGT CGTCACGGGT GCGCAGCTTC TTTCCTTCTG CAATGCCGAC
GAGAACAAGG TGTTCGGTGT CAGCTTCCGC ACTCCCCCCG GCGACTCCAC AGGCGTCGCG
CATATCCTCG AACATTCGGT TCTTTGCGGT TCCGAGCGTT ATCCGGTCAA AGAACCGTTC
GTGGAACTGC TCAAGGGGTC GCTCCAGACA TTCCTGAACG CCTTCACCTA CCCTGATAAG
ACGTGCTACC CCGTGGCAAG CACCAACCTT CAGGATTTTC GTAATCTTGT TGATGTGTAC
CTCGACGCGG TGTTCTTTCC GCGTATCGAT GAGAACATCT TCCGGCAGGA AGGCTGGCAC
ATCGATGCCG AGACGGCTGA CGGCCCATGG AACTACAAGG GCGTCGTCTA TAACGAGATG
AAGGGCGTCT ATTCTTCGCC CGAGTCGGTA CTTTCGGAAC AGTCGCAGCA GGCGATCTTT
CCGGAGCATG TCTACGGGCT GGATTCGGGC GGCAACCCCG AGCGCATCCT TGAACTGACG
TATGAGCAGT TCCGCGACTT CCATCGTCGT TTCTATCACC CCGGCAACGG CCGTTTCTTC
TTCTGGGGCG ACGACCCAGA AGAAGCCCGC CTTGAGCATA TCGGGCGGGT GCTGTCGCGT
TTCGATAGGC TTGATGTGGA TTCCGCTGTG CCTCTCATGG GGCACCGTGA CACGCCTCGC
CTTCTTGAGG TGCCCTTCGC CGCCGGGGAA GGCGATACTC GTGGCATGGT CACGGTGAAC
TGGCTTCTTG ACGAGACCGT CGATGCAGAG CGCAACTTCG CTTTGCACAT GCTCGAACAC
ATCCTCCTGG GGATGCCCGG TTCGCCGCTG CGTCGTGCGC TCATCGAGTC GGGTCTGGGT
GAGGATGTCG CCGGTGTAGG TCTGGAAGCC GAACTGCGCC AGATGTACTT CTCCGTGGGG
CTCAAGGGCA TCGACCCGGC GGATGCCGAG CGGGTCGAGG TGCTTGTCAT GGATACGCTT
GCCTCCCTCG CAGAGGAGGG GGTTCCGTCC GACGCCATCG AAGCCGCTTT CAACAGCGTG
GAGTTCTCTT TGCGCGAGAA CAACACCGGA CGCTACCCGC GTGGTCTCGC CGTCATGGTG
CGTTCTCTCA CGACGTGGCT CTATGACGGA GACCCCCTGG CCCTGCTGGC GTTCGAGAAG
CCCCTTGCCG CAATCCGTGA CGCCATAGCC GCCGGAGGCT ACTTCGAGTC GCTCATCAGG
CGCTGCTTCC TCGACAATGC CCACCGCGCA ACGGTCTCGC TAGTGCCCGA CATGACGCTG
GAGGCGCGTC GTGAAGAGGC CGAGAACAAG CGCATCGAGA AGGTGCAGTC CGCACTTTCG
CCCTCCGATA GAGAAGCCGT GGTCTCTCTT GCGGCGACGT TGCGCGCCCT GCAGGAGGCT
CCTGATTCGC CGGAGGCGCT CGCCACCATA CCGCGTCTGG GCCTTGAAGA CCTTGCGCGC
GAGAACCGCC CCATTCCCAT CGAAGAACGC ACATCGGGCG ATGTCCCCGT CCTGTTCCAT
GACATCGACA CGTCCGGCAT CGTCTACTCC GAACTGCTGT TCGACTTGTC TGCCGTTCCT
GCAAGGCTTC TGCCGCTTGT TCCGCTCTTC GGGCGGGCCC TGCTCGAGAT GGGAACGGCG
CGTCACGACT TCGTGGCACT GGGGATGCGT ATCGCCGCGA AGACCGGCGG TATCGAGGCG
GATACGCTCT TCGCCACCAC CCGTGCCGGA CGCAAGCCTG TCGCTCACAT GGTGGTGAGC
GGCAAGGCCA CCCGTGACAA TGCCGCGGCC CTGGTCGACA TCATGCACGA AGTGCTGCAC
GAGGCGCTGT TCGATGACGC GGAGCGCTTC GGGCGCATGG TGCTGGAAGA AAAGGCCCGT
CAGGAGCATT CGCTGGTGCC TTCGGGCCAC GGCGTTGTAT CCAGCCGTTT GCGGGCCAGT
TTCTCCATGG CTGGCTGGCT TGACGAGGTG ACAGGTGGCA TCACCTACCT CATGGCCCTG
CGCGAACTCG CCGAACGTGT GCGTGACGAT TGGCAAGGGG TGCGCGACGA CCTTGAAACG
CTTCGCACAC TTGTCTTGCG TCGAAGCGGG GCTCTCTGCA ACCTGACGGC GGACAGCGCC
ACGGCGGCAG TGGCCATGCC CCTTTTCGAC GGGCTTGTCG CGGGACTGCC CGATACCGCA
GCCGATGCCG TGGTCTGGGC ACCGGATGCA CTGCCTGCTG CAGAGGCGCT GGTCGTTCCC
GCGCAGGTGA ACTACGTGGG CAAGGGCGCC AACCTCTATG ACCTTGGCTA CGCCTATCAT
GGTTCGGTGA GCGTGGTGCT CAAGCACCTC CGCATGGCGT TCCTGTGGGA CCGGGTGCGC
GTGCAGGGCG GGGCCTACGG CGCGTTCTGC GCGTTCGACC GCATGAGCGG GGCCTTCACG
CAGGTTTCAT ACCGCGACCC CAATGTGGAG CGCACCCTTG ATGTCTACGA CAAGTGCGCC
GAGTATCTGC GCACGGTGGA ACTCGATGAC GCCGCACTCA CAAGCGCCAT CGTCGGTGCC
ATCGGCGACC TTGACATGCA CATGCTGCCC GACGCGCGCG GTGAGGCCTC GATGCTACGC
CATCTTACCG GTGATACGGA AGATGTCAGG CAGACGATGC GCGAGCAGAT GCTGGCGACC
ACGCAGCGGC ACTTCCGGGA ATTCGCCGAT GTGCTGGATG CGGTGGCACG CACGGGCAGG
GTGTGCGTCC TCGGTGGAGG CAGTCTCGAC GCCGTTGCGG CAACGCGCGG CTGGCAGGCT
TTGCGGGTGC TGTAG
 
Protein sequence
MQLHGFELID ETNLEELSSR VRRWRHVVTG AQLLSFCNAD ENKVFGVSFR TPPGDSTGVA 
HILEHSVLCG SERYPVKEPF VELLKGSLQT FLNAFTYPDK TCYPVASTNL QDFRNLVDVY
LDAVFFPRID ENIFRQEGWH IDAETADGPW NYKGVVYNEM KGVYSSPESV LSEQSQQAIF
PEHVYGLDSG GNPERILELT YEQFRDFHRR FYHPGNGRFF FWGDDPEEAR LEHIGRVLSR
FDRLDVDSAV PLMGHRDTPR LLEVPFAAGE GDTRGMVTVN WLLDETVDAE RNFALHMLEH
ILLGMPGSPL RRALIESGLG EDVAGVGLEA ELRQMYFSVG LKGIDPADAE RVEVLVMDTL
ASLAEEGVPS DAIEAAFNSV EFSLRENNTG RYPRGLAVMV RSLTTWLYDG DPLALLAFEK
PLAAIRDAIA AGGYFESLIR RCFLDNAHRA TVSLVPDMTL EARREEAENK RIEKVQSALS
PSDREAVVSL AATLRALQEA PDSPEALATI PRLGLEDLAR ENRPIPIEER TSGDVPVLFH
DIDTSGIVYS ELLFDLSAVP ARLLPLVPLF GRALLEMGTA RHDFVALGMR IAAKTGGIEA
DTLFATTRAG RKPVAHMVVS GKATRDNAAA LVDIMHEVLH EALFDDAERF GRMVLEEKAR
QEHSLVPSGH GVVSSRLRAS FSMAGWLDEV TGGITYLMAL RELAERVRDD WQGVRDDLET
LRTLVLRRSG ALCNLTADSA TAAVAMPLFD GLVAGLPDTA ADAVVWAPDA LPAAEALVVP
AQVNYVGKGA NLYDLGYAYH GSVSVVLKHL RMAFLWDRVR VQGGAYGAFC AFDRMSGAFT
QVSYRDPNVE RTLDVYDKCA EYLRTVELDD AALTSAIVGA IGDLDMHMLP DARGEASMLR
HLTGDTEDVR QTMREQMLAT TQRHFREFAD VLDAVARTGR VCVLGGGSLD AVAATRGWQA
LRVL