Gene Dvul_0850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_0850 
Symbol 
ID4664372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp1044171 
End bp1047188 
Gene Length3018 bp 
Protein Length1005 aa 
Translation table11 
GC content67% 
IMG OID639819072 
Productpeptidase M16 domain-containing protein 
Protein accessionYP_966298 
Protein GI120601898 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0487817 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCATGA AACACTCCTC GATGCGTACA GGCGGGACGG CGGCCCTTGC CGCCCTTCTC 
TGCCTCACAG GTGGATGCCT GCGCGACAGG GCGCCATCGG CACCACCTGC ACTGGTCGCC
CCCGCTGAAT CCACTGCCGT CCCCTATGGC TGGTATGACG GCCGGTGGCC GCATGAACGC
CCGCTGCCCC CACACGAACG CCTCCTGCCA CACGACAGCG CACGCTTCGG ACGGCTGGCA
AACGGGTTGC GGTACGTCAT CGTCCCCAAC GCCAAACCGG AAGGACGAGT GAGTCTGCAT
CTCGACGTAC AGGCGGGGTC GCTCATGGAG ACTGACGGAC AACGCGGACT GGCGCACTTC
GTCGAACACA TGGCCTTCAA CGGGTCGCGC AATTTCGCAC CCGGTACGCT CATCCCCTTC
CTTCAGCGCA ACGGCATGGC CTTCGGGGCC GACGCCAACG CCCACACCAG CACGGCAGAG
ACCGTCTACA AACTCGACCT GCCTGCCGCC GACCCTGCGA CCATCGAAAA GGGATTGCTC
ATCCTGCGGG ATGTGGCCGA TGGGCTGCTC ATCCTGCCTG AAGAGGTCGA GAAGGAACGC
GGCGTCATCC TCGCCGAGAA GCTGGCGCGG GATAACCGCA GAAGCCGCGC GGGCAAGGCG
TTGCGCGACG TCCTGTATGC CGATAGCCGC TACGCCTTCG AGACCATCGG CCTCGAGGAT
GTCGTGCGCC ATGCCCGCCC CGAGACGCTG CGCGCCTTCT ACGACACATG GTACCGCCCC
GAACGCATGG TGCTTGTGGC TGTGGGGGCC GTCACCCCCG CCGACCTCGC GACCATGGTC
GAACGTCACT TCGCTGATGT GAAGGCGCGC AGCGGCGCAC CGGTCATGCC CGCGCCCGGC
AACGTCAGGC ACGAAGGCGT ACACGCAAGC TACGACCCGG AGACGGGAGG CGGCGTGACG
GTGAGCGTCA CCGCCATGCA CACGGCGCGA ACCGAGGTCG ACTCCATGAG CCTGCAACGC
CGCAGGCTGG CAGAGGCCGT CGCCACCAGC GCGTTCCAGA AGCGGTTGCT GCGCCTCGCC
TCCACGCCCG GTGCGCCCGT CCTTGGCGGT CATATGGCCA TGCCCGTCGG TTTCGAGATG
TTCGAGACCG CCACCATCAC CATGCGTGCC CGTGGCGAAG ACTGGCGCAA CGCGCTGACG
ACCGCAGAGA CGGAACTGCG CCGGGCCCTC GAACACGGCT TCACGCAAGA CGAGTTCGAG
AACGCGCGAC GCGTGTGCGA GGGGCTTTTC ACCACCATGC GCCGTGAGAA GGCCAACCGC
ACCAACAGCG ACATCGCCGC AGAGGCCGTG GCCTGCTTCA ACGCCGACAG GGTGTTCCAG
TCCACCGACC AGACCTGCGA CCTCTACCTG AACATGCTCC CAAGCCTCAC GCGAGGCGAG
GTCGAAGAGG CATTCCGCCG CCGCTGGGAC ACAGGCAACA GGATGCTGCA CCTCTCCGGC
ATGGCAGGCG TGGAGAACGC TTCGAAGCGC CTTGTCGAGG CGTGGGCCGA GAGTGCCGCA
CACGCCGTCG AAGCACCGCG TGACAGCGTC GCAACGGCCT TCCCCTACCT TGCCGAACCC
TCCCTGCCCG CGCTCGTGTT GCACGACTCT TCGCGCCAGC TGCCGGAAGG CCCGGCCCTT
CGCACCGTAC GCCTCGACAA CGGCCTCACC CTGCACATGG CCGTCACGCC CTACGAGAAG
GGACGCTTCA GCCTTTCGCT GTTCCACGGT GACGGGCTGG ACGGTCTGGA CGATGCGACC
TACGCGGTGG CACGCGCCAC CGAACGCACC CTCCGGGAGG AGGGGGTCGG ACGTCTCTCT
CGCGAGGCCA CGCGCGACCA TCTCGGCTGG AGGCATGTGA AGGTGGAGAC GGACTACCGC
GACGACGCCT TCACCATCAA CGCCAGCGGC CCCGGTGAGG AACTCGACGC CGTCACCGCC
GCCCTGTGGA CGCAGTACAC CGACCCCACC CCCACGAAGG CAGGACGGGC GCGGGCCATA
GAAGGGCTTG AAGCAGGCAG GGACGAACGC GAGAACACCG TGGAGGGCGT TGCCCCTCAC
CGCATTCGCG CCTTCCTCTA CGGCGACGCA CGCCGCACCG CACCTCTCGA CAGCCGGGAT
GTGGCCCGTG TGCCGCTCGA CGCCATGCGC GATTTCGCCC TTGCCCGCAC CACCACACCG
CCCCGCACCA TGGTCGCCGT CGGCGATTTC GACCCCGAAC GTCTCATCGA ACGGGTGCGC
CATCTCTTCG CCATGCCAAC ACCCGTGCCA GCCTCCCACG AGGCACCGCA TGAGGCCGTC
CGCTTCCCCG CCGGGGCACG GCGCACCGTC GAAGTGGCCG ACCCCGATGG CAAAGCGCAC
CTCGTGGTGG CGTGGCGGCA TGACCTTGAG GACGAATCAG ACCAGCGCGC GCTGGCCATC
CGCCACCTGA CCGCATCATG GCTCAACGAA GTGCTGCGCG AAGAGGTGCG AGAGGCCATC
GGCGCCGCGT ACTCCCCCTC GGGACGCTAT CGCCATGACC AGGAACGCGG GGGCTTCGGC
ACCTATGTCG CCTCGATTCG CACCGATGCG GCACTGGTCG ACAAGGTACG GCGGGCCGTG
CGCGAGGCGG CCCAAGGGCT GGCGAAGGGC GAAGTGCCCC CCGGCACGGC TGCGCGCCTG
CGCGCACCGG TACTCAACGC CATCACCAAG GCGCGCGACT CGAACACCTA CTGGCAGCGC
ATCATCGAAG CCGAAGTCCT GCGCGGCAGA ACCGCGGCCC GCCATGCCGA AGCCTTCGCC
AAGGCACTGG ACAGCGTCAC CGATGCCGAC ATCGCTGCGG AAGCGCGCGC CGTCTTCGCC
ACCAGAAGTG CCGAACTCGC CATCACGGGC AAGGCCCCTG CGAAGAAGCC CGGCGGCAAA
CCCTCCAAGG CTGCGGTCTC CCGAAAGATA CCCGCCGGGA CCACAACCGA AGAAACAGCC
CAAGGAGTAG CCCGATGA
 
Protein sequence
MSMKHSSMRT GGTAALAALL CLTGGCLRDR APSAPPALVA PAESTAVPYG WYDGRWPHER 
PLPPHERLLP HDSARFGRLA NGLRYVIVPN AKPEGRVSLH LDVQAGSLME TDGQRGLAHF
VEHMAFNGSR NFAPGTLIPF LQRNGMAFGA DANAHTSTAE TVYKLDLPAA DPATIEKGLL
ILRDVADGLL ILPEEVEKER GVILAEKLAR DNRRSRAGKA LRDVLYADSR YAFETIGLED
VVRHARPETL RAFYDTWYRP ERMVLVAVGA VTPADLATMV ERHFADVKAR SGAPVMPAPG
NVRHEGVHAS YDPETGGGVT VSVTAMHTAR TEVDSMSLQR RRLAEAVATS AFQKRLLRLA
STPGAPVLGG HMAMPVGFEM FETATITMRA RGEDWRNALT TAETELRRAL EHGFTQDEFE
NARRVCEGLF TTMRREKANR TNSDIAAEAV ACFNADRVFQ STDQTCDLYL NMLPSLTRGE
VEEAFRRRWD TGNRMLHLSG MAGVENASKR LVEAWAESAA HAVEAPRDSV ATAFPYLAEP
SLPALVLHDS SRQLPEGPAL RTVRLDNGLT LHMAVTPYEK GRFSLSLFHG DGLDGLDDAT
YAVARATERT LREEGVGRLS REATRDHLGW RHVKVETDYR DDAFTINASG PGEELDAVTA
ALWTQYTDPT PTKAGRARAI EGLEAGRDER ENTVEGVAPH RIRAFLYGDA RRTAPLDSRD
VARVPLDAMR DFALARTTTP PRTMVAVGDF DPERLIERVR HLFAMPTPVP ASHEAPHEAV
RFPAGARRTV EVADPDGKAH LVVAWRHDLE DESDQRALAI RHLTASWLNE VLREEVREAI
GAAYSPSGRY RHDQERGGFG TYVASIRTDA ALVDKVRRAV REAAQGLAKG EVPPGTAARL
RAPVLNAITK ARDSNTYWQR IIEAEVLRGR TAARHAEAFA KALDSVTDAD IAAEARAVFA
TRSAELAITG KAPAKKPGGK PSKAAVSRKI PAGTTTEETA QGVAR