Gene Dvul_0074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_0074 
Symbol 
ID4664110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp89298 
End bp91391 
Gene Length2094 bp 
Protein Length697 aa 
Translation table11 
GC content67% 
IMG OID639818267 
Productpeptidase U32 
Protein accessionYP_965525 
Protein GI120601125 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGTGGT GTTTGCGCTT GCTCCCGGCG CGAGCTATAT TTATATGTCA TCGGCGCGCC 
GTCCGGCGCG ACGCCACCCG AACGCGGCCC CTTGCCGCCA CCCCGGACCA TTCCGCCATG
CATCGCAAGC CAGAGATCCT CGCCCCCGCG GGCGACACCT CCGCCTTTCT CGCCGCCATC
GCCGCAGGCG CCGACGCCGT ATATCTCGGT CTCAAGCACT TCTCCGCCCG TATGCAGGCC
GACAACTTCG GCCTCACCGA GCTCTCGCGC CTCGTCGAAC TGGCCCATAC CCACGACCGC
CGTGTCTACG TGGCGCTCAA CACCATGCTC AAGCCCGGCG ACCCCGACGC CGCCGGACGC
CTCATCGCCC GCCTCGCCCG CGATGTGATG CCCGACGCCC TCATCCTGCA AGACCTCGGT
GTGGCCGAGG TGGCGCGTCA GGCGGGGTTC AAGGGTGAGT TGCACTTCTC CACCCTCGCC
AACGTCACCC ACGCCGACGG GCTGCGCGCC GCCAAGGCCT TCGGGGCCGA CCGCGTCATC
CTGCCGCGCG AACTGAACAT CGACGAGGTG CGGAGCATGG GCGAAGCCTG CCCCGAAGGG
CTCGACCTCG AACTCTTCGT GCACGGGGCG CTGTGCTACT GCGTCTCGGG CCGCTGCTAC
TGGTCGAGCT ACATGGGCGG CAAGAGCGGC CTGCGCGGGC GCTGCGTGCA GCCGTGCCGC
CGCGTCTACC GCCAGAAGGG GCGCGAGGGC CGCTACTTCT CATGCCTCGA CCTGTCGCTG
GACGTGCTGT CCAAGACGTT GCTCGACGTG CCCAATCTCT CTTCGTGGAA GATAGAAGGC
CGCAAGAAGG GGCCGCATTA CGTCTTCCAT GTGGTCACCG CCTACCGCAT GTTGCGTGAC
AACCCCGACG ACCCGCAGGT GCGGAAGGAT GCCGAACGCA TCCTGAAGAT GGCGCTGGGC
CGTCCGGGAA CCCATGCCGG GTTCCTGCCG CAACGCTCGA AGGGCCCCAC CGCCTTCGAC
GAGCAGACCA GTTCCGGTCT TCTCGTCGGC AAGATTCAGC ACGACAAGGA AGGCACGGTC
TACTTCAAGC CGCGCATCGA ACTGCTGCCG CAGGACTACC TGCGCATCGG CTATGAAGAT
GAACCGTGGC ATGCCACGCA GCCCGTACCG CGCCGCATCC CCAAGGCCGG TTCGTTCACC
TTGCGGATGC CCCGCCACAA GACGCCCAAG GCGGGCACGC CCGTGTTCCT CATCGACCGC
CGCGAGCCGG AGATGATGCA CATCCTGCGC GAATGGCAGA ACCGCCTCGC CAAGTGCCAC
GGGCGCAAGG CCACAGCCGT GGACTTCGAA CCGAAGCTGC CCGCCCCCGT ACGCGGACGC
AAGCGCCCAG ACATGGTGCT GCGCAGTTCC GTCCCCCATG GCATGGAGAC GCGCGGTTCG
CGTGGCGTCG TCACCGGGCT GTGGCTGTCG CCCAAGTCGG TGCAGGAGGT CTCGCGCACG
GTGTTCGGGC GTATGGCGTG GTGGCTTTCG CCGGTCATCT GGCCCGACGA GGAGGACGCC
CACCGGCGCA TGGTGGTGCA GGCGCTACGC AACGGTGCGC GGCACTTCGT GCTCAATGCG
CCGTGGCAGC TCGGCCTTTT CCCGGAGCGT GACGACCTGG ACTTCATCGC CGGGCCATTC
TGCAACATCG GCAACGCGGC GGCGCTGGCA TGCCTGCGCG AGCACGGCTT CTCCGCCGCC
ATCGTCAGCC CCGAACTTGC GGGCGAAGAC CTGCTCGCCC TGCCCAAAGA AAGCTGCCTG
CCGCTGGGTG TGGTGCTTTC CGGCTTCTGG CCCATGGGCA TCGCGCGGCA TCAGCTTGAG
GGCGTGAAGC CCGGCGAACC GTTCTCCAGC CCCAAGGGCG AGGTGTTCTG GGCTCGTCGC
TACGGTCAGA ACACGTGGAT ATACCCCGGC TGGCCGCTGG ACATCAGCGC CCGTAGGGCT
GAACTGGATG CAGCGGGCTA TGCCTTCGGC GTGCGCATCG ACGAATATCC GCCCAAGAAC
CTGCCCGAGG CGCGTCGCGC CAGCACGTTC AACTGGGACC TGCCGCTGCT GTAG
 
Protein sequence
MAWCLRLLPA RAIFICHRRA VRRDATRTRP LAATPDHSAM HRKPEILAPA GDTSAFLAAI 
AAGADAVYLG LKHFSARMQA DNFGLTELSR LVELAHTHDR RVYVALNTML KPGDPDAAGR
LIARLARDVM PDALILQDLG VAEVARQAGF KGELHFSTLA NVTHADGLRA AKAFGADRVI
LPRELNIDEV RSMGEACPEG LDLELFVHGA LCYCVSGRCY WSSYMGGKSG LRGRCVQPCR
RVYRQKGREG RYFSCLDLSL DVLSKTLLDV PNLSSWKIEG RKKGPHYVFH VVTAYRMLRD
NPDDPQVRKD AERILKMALG RPGTHAGFLP QRSKGPTAFD EQTSSGLLVG KIQHDKEGTV
YFKPRIELLP QDYLRIGYED EPWHATQPVP RRIPKAGSFT LRMPRHKTPK AGTPVFLIDR
REPEMMHILR EWQNRLAKCH GRKATAVDFE PKLPAPVRGR KRPDMVLRSS VPHGMETRGS
RGVVTGLWLS PKSVQEVSRT VFGRMAWWLS PVIWPDEEDA HRRMVVQALR NGARHFVLNA
PWQLGLFPER DDLDFIAGPF CNIGNAAALA CLREHGFSAA IVSPELAGED LLALPKESCL
PLGVVLSGFW PMGIARHQLE GVKPGEPFSS PKGEVFWARR YGQNTWIYPG WPLDISARRA
ELDAAGYAFG VRIDEYPPKN LPEARRASTF NWDLPLL