Gene Dvul_0853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_0853 
Symbol 
ID4664347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp1050386 
End bp1053151 
Gene Length2766 bp 
Protein Length921 aa 
Translation table11 
GC content54% 
IMG OID639819075 
Producttype III restriction enzyme, res subunit 
Protein accessionYP_966301 
Protein GI120601901 
COG category[V] Defense mechanisms 
COG ID[COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.237382 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0361479 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAGT CCAACCAAAA TCCAGAGCAA AAAGCGCGAG ATGCTATCGA CAAGACGCTT 
GACGAGTGCG GTTGGGCTGT TCAGGACAAA AAGAAAATCA ACTTCAATGC CGGGCTGGGT
GTCGCTGTCA GGGAATACCC GACAGATGCG GGGCCTGCTG ATTATGTTCT CTTTGTTGAC
AGAAAGCCTG TCGGCGTCAT CGAAGCCAAA CGTGAAGAGG AAGGACAACA CCTTGTCACT
GTCGCTGAAC AATCAGCTGA ATACGCCAAC GCTCAGCTGA AATGGGTTCA GGATAATTCT
CCGCTCCCTT TCATCTACGA AAGCACTGGG GTAATCACAA TTTTCCGGGA TCAGCGGGAC
CCCAAGCCCC GTTCCCGGGA GGTGTTTAGC TTCCATCGAC CAGAGACATT CCAAGAATGG
CTGGCGCAGG ATGATACACT GAGGGGTAGA CTCGAAAAAC TTCCTCCTCT TCCAACACAA
GGGCTTCGAG ACTGCCAAAA GAGAGCGATT CAAAATCTTG AAGGCTCGTT CAAGCTGGCG
AAGCCCAAGG CTCTCGTGCA GATGGCGACT GGAGCCGGAA AGACTTACAC CGCCATTACC
TCGGTGTATC GACTTTTCAA GCACGCCAAT GCGAAGCGAA TCCTTTTCCT TGTCGACACC
AAGAACCTGG GAGAACAGGC GGAACAGGAG TTCATGGCGT TCACTCCCAA TGACTCGAAC
GATCTTTTCA CGCGCCTGTA TAATGTCCAG CGGCTCAAAT CCAGCTACAT CCCGGATTCA
TCCCACGTCT GCATTTCCAC GATTCAGCGG CTCTACTCCA TTCTGAAGGG TGTGGAACTG
GACGAGGCTG CGGAAGAGTT CAACCCCGCT GAACATGTGG GGCCGAAAGC TCCTCTGCCT
GTTGTTTACA ATGGTAAGAT TCCCATCGAG TTTTTCGACT TCATCGTCAT AGACGAGTGC
CACCGCTCCA TTTACAACCT CTGGCGGCAG GTGCTGGACT ATTTCGACTC CTTCCTGATC
GGCTTGACCG CCACTCCTGA CAAACGCACT TTCGCCTTTT TCAACGAAAA CGTTGTCAGC
GAATACTCCC ATGAGGACGC TGTCGCAGAC GGGGTGAACG TCGGCTATGA CGTCTACACC
ATTGAGACGC AGATCACGAA GCAAGGTGCT GTTCTCAAGG CGAACCAGGC CATCGAGAAG
AGAAATCGCC TCTCCCGTAG GAAGCGATGG GAACAGCAAG ACGAGGATGA AGCCTATTCC
GGCGCGCAAC TGGACAAGGA CGTGGTCAAC CCGAGCCAGA TTCGCAATGT CATCAAAACC
TACAAAGACA AGCTGCCAGC CATCTTTCCA GGACGCTGCG AAGTGCCAAA GACGCTCATC
TTCGCCAAGA CCGACAGCCA TGCGGACGAC ATCATTCAAA TAGTCCGGGA GGAGTTTGGC
GAAGGAAACG AATTCTGCAA AAAGGTCACG TACAAGGCCA ATGAGGACCC GAAGTCTGTC
CTGGCGCAGT TCAGGAACAG CTACAATCCC CGCATCGCGG TGACTGTGGA CATGATCGCC
ACGGGAACGG ACGTGAAGCC CCTGGAGTGC CTTCTCTTCA TGCGCGATGT TCGGAGCAAG
AACTACTTCG AGCAGATGAA GGGGCGCGGC ACCAGAACCC TGGACTTTGA TGGGTTGAAA
AAGGTTACGC CTTCAGCATC CAGCAACAAG ACCCATTTCG TCATCGTGGA CGCCATCGGT
GTCACCAAGT CGCACAAGAC AGACAGCCGT GCTCTGGAGC GCAAGCCCAC AGTCTCCCTG
AAAGACCTGC TTATGAGCGT GATGATGGGA GCGACGGACG ACGACACCCT GACCAGTCTG
GCAAGCAGGC TGACACGCCT GAACCACCAG CTCACTCCAG AGGATCAGAA ACGCATCGAA
GTGAAAACGG ATGGGGTTCC CCTCTCACAA ATCACAAAAG ACTTGTTGGG GTCCATGAAC
CCCGACAAGA TCGACGCCAA GGCGCGGGAG CAATTCAAGC TCACGGATGA ACAGGAGCCC
ACGGATGACC AGAGCACCAA GGCCAAGCAG GAACTGGTCA AGAACGCGAC CACGGTGTTC
ACTGGCGAGG TCTGCGAGCT GCTGGACACC ATACGCCGGG AGCACGAACA GACCATCGAC
ACCCACAATA TCGATACGGT CCTCCGGGCT GAATGGGAAG GCGACAGTAT TGAGAACGCC
ACCAAGCTCA CTGCCGAATT TGCGGAGTAC ATCAAAGAGC ACAAGGACGA GATTGTCGCG
CTGAGCATCT ATTTTGACCA GCCGTACCGC CGCAGGGAAG TCACCTACGG CATGGTGAAA
ACCCTGCTGG CCAAGCTGAA GACGGACCAG CCCAAGCTGG CTCCGGTTCG CATCTGGCAC
GCCTATGCCC TGCTGGACAA GGTGACGGGA AAGTCGCCAG AAAATGAACT GACAGCCCTG
GTCTCTTTGA TCAGACGGGT TTGCGGCATG GACGCCGCCA TCGCCCCTTA CGGCGATACG
GTGCGAGACA ACTTCAAGCG CTGGATTTTC AAGCGTCACC AGGGAAGCGG CTCGAAATTT
GACGAAGAGC AGATGACCTG GCTGCGGATG ATCCGGGACC ACATTGCCTG TTCGTTCCAC
ATGGACAGGG ATGACCTGGA ACTGGCCCCG TTTGACGGAC AAGGCGGATT GGGACGCATG
TGGGAATTGT TTAGGGAAGA TATGGATTCT TTAATTGATG AGATGAATGA AGAACTGGCG
GCATAA
 
Protein sequence
MNKSNQNPEQ KARDAIDKTL DECGWAVQDK KKINFNAGLG VAVREYPTDA GPADYVLFVD 
RKPVGVIEAK REEEGQHLVT VAEQSAEYAN AQLKWVQDNS PLPFIYESTG VITIFRDQRD
PKPRSREVFS FHRPETFQEW LAQDDTLRGR LEKLPPLPTQ GLRDCQKRAI QNLEGSFKLA
KPKALVQMAT GAGKTYTAIT SVYRLFKHAN AKRILFLVDT KNLGEQAEQE FMAFTPNDSN
DLFTRLYNVQ RLKSSYIPDS SHVCISTIQR LYSILKGVEL DEAAEEFNPA EHVGPKAPLP
VVYNGKIPIE FFDFIVIDEC HRSIYNLWRQ VLDYFDSFLI GLTATPDKRT FAFFNENVVS
EYSHEDAVAD GVNVGYDVYT IETQITKQGA VLKANQAIEK RNRLSRRKRW EQQDEDEAYS
GAQLDKDVVN PSQIRNVIKT YKDKLPAIFP GRCEVPKTLI FAKTDSHADD IIQIVREEFG
EGNEFCKKVT YKANEDPKSV LAQFRNSYNP RIAVTVDMIA TGTDVKPLEC LLFMRDVRSK
NYFEQMKGRG TRTLDFDGLK KVTPSASSNK THFVIVDAIG VTKSHKTDSR ALERKPTVSL
KDLLMSVMMG ATDDDTLTSL ASRLTRLNHQ LTPEDQKRIE VKTDGVPLSQ ITKDLLGSMN
PDKIDAKARE QFKLTDEQEP TDDQSTKAKQ ELVKNATTVF TGEVCELLDT IRREHEQTID
THNIDTVLRA EWEGDSIENA TKLTAEFAEY IKEHKDEIVA LSIYFDQPYR RREVTYGMVK
TLLAKLKTDQ PKLAPVRIWH AYALLDKVTG KSPENELTAL VSLIRRVCGM DAAIAPYGDT
VRDNFKRWIF KRHQGSGSKF DEEQMTWLRM IRDHIACSFH MDRDDLELAP FDGQGGLGRM
WELFREDMDS LIDEMNEELA A