Gene Dvul_0854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_0854 
Symbol 
ID4664338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp1053152 
End bp1054939 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content47% 
IMG OID639819076 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_966302 
Protein GI120601902 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.221182 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0643125 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACTAT CTTACAAACC GATAGAAATT GTCAAGGAAG GGAAGAACCC TTTACTTGGA 
AAAGCTGATC ACTGGAAAAG GGTTTATGTC AGCGAAATTG CTATGGTTCA AAATGGATTT
GCATTCAAAT CAAAATTCTT TTCTAGGGAC GAAGGCATTC CACTGATCAG AATCAGGGAT
ATCCTGAGTG CCGAGACCGA GCACAAATAT TTTGGGCAGT TCGACAAAGA GTATTTAGTG
CACAATGGAG ACCTGCTTAT AGGGATGGAT GGCGATTTTG TAGCAGCATA TTGGCCTGGA
AAAGAAGGTT TGCTGAATCA GCGAGTCTGT AGAATTGTTA TTGAATCAGA AAATTACGAT
AAAAAGTTCT TTTTTCTTGC CCTTCAGCCG TACCTAGATG CAATTCACGA AAAGACATCC
TCTGTTACTG TCAAACATCT ATCATCAAAG ACAGTCAATG AGATACCTCT TCCGCTCCCC
CCCCTCAACG AACAAAACCG CATCGTCGCC AAGATCGAAG AACTCTTTTC CGAGCTGGAC
GCCGGGGTTG AAAACCTCAC CAAAGCCAAG GAACAGCTTG GCGTGTACCG GCAGTCGCTT
CTCAAACACG CCTTTGAGGG CAAGCTGACC GAAGCATGGC GCAAGAGAAA CGCTGACAAG
CTCGAATCCG GCGAAGCCCT TCTCAAGCGA GTGAAGAAGG AACGCGAGGA GTATTTCAAA
AAGCAGCTTG AGCAGTGGGA GAAAGACGTC GCTCAATGGG AAGCGGACGG CAAGCCCGGC
AAGAAGCCAA CCCAACCCAA GAAGCCGAAG AAGCTCGCTC CGATCAGTGA AGAGGAGTTG
AAGGAGCTGC CGGAGCTGCC TGAGGGGTGG GTGTGGGCTC GACTTGGAAA TCTGATCGAT
CCCCCAGCTT ATGGAACCTC AAGAAAGTCT GACTACAATA TTGATGGCAC AGGGGTGTTG
AGAATACCAA ACATTGTAGA TGGAAAGATC GATAGTAGCG ATTTAAAATA TACTGCGTTT
TCTCCAGGCG AAGAGGAACA ATACAGGTTA AAAGCTGGTG ACTTATTAAC TATAAGATCA
AATGGGAGCG TTTCACTGGT TGGACAATGT GCATTAATAG AAGATGACGA CACACGATAT
GTCTACGCAG GATATTTGAT TCGACTGAGA ACCATTGGGC TATTAGTTTC TAAATTCCTT
CTGTACTGCC TTTCAAGCCT CAGGCTTCGT AATCAAATTG AAAGCAAGGC GAAATCGACA
AGCGGAGTGA ACAACATTAA CTCGCAGGAA CTGTCGTCAC TCATAGTGCC ACTATGCTCT
CAACTTGAAC AGAACGAGGT GAGCAAATTA TTAGCGGATT CATTATCAAC CGCAGGTGAA
CAGACTTCCA TGATCGAAAT ACAGCTAGAA CACATTAGAA TCCTAAAACA ATCGATTCTC
GACAAAGCTT TCTCTGGAAC TTTAATCTCA CAAGACCCCA ATGACGAACC AGCCTCCAAG
CTTCTCGAAA GAATCAAGCA GGAGCGGAAG AGTGCTCCCA ATCCCAAACG GACACGGAAA
ACGAAAACCA AGAGGATTGC TATGGCAGAC CTGAAAGAAG TTCTCGCCAC TGCCAAAGAT
TGGGTCAGTG CTCAGGATGC ATTCCGCCAG TGTGGCGTTG GCGATGGCGC ACCTACAGAT
GAGGTCGAAA AACTGTACGG AGAATTAAAA CAGGAACTCG ACCAGAAGAC CATCGAGGTA
GAGCGCAGGG GTGATGAAGA CTGGCTCCGC TTGGCTGCTG AGGGTTAG
 
Protein sequence
MALSYKPIEI VKEGKNPLLG KADHWKRVYV SEIAMVQNGF AFKSKFFSRD EGIPLIRIRD 
ILSAETEHKY FGQFDKEYLV HNGDLLIGMD GDFVAAYWPG KEGLLNQRVC RIVIESENYD
KKFFFLALQP YLDAIHEKTS SVTVKHLSSK TVNEIPLPLP PLNEQNRIVA KIEELFSELD
AGVENLTKAK EQLGVYRQSL LKHAFEGKLT EAWRKRNADK LESGEALLKR VKKEREEYFK
KQLEQWEKDV AQWEADGKPG KKPTQPKKPK KLAPISEEEL KELPELPEGW VWARLGNLID
PPAYGTSRKS DYNIDGTGVL RIPNIVDGKI DSSDLKYTAF SPGEEEQYRL KAGDLLTIRS
NGSVSLVGQC ALIEDDDTRY VYAGYLIRLR TIGLLVSKFL LYCLSSLRLR NQIESKAKST
SGVNNINSQE LSSLIVPLCS QLEQNEVSKL LADSLSTAGE QTSMIEIQLE HIRILKQSIL
DKAFSGTLIS QDPNDEPASK LLERIKQERK SAPNPKRTRK TKTKRIAMAD LKEVLATAKD
WVSAQDAFRQ CGVGDGAPTD EVEKLYGELK QELDQKTIEV ERRGDEDWLR LAAEG