Gene Dvul_0485 details


Gene Information

Locus tag: Dvul_0485
Symbol:
ID: 4662091
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris DP4
Kingdom: Bacteria
Replicon accession: NC_008751
Strand:
Start bp: 615167
End bp: 618286
Gene Length: 3120 bp
Protein Length: 1039 aa
Translation table: 11
GC content: 58%
IMG OID: 639818695
Product: hypothetical protein
Protein accession: YP_965935
Protein GI: 120601535
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID:
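
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 615167
end_bp = 618286
gene_length_bp = 3120
protein_length_aa = 1039

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so N codons encode N - 1 residues.
codons = gene_length_bp // 3

print(span)        # length implied by the coordinates
print(codons - 1)  # protein length implied by the gene length
```

Under these assumptions both derived values agree with the listed Gene Length and Protein Length.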


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.646894
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000377567
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGCCACC ACAAGGAAAT CGCGTTCGAA AACGACATCT GCAACCACCT TGCGGCGCAT 
GGCTGGCAGT ACACGGCAGG AGACGCCGCA AGCTACGACC GCGCTCGTGC CTTGTTCCCG
GAAGATGTCG TTGCGTGGGT GCAGACCACC CAGCCCGAGG CGTGGGAGGT GCTGGTCAGA
AATCATGGCA CTGCCGCGCA AGGGGTGCTG CTTGACCGCA TCCGCAAGCA ACTCGACGAC
CGGGGCACCC TTGATGTGAT CCGCTTCGGG GTTGAACTGC TCGGCCTGAA AAGGCGGCTC
ACACTGGCCC AGTTCAAGCC GGCTTTCGAT CTCAACCCGG AGATTCTGGA GCGGTATCAG
GCGACCCGCC TGCGTGTGGT GCGGCAGGTG CGCTATTCCG TGCATAACGA AAACAGCCTC
GACCTCGTGC TGTTCCTGAA CGGCATTCCC GTTGCCACGG TAGAACTCAA GTCGGACTTC
ACCCAGTCGG TTGAGGATGC TGTTGACCAG TACCGCGTTG ACCGCAACCC TCACCCCAAG
GGGCAAGGGA CGCGAGAACC CTTGCTCGAC TTCCCGCGCG GGGCGCTGGT GCATTTTGCG
GTGAGCAACT CGCTGGTACG CATGACCACC AGACTGGAGG GGGCAGGCAC ACGCTTTCTG
CCGTTCGACC GTGGCAACTG CGGCGCTGCG GGCAACGCGC CCAACCCTGC GGGGCATGCC
ACCGCCTACC TGTGGGAAGA GGTGTGGCAG CGCGATAGCT GGCTCGAGAT TGTGGGGCGC
TACATTGTCG CCATGCGCGG GCCGAAGAAG CAGATAGAGA AGATCATCTT TCCTCGCTAT
CACCAGCTTG ATGCCACGCG CCAACTCGTT GCCAAAGTAC GCGAAGAGGG GGTGGGGCAA
AAATACCTCA TCCAGCATTC TGCCGGGTCG GGCAAAACCA ACTCCATTGC GTGGACAGCC
CACTTTCTGG CTGACCTGCA CGACGCCAAC CAGAAGAAGA TGTTCGATTC CGTACTGGTG
GTAAGTGACC GCACCGTGCT GGATGCCCAG TTGCAGGAAG CCATTTTTGC CTTTGAGCGC
ACGACGGGTG TTGTGGCGAC CATCACCGGT GACAACGGCA GCAAGAGCGA GGCACTGGCG
CAGGCGCTTT CTGGTGGCAA GAAGATTGTC GTTTGCACCA TCCAGACCTT TCCATTTGCC
TTGCAAGCCG TGCAGGAACT TGCCGCCACG CAGGGCAAGA CCTTTGCCGT CATTGCCGAT
GAAGCCCACA GTTCACAGAC GGGAGATGCC GCCGCCAAGC TGAAGCAGGT GCTCACCGCC
GAAGAAATCA GGGAACTGGA AGACGGCGGC GAAATAAGCA ACGAAGACAT CCTCACCATG
CAAATGGCGG CAAGGGCCAA TGCGCGGGGC ATAACCTACG TCGCCTTCAC GGCGACCCCC
AAGGCCAAAA CGCTTGAACT TTTCGGACGG TGCCCAGACC CTTCACTCCC CGCCGGGCCG
GGCAACCTGC CTGCACCCTT TCATGTGTAC GGCATGCGAC AGGCCATTGA AGAAAGGTTC
ATTCTGGATG TGCTGCGCAA CTATACGCCG TACAAGCTGG CGTTCCGCCT CGCCAGCAAT
GGCAAGGAGT GGGACGAAAA AGAGGTGGAG CGCAGCGAAG CCATGAAGGG CATCATGCGA
TGGGTGCGCC TGCATCCCTA CAACATCAGC CAGAAGGTAC AGGTGGTTGT AGAGCACTTT
CTCGCCAATG TGGCCCCGTT GCTGGACGGG CAGGCCAAGG CAATGGTGGT GACAGCCAGC
AGACAGGAGG TGGTGCGCTG GCAGATTGCC ATCAACAAGT ACATCAAGGA CAAGGGCTAC
CGGATAGGCA CGCTCGTGGC CTTCTCTGGT GAAGTGCATG ATGCAGAGAT TGCGAAAGAC
AGTTTTACCG AGCACAGCAC GACGCTGAAC CCCGGACTCA ACGGGCGCGA CATGCGCGAG
GCCTTCGGAA CAGACGAGTA TCAGTTGCTG CTGGTCGCCA ACAAATTCCA GACGGGCTTT
GACCAGCCCC TCTTGTGCGG CATGTATGTG GACAAGCGCC TTGCCGGGAT TCAAGCCGTA
CAGACGCTCT CGCGCCTCAA CAGGGCGCAT CCCGGCAAGG ATACGACGTA TATCCTCGAC
TTCGTCAACG AGCCGAACGA AGTGCTTGAA GCCTTCAAGA CCTACTATGA AACAGCTGAA
CTTGAGGGCG TGACTGACCC GAACCTAGTC TATGACCTGC GGGCCAAGCT GGATGGCATG
GGCTACTACG ACGATAACGA GGTAGAACGT GTCGTTACCG TGGTACTCAG CCCCAAAGCA
TCACAAAAAG AGCTTGATGC AGCCATTAGA CCTGTTGCAG ACCGCCTGCT CAAGCGGTTC
AATGTGCTAA AGGAAGCCAT AAAGGTTGCC GTGACAGTGC AGGATGCGCG AGGGGAGAAG
GACGCTCGCG ATGAAAGGAA TGCCCTTGAG TTGTTCAAAC GCAACATTGG TGCCTTCCTG
AGAGTGTATT CATTCCTGTC GCAGATATTC GACTATGGCA ATACAGACAT AGAGAAACGA
TCCATCTTCT ATCGCTGTTT GCTGCCCTTG CTGGAGTTTG GACGTGAGCG TGATCTTGTT
GACCTTTCGG GCGTGGTCTT GACCCACCAC ACCCTGCGCA ACCGGGGCAA GCGCGATCTG
CCGTTCGATG GCAAGGGTGA AAAGCTCATG CCCCTCACCG AACCCGGCAG TGGTGAAGTG
CGCGACAAGC AGAAGGCACT GCTTGCCGAG ATCATCTCCA AGGTCAACGA CCTCTTCGAG
GGAGACCTCA CCGAGGATGA CAAACTGATC TACGTTAACA GCGTCATCAA GGGCAAACTT
CTGGAGTGTG ACGTGCTTGT GCAGCAAGCC GCCAACAACT CCAAGGGACA GTTTGCCAAT
TCACCCGACC TCGCAAAAGA AATACTCAAC GCCATCATGG ATGCACTGAC GGCGCATACT
GCCATGAGCA AGCAGGCGCT GGAGTCTGAA CGGGTACGGC ATGGCTTGCG CGACATTCTT
CTGGATCATG CCGGACTGTA TGAAGACCTG CGGCAAAAGG CAGAAGCTCT CAGGGCGTAA
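
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any calculation. The following is a minimal sketch of how a GC fraction such as the one in the GC content field could be computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, used here only as sample input.
first_line = "ATGAGCCACC ACAAGGAAAT CGCGTTCGAA AACGACATCT GCAACCACCT TGCGGCGCAT"
print(len(clean_sequence(first_line)))
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full gene sequence to `gc_fraction` gives the value to compare with the GC content field.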
 
Protein sequence
MSHHKEIAFE NDICNHLAAH GWQYTAGDAA SYDRARALFP EDVVAWVQTT QPEAWEVLVR 
NHGTAAQGVL LDRIRKQLDD RGTLDVIRFG VELLGLKRRL TLAQFKPAFD LNPEILERYQ
ATRLRVVRQV RYSVHNENSL DLVLFLNGIP VATVELKSDF TQSVEDAVDQ YRVDRNPHPK
GQGTREPLLD FPRGALVHFA VSNSLVRMTT RLEGAGTRFL PFDRGNCGAA GNAPNPAGHA
TAYLWEEVWQ RDSWLEIVGR YIVAMRGPKK QIEKIIFPRY HQLDATRQLV AKVREEGVGQ
KYLIQHSAGS GKTNSIAWTA HFLADLHDAN QKKMFDSVLV VSDRTVLDAQ LQEAIFAFER
TTGVVATITG DNGSKSEALA QALSGGKKIV VCTIQTFPFA LQAVQELAAT QGKTFAVIAD
EAHSSQTGDA AAKLKQVLTA EEIRELEDGG EISNEDILTM QMAARANARG ITYVAFTATP
KAKTLELFGR CPDPSLPAGP GNLPAPFHVY GMRQAIEERF ILDVLRNYTP YKLAFRLASN
GKEWDEKEVE RSEAMKGIMR WVRLHPYNIS QKVQVVVEHF LANVAPLLDG QAKAMVVTAS
RQEVVRWQIA INKYIKDKGY RIGTLVAFSG EVHDAEIAKD SFTEHSTTLN PGLNGRDMRE
AFGTDEYQLL LVANKFQTGF DQPLLCGMYV DKRLAGIQAV QTLSRLNRAH PGKDTTYILD
FVNEPNEVLE AFKTYYETAE LEGVTDPNLV YDLRAKLDGM GYYDDNEVER VVTVVLSPKA
SQKELDAAIR PVADRLLKRF NVLKEAIKVA VTVQDARGEK DARDERNALE LFKRNIGAFL
RVYSFLSQIF DYGNTDIEKR SIFYRCLLPL LEFGRERDLV DLSGVVLTHH TLRNRGKRDL
PFDGKGEKLM PLTEPGSGEV RDKQKALLAE IISKVNDLFE GDLTEDDKLI YVNSVIKGKL
LECDVLVQQA ANNSKGQFAN SPDLAKEILN AIMDALTAHT AMSKQALESE RVRHGLRDIL
LDHAGLYEDL RQKAEALRA
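
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below builds the standard codon assignments, which translation table 11 shares for elongation codons. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The function name `translate` is hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGAGCCACC ACAAGGAAAT CGCGTTCGAA"))  # MSHHKEIAFE
```

Translating the full gene sequence in the same way gives a string that can be compared with the protein sequence listed above.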