Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_0485 |
Symbol | |
ID | 4662091 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | - |
Start bp | 615167 |
End bp | 618286 |
Gene Length | 3120 bp |
Protein Length | 1039 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639818695 |
Product | hypothetical protein |
Protein accession | YP_965935 |
Protein GI | 120601535 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.646894 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000377567 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGCCACC ACAAGGAAAT CGCGTTCGAA AACGACATCT GCAACCACCT TGCGGCGCAT GGCTGGCAGT ACACGGCAGG AGACGCCGCA AGCTACGACC GCGCTCGTGC CTTGTTCCCG GAAGATGTCG TTGCGTGGGT GCAGACCACC CAGCCCGAGG CGTGGGAGGT GCTGGTCAGA AATCATGGCA CTGCCGCGCA AGGGGTGCTG CTTGACCGCA TCCGCAAGCA ACTCGACGAC CGGGGCACCC TTGATGTGAT CCGCTTCGGG GTTGAACTGC TCGGCCTGAA AAGGCGGCTC ACACTGGCCC AGTTCAAGCC GGCTTTCGAT CTCAACCCGG AGATTCTGGA GCGGTATCAG GCGACCCGCC TGCGTGTGGT GCGGCAGGTG CGCTATTCCG TGCATAACGA AAACAGCCTC GACCTCGTGC TGTTCCTGAA CGGCATTCCC GTTGCCACGG TAGAACTCAA GTCGGACTTC ACCCAGTCGG TTGAGGATGC TGTTGACCAG TACCGCGTTG ACCGCAACCC TCACCCCAAG GGGCAAGGGA CGCGAGAACC CTTGCTCGAC TTCCCGCGCG GGGCGCTGGT GCATTTTGCG GTGAGCAACT CGCTGGTACG CATGACCACC AGACTGGAGG GGGCAGGCAC ACGCTTTCTG CCGTTCGACC GTGGCAACTG CGGCGCTGCG GGCAACGCGC CCAACCCTGC GGGGCATGCC ACCGCCTACC TGTGGGAAGA GGTGTGGCAG CGCGATAGCT GGCTCGAGAT TGTGGGGCGC TACATTGTCG CCATGCGCGG GCCGAAGAAG CAGATAGAGA AGATCATCTT TCCTCGCTAT CACCAGCTTG ATGCCACGCG CCAACTCGTT GCCAAAGTAC GCGAAGAGGG GGTGGGGCAA AAATACCTCA TCCAGCATTC TGCCGGGTCG GGCAAAACCA ACTCCATTGC GTGGACAGCC CACTTTCTGG CTGACCTGCA CGACGCCAAC CAGAAGAAGA TGTTCGATTC CGTACTGGTG GTAAGTGACC GCACCGTGCT GGATGCCCAG TTGCAGGAAG CCATTTTTGC CTTTGAGCGC ACGACGGGTG TTGTGGCGAC CATCACCGGT GACAACGGCA GCAAGAGCGA GGCACTGGCG CAGGCGCTTT CTGGTGGCAA GAAGATTGTC GTTTGCACCA TCCAGACCTT TCCATTTGCC TTGCAAGCCG TGCAGGAACT TGCCGCCACG CAGGGCAAGA CCTTTGCCGT CATTGCCGAT GAAGCCCACA GTTCACAGAC GGGAGATGCC GCCGCCAAGC TGAAGCAGGT GCTCACCGCC GAAGAAATCA GGGAACTGGA AGACGGCGGC GAAATAAGCA ACGAAGACAT CCTCACCATG CAAATGGCGG CAAGGGCCAA TGCGCGGGGC ATAACCTACG TCGCCTTCAC GGCGACCCCC AAGGCCAAAA CGCTTGAACT TTTCGGACGG TGCCCAGACC CTTCACTCCC CGCCGGGCCG GGCAACCTGC CTGCACCCTT TCATGTGTAC GGCATGCGAC AGGCCATTGA AGAAAGGTTC ATTCTGGATG TGCTGCGCAA CTATACGCCG TACAAGCTGG CGTTCCGCCT CGCCAGCAAT GGCAAGGAGT GGGACGAAAA AGAGGTGGAG CGCAGCGAAG CCATGAAGGG CATCATGCGA TGGGTGCGCC TGCATCCCTA CAACATCAGC CAGAAGGTAC AGGTGGTTGT AGAGCACTTT CTCGCCAATG TGGCCCCGTT GCTGGACGGG CAGGCCAAGG CAATGGTGGT GACAGCCAGC AGACAGGAGG TGGTGCGCTG GCAGATTGCC ATCAACAAGT ACATCAAGGA CAAGGGCTAC CGGATAGGCA CGCTCGTGGC CTTCTCTGGT GAAGTGCATG ATGCAGAGAT TGCGAAAGAC AGTTTTACCG AGCACAGCAC GACGCTGAAC CCCGGACTCA ACGGGCGCGA CATGCGCGAG GCCTTCGGAA CAGACGAGTA TCAGTTGCTG CTGGTCGCCA ACAAATTCCA GACGGGCTTT GACCAGCCCC TCTTGTGCGG CATGTATGTG GACAAGCGCC TTGCCGGGAT TCAAGCCGTA CAGACGCTCT CGCGCCTCAA CAGGGCGCAT CCCGGCAAGG ATACGACGTA TATCCTCGAC TTCGTCAACG AGCCGAACGA AGTGCTTGAA GCCTTCAAGA CCTACTATGA AACAGCTGAA CTTGAGGGCG TGACTGACCC GAACCTAGTC TATGACCTGC GGGCCAAGCT GGATGGCATG GGCTACTACG ACGATAACGA GGTAGAACGT GTCGTTACCG TGGTACTCAG CCCCAAAGCA TCACAAAAAG AGCTTGATGC AGCCATTAGA CCTGTTGCAG ACCGCCTGCT CAAGCGGTTC AATGTGCTAA AGGAAGCCAT AAAGGTTGCC GTGACAGTGC AGGATGCGCG AGGGGAGAAG GACGCTCGCG ATGAAAGGAA TGCCCTTGAG TTGTTCAAAC GCAACATTGG TGCCTTCCTG AGAGTGTATT CATTCCTGTC GCAGATATTC GACTATGGCA ATACAGACAT AGAGAAACGA TCCATCTTCT ATCGCTGTTT GCTGCCCTTG CTGGAGTTTG GACGTGAGCG TGATCTTGTT GACCTTTCGG GCGTGGTCTT GACCCACCAC ACCCTGCGCA ACCGGGGCAA GCGCGATCTG CCGTTCGATG GCAAGGGTGA AAAGCTCATG CCCCTCACCG AACCCGGCAG TGGTGAAGTG CGCGACAAGC AGAAGGCACT GCTTGCCGAG ATCATCTCCA AGGTCAACGA CCTCTTCGAG GGAGACCTCA CCGAGGATGA CAAACTGATC TACGTTAACA GCGTCATCAA GGGCAAACTT CTGGAGTGTG ACGTGCTTGT GCAGCAAGCC GCCAACAACT CCAAGGGACA GTTTGCCAAT TCACCCGACC TCGCAAAAGA AATACTCAAC GCCATCATGG ATGCACTGAC GGCGCATACT GCCATGAGCA AGCAGGCGCT GGAGTCTGAA CGGGTACGGC ATGGCTTGCG CGACATTCTT CTGGATCATG CCGGACTGTA TGAAGACCTG CGGCAAAAGG CAGAAGCTCT CAGGGCGTAA
|
Protein sequence | MSHHKEIAFE NDICNHLAAH GWQYTAGDAA SYDRARALFP EDVVAWVQTT QPEAWEVLVR NHGTAAQGVL LDRIRKQLDD RGTLDVIRFG VELLGLKRRL TLAQFKPAFD LNPEILERYQ ATRLRVVRQV RYSVHNENSL DLVLFLNGIP VATVELKSDF TQSVEDAVDQ YRVDRNPHPK GQGTREPLLD FPRGALVHFA VSNSLVRMTT RLEGAGTRFL PFDRGNCGAA GNAPNPAGHA TAYLWEEVWQ RDSWLEIVGR YIVAMRGPKK QIEKIIFPRY HQLDATRQLV AKVREEGVGQ KYLIQHSAGS GKTNSIAWTA HFLADLHDAN QKKMFDSVLV VSDRTVLDAQ LQEAIFAFER TTGVVATITG DNGSKSEALA QALSGGKKIV VCTIQTFPFA LQAVQELAAT QGKTFAVIAD EAHSSQTGDA AAKLKQVLTA EEIRELEDGG EISNEDILTM QMAARANARG ITYVAFTATP KAKTLELFGR CPDPSLPAGP GNLPAPFHVY GMRQAIEERF ILDVLRNYTP YKLAFRLASN GKEWDEKEVE RSEAMKGIMR WVRLHPYNIS QKVQVVVEHF LANVAPLLDG QAKAMVVTAS RQEVVRWQIA INKYIKDKGY RIGTLVAFSG EVHDAEIAKD SFTEHSTTLN PGLNGRDMRE AFGTDEYQLL LVANKFQTGF DQPLLCGMYV DKRLAGIQAV QTLSRLNRAH PGKDTTYILD FVNEPNEVLE AFKTYYETAE LEGVTDPNLV YDLRAKLDGM GYYDDNEVER VVTVVLSPKA SQKELDAAIR PVADRLLKRF NVLKEAIKVA VTVQDARGEK DARDERNALE LFKRNIGAFL RVYSFLSQIF DYGNTDIEKR SIFYRCLLPL LEFGRERDLV DLSGVVLTHH TLRNRGKRDL PFDGKGEKLM PLTEPGSGEV RDKQKALLAE IISKVNDLFE GDLTEDDKLI YVNSVIKGKL LECDVLVQQA ANNSKGQFAN SPDLAKEILN AIMDALTAHT AMSKQALESE RVRHGLRDIL LDHAGLYEDL RQKAEALRA
|
| |