Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_0854 |
Symbol | |
ID | 4664338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 1053152 |
End bp | 1054939 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 639819076 |
Product | restriction modification system DNA specificity subunit |
Protein accession | YP_966302 |
Protein GI | 120601902 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.221182 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0643125 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACTAT CTTACAAACC GATAGAAATT GTCAAGGAAG GGAAGAACCC TTTACTTGGA AAAGCTGATC ACTGGAAAAG GGTTTATGTC AGCGAAATTG CTATGGTTCA AAATGGATTT GCATTCAAAT CAAAATTCTT TTCTAGGGAC GAAGGCATTC CACTGATCAG AATCAGGGAT ATCCTGAGTG CCGAGACCGA GCACAAATAT TTTGGGCAGT TCGACAAAGA GTATTTAGTG CACAATGGAG ACCTGCTTAT AGGGATGGAT GGCGATTTTG TAGCAGCATA TTGGCCTGGA AAAGAAGGTT TGCTGAATCA GCGAGTCTGT AGAATTGTTA TTGAATCAGA AAATTACGAT AAAAAGTTCT TTTTTCTTGC CCTTCAGCCG TACCTAGATG CAATTCACGA AAAGACATCC TCTGTTACTG TCAAACATCT ATCATCAAAG ACAGTCAATG AGATACCTCT TCCGCTCCCC CCCCTCAACG AACAAAACCG CATCGTCGCC AAGATCGAAG AACTCTTTTC CGAGCTGGAC GCCGGGGTTG AAAACCTCAC CAAAGCCAAG GAACAGCTTG GCGTGTACCG GCAGTCGCTT CTCAAACACG CCTTTGAGGG CAAGCTGACC GAAGCATGGC GCAAGAGAAA CGCTGACAAG CTCGAATCCG GCGAAGCCCT TCTCAAGCGA GTGAAGAAGG AACGCGAGGA GTATTTCAAA AAGCAGCTTG AGCAGTGGGA GAAAGACGTC GCTCAATGGG AAGCGGACGG CAAGCCCGGC AAGAAGCCAA CCCAACCCAA GAAGCCGAAG AAGCTCGCTC CGATCAGTGA AGAGGAGTTG AAGGAGCTGC CGGAGCTGCC TGAGGGGTGG GTGTGGGCTC GACTTGGAAA TCTGATCGAT CCCCCAGCTT ATGGAACCTC AAGAAAGTCT GACTACAATA TTGATGGCAC AGGGGTGTTG AGAATACCAA ACATTGTAGA TGGAAAGATC GATAGTAGCG ATTTAAAATA TACTGCGTTT TCTCCAGGCG AAGAGGAACA ATACAGGTTA AAAGCTGGTG ACTTATTAAC TATAAGATCA AATGGGAGCG TTTCACTGGT TGGACAATGT GCATTAATAG AAGATGACGA CACACGATAT GTCTACGCAG GATATTTGAT TCGACTGAGA ACCATTGGGC TATTAGTTTC TAAATTCCTT CTGTACTGCC TTTCAAGCCT CAGGCTTCGT AATCAAATTG AAAGCAAGGC GAAATCGACA AGCGGAGTGA ACAACATTAA CTCGCAGGAA CTGTCGTCAC TCATAGTGCC ACTATGCTCT CAACTTGAAC AGAACGAGGT GAGCAAATTA TTAGCGGATT CATTATCAAC CGCAGGTGAA CAGACTTCCA TGATCGAAAT ACAGCTAGAA CACATTAGAA TCCTAAAACA ATCGATTCTC GACAAAGCTT TCTCTGGAAC TTTAATCTCA CAAGACCCCA ATGACGAACC AGCCTCCAAG CTTCTCGAAA GAATCAAGCA GGAGCGGAAG AGTGCTCCCA ATCCCAAACG GACACGGAAA ACGAAAACCA AGAGGATTGC TATGGCAGAC CTGAAAGAAG TTCTCGCCAC TGCCAAAGAT TGGGTCAGTG CTCAGGATGC ATTCCGCCAG TGTGGCGTTG GCGATGGCGC ACCTACAGAT GAGGTCGAAA AACTGTACGG AGAATTAAAA CAGGAACTCG ACCAGAAGAC CATCGAGGTA GAGCGCAGGG GTGATGAAGA CTGGCTCCGC TTGGCTGCTG AGGGTTAG
|
Protein sequence | MALSYKPIEI VKEGKNPLLG KADHWKRVYV SEIAMVQNGF AFKSKFFSRD EGIPLIRIRD ILSAETEHKY FGQFDKEYLV HNGDLLIGMD GDFVAAYWPG KEGLLNQRVC RIVIESENYD KKFFFLALQP YLDAIHEKTS SVTVKHLSSK TVNEIPLPLP PLNEQNRIVA KIEELFSELD AGVENLTKAK EQLGVYRQSL LKHAFEGKLT EAWRKRNADK LESGEALLKR VKKEREEYFK KQLEQWEKDV AQWEADGKPG KKPTQPKKPK KLAPISEEEL KELPELPEGW VWARLGNLID PPAYGTSRKS DYNIDGTGVL RIPNIVDGKI DSSDLKYTAF SPGEEEQYRL KAGDLLTIRS NGSVSLVGQC ALIEDDDTRY VYAGYLIRLR TIGLLVSKFL LYCLSSLRLR NQIESKAKST SGVNNINSQE LSSLIVPLCS QLEQNEVSKL LADSLSTAGE QTSMIEIQLE HIRILKQSIL DKAFSGTLIS QDPNDEPASK LLERIKQERK SAPNPKRTRK TKTKRIAMAD LKEVLATAKD WVSAQDAFRQ CGVGDGAPTD EVEKLYGELK QELDQKTIEV ERRGDEDWLR LAAEG
|
| |