Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_0853 |
Symbol | |
ID | 4664347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 1050386 |
End bp | 1053151 |
Gene Length | 2766 bp |
Protein Length | 921 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639819075 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_966301 |
Protein GI | 120601901 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.237382 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0361479 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAGT CCAACCAAAA TCCAGAGCAA AAAGCGCGAG ATGCTATCGA CAAGACGCTT GACGAGTGCG GTTGGGCTGT TCAGGACAAA AAGAAAATCA ACTTCAATGC CGGGCTGGGT GTCGCTGTCA GGGAATACCC GACAGATGCG GGGCCTGCTG ATTATGTTCT CTTTGTTGAC AGAAAGCCTG TCGGCGTCAT CGAAGCCAAA CGTGAAGAGG AAGGACAACA CCTTGTCACT GTCGCTGAAC AATCAGCTGA ATACGCCAAC GCTCAGCTGA AATGGGTTCA GGATAATTCT CCGCTCCCTT TCATCTACGA AAGCACTGGG GTAATCACAA TTTTCCGGGA TCAGCGGGAC CCCAAGCCCC GTTCCCGGGA GGTGTTTAGC TTCCATCGAC CAGAGACATT CCAAGAATGG CTGGCGCAGG ATGATACACT GAGGGGTAGA CTCGAAAAAC TTCCTCCTCT TCCAACACAA GGGCTTCGAG ACTGCCAAAA GAGAGCGATT CAAAATCTTG AAGGCTCGTT CAAGCTGGCG AAGCCCAAGG CTCTCGTGCA GATGGCGACT GGAGCCGGAA AGACTTACAC CGCCATTACC TCGGTGTATC GACTTTTCAA GCACGCCAAT GCGAAGCGAA TCCTTTTCCT TGTCGACACC AAGAACCTGG GAGAACAGGC GGAACAGGAG TTCATGGCGT TCACTCCCAA TGACTCGAAC GATCTTTTCA CGCGCCTGTA TAATGTCCAG CGGCTCAAAT CCAGCTACAT CCCGGATTCA TCCCACGTCT GCATTTCCAC GATTCAGCGG CTCTACTCCA TTCTGAAGGG TGTGGAACTG GACGAGGCTG CGGAAGAGTT CAACCCCGCT GAACATGTGG GGCCGAAAGC TCCTCTGCCT GTTGTTTACA ATGGTAAGAT TCCCATCGAG TTTTTCGACT TCATCGTCAT AGACGAGTGC CACCGCTCCA TTTACAACCT CTGGCGGCAG GTGCTGGACT ATTTCGACTC CTTCCTGATC GGCTTGACCG CCACTCCTGA CAAACGCACT TTCGCCTTTT TCAACGAAAA CGTTGTCAGC GAATACTCCC ATGAGGACGC TGTCGCAGAC GGGGTGAACG TCGGCTATGA CGTCTACACC ATTGAGACGC AGATCACGAA GCAAGGTGCT GTTCTCAAGG CGAACCAGGC CATCGAGAAG AGAAATCGCC TCTCCCGTAG GAAGCGATGG GAACAGCAAG ACGAGGATGA AGCCTATTCC GGCGCGCAAC TGGACAAGGA CGTGGTCAAC CCGAGCCAGA TTCGCAATGT CATCAAAACC TACAAAGACA AGCTGCCAGC CATCTTTCCA GGACGCTGCG AAGTGCCAAA GACGCTCATC TTCGCCAAGA CCGACAGCCA TGCGGACGAC ATCATTCAAA TAGTCCGGGA GGAGTTTGGC GAAGGAAACG AATTCTGCAA AAAGGTCACG TACAAGGCCA ATGAGGACCC GAAGTCTGTC CTGGCGCAGT TCAGGAACAG CTACAATCCC CGCATCGCGG TGACTGTGGA CATGATCGCC ACGGGAACGG ACGTGAAGCC CCTGGAGTGC CTTCTCTTCA TGCGCGATGT TCGGAGCAAG AACTACTTCG AGCAGATGAA GGGGCGCGGC ACCAGAACCC TGGACTTTGA TGGGTTGAAA AAGGTTACGC CTTCAGCATC CAGCAACAAG ACCCATTTCG TCATCGTGGA CGCCATCGGT GTCACCAAGT CGCACAAGAC AGACAGCCGT GCTCTGGAGC GCAAGCCCAC AGTCTCCCTG AAAGACCTGC TTATGAGCGT GATGATGGGA GCGACGGACG ACGACACCCT GACCAGTCTG GCAAGCAGGC TGACACGCCT GAACCACCAG CTCACTCCAG AGGATCAGAA ACGCATCGAA GTGAAAACGG ATGGGGTTCC CCTCTCACAA ATCACAAAAG ACTTGTTGGG GTCCATGAAC CCCGACAAGA TCGACGCCAA GGCGCGGGAG CAATTCAAGC TCACGGATGA ACAGGAGCCC ACGGATGACC AGAGCACCAA GGCCAAGCAG GAACTGGTCA AGAACGCGAC CACGGTGTTC ACTGGCGAGG TCTGCGAGCT GCTGGACACC ATACGCCGGG AGCACGAACA GACCATCGAC ACCCACAATA TCGATACGGT CCTCCGGGCT GAATGGGAAG GCGACAGTAT TGAGAACGCC ACCAAGCTCA CTGCCGAATT TGCGGAGTAC ATCAAAGAGC ACAAGGACGA GATTGTCGCG CTGAGCATCT ATTTTGACCA GCCGTACCGC CGCAGGGAAG TCACCTACGG CATGGTGAAA ACCCTGCTGG CCAAGCTGAA GACGGACCAG CCCAAGCTGG CTCCGGTTCG CATCTGGCAC GCCTATGCCC TGCTGGACAA GGTGACGGGA AAGTCGCCAG AAAATGAACT GACAGCCCTG GTCTCTTTGA TCAGACGGGT TTGCGGCATG GACGCCGCCA TCGCCCCTTA CGGCGATACG GTGCGAGACA ACTTCAAGCG CTGGATTTTC AAGCGTCACC AGGGAAGCGG CTCGAAATTT GACGAAGAGC AGATGACCTG GCTGCGGATG ATCCGGGACC ACATTGCCTG TTCGTTCCAC ATGGACAGGG ATGACCTGGA ACTGGCCCCG TTTGACGGAC AAGGCGGATT GGGACGCATG TGGGAATTGT TTAGGGAAGA TATGGATTCT TTAATTGATG AGATGAATGA AGAACTGGCG GCATAA
|
Protein sequence | MNKSNQNPEQ KARDAIDKTL DECGWAVQDK KKINFNAGLG VAVREYPTDA GPADYVLFVD RKPVGVIEAK REEEGQHLVT VAEQSAEYAN AQLKWVQDNS PLPFIYESTG VITIFRDQRD PKPRSREVFS FHRPETFQEW LAQDDTLRGR LEKLPPLPTQ GLRDCQKRAI QNLEGSFKLA KPKALVQMAT GAGKTYTAIT SVYRLFKHAN AKRILFLVDT KNLGEQAEQE FMAFTPNDSN DLFTRLYNVQ RLKSSYIPDS SHVCISTIQR LYSILKGVEL DEAAEEFNPA EHVGPKAPLP VVYNGKIPIE FFDFIVIDEC HRSIYNLWRQ VLDYFDSFLI GLTATPDKRT FAFFNENVVS EYSHEDAVAD GVNVGYDVYT IETQITKQGA VLKANQAIEK RNRLSRRKRW EQQDEDEAYS GAQLDKDVVN PSQIRNVIKT YKDKLPAIFP GRCEVPKTLI FAKTDSHADD IIQIVREEFG EGNEFCKKVT YKANEDPKSV LAQFRNSYNP RIAVTVDMIA TGTDVKPLEC LLFMRDVRSK NYFEQMKGRG TRTLDFDGLK KVTPSASSNK THFVIVDAIG VTKSHKTDSR ALERKPTVSL KDLLMSVMMG ATDDDTLTSL ASRLTRLNHQ LTPEDQKRIE VKTDGVPLSQ ITKDLLGSMN PDKIDAKARE QFKLTDEQEP TDDQSTKAKQ ELVKNATTVF TGEVCELLDT IRREHEQTID THNIDTVLRA EWEGDSIENA TKLTAEFAEY IKEHKDEIVA LSIYFDQPYR RREVTYGMVK TLLAKLKTDQ PKLAPVRIWH AYALLDKVTG KSPENELTAL VSLIRRVCGM DAAIAPYGDT VRDNFKRWIF KRHQGSGSKF DEEQMTWLRM IRDHIACSFH MDRDDLELAP FDGQGGLGRM WELFREDMDS LIDEMNEELA A
|
| |