Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_4592 |
Symbol | |
ID | 7973290 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | - |
Start bp | 4863016 |
End bp | 4866009 |
Gene Length | 2994 bp |
Protein Length | 997 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644795176 |
Product | Type III site-specific deoxyribonuclease |
Protein accession | YP_002946463 |
Protein GI | 239817553 |
COG category | [V] Defense mechanisms |
COG ID | [COG3587] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTTC ACTTCGAGCC CAATCTGGAC TACCAGCTTC AAGCCATCGA GGCGGTCTGT GACCTGTTCC GTGGGCAGGA AGCATGCCGA ACCGAGTTCA CGGTCACCAT GAAGCTGCCC GACCAGCAAT TGACGCTGGG CGTGGCCGAG ACCGACCTTG GCGTTGGCAA TCGGTTGACG TTGCTGGATG ACCAACTGCT GCAGAACCTG CGCGACGTTC AATTGCGCGG CGGCCTGGCG CCGTCCAGCA TGCTGGCGTC GGGTGATTTC ACCGTCGAAA TGGAGACGGG CACCGGCAAG ACCTACGTCT ACCTGCGCAC GATCTTCGAG CTGAACAAAC GCTACGGCTT CACCAAGTTC GTCATCGTGG TGCCTTCGGT GGCCATCAAA GAAGGCGTCT ACAAGTCGCT GCAGATCACC GAAGAGCACT TTAAGGGCAT CTATGCCGGG GTGCCGGTGG ACTTCTTCCT GTACGACTCG GGCAAGCTGG GCCAAGTGCG CAACTTCGCC ACCAGTTCGA CCATCCAGGT CATGGTGGTC ACGGTCGGCG CCATCAACAA GAAGGATGTC AACAACCTCT ATAAAGACAG CGAGAAGACG GGCGGCGAGA AGCCCATTGA CCTGATCAAG GCGACACGGC CCATTGTCAT CGTGGACGAG CCGCAGAGCG TGGACGGTGG CCTGACCGGC GCCGGCAAGA CCGCGCTGGA CGCCATGAAT CCGCTGTGCA CGTTGCGCTA TTCGGCCACG CACGCCAACA AGCACCACAT GGTGTTCAGG CTCGATGCGC TGGACGCCTA CGAACGCAAA CTGGTCAAGC AGATCGAGGT GGCGGCGGCC ACCATCGAGG ACGCGTACAA CAAGGCCTTC GTGCGGCTGG TGTCGGTAAC CAACAAGCGC GGCGCCATCA GCGCCAAGGT GGAGCTGCAT GTCAAGACCG CGAGCGGTGT GAAGCCCCAA GAGGTCACTG TCGGTGACGG CGACGACTTG CAGCAAAGTA CCAAGCGCGC GGTCTACGCC GACTTTCGTG TGGGCGAAAT CAACACGGCC AAGGGCGAAG AATTCCTGGA GCTGCGTTAT CCCGGCGGCG AGGTCATGCT CGCCGTGGGC CAGGCCTACG GCGATGTCGA TGCCCTCGCC GTGCAGCGCG AGATGATCCG GCGAACCATC CGCGAGCACC TGGAGAAGGA AAAACTTCTC GGGCCGAAGG GCATCAAGGT GCTGTCGCTG TTCTTCATTG ACGCGGTGGA GCGCTACCGC CAGTACGACA CGGAGGGCAA CCCGGTAAAG GGCGACTACG CCCGCATCTT CGAAGAAGAG TATCGGCGCG CGGCCAAGCT GCCGGCCTAT CAGAGCCTGT TCGCTGAAAT TGACCTAGCC AGTGCCGCCG AGGAGGTGCA CAACGGCTAC TTCTCCATCG ACAAGAAGGG TGGCTGGTCG GACACGGCCG AGAACAACGC CGGCAACCGC GAGAACGCCG AGCGAGCATA CATCCTGATC ATGAAGGAGA AGGAAAAGCT GCTGGACATC AAGACGCCGC TGAAATTCAT CTTCTCGCAC TCGGCACTGA AGGAGGGCTG GGACAACCCC AACGTCTTCC AGATTTGCAC CTTGCGGGAA ATCGGCACCG AGCGCGAGCG CCGGCAGACC ATCGGCCGTG GTCTGCGCCT GTGCGTGGAT CAGCACGGCG AGCGCGTGCG CGGCTTCGAA GTCAACCGGC TGACGGTCAT CGCGACCGAG TCCTACGAAC AGTTCGCCGA GAACTTGCAG AAGGAAATCG AGGCTGACAC GGGCATCCGC TTCGGAATCC TGGAGCGGCA CCAGTTCGCG GCCATCGCCA TCAAGGCGGC GGACGGCACG CTGGCGCCGC TGGGCGTTGA CCAGTCCAGG GCCTTGTGGG ATCACCTAAG AGCTGCCGGG CATCTCGATG CCAAGGGCAA GGTGCAGGAC TCGCTGAAGG TGGCCCTGAA AAACGGCACC CTGGCGTTGC CGCCGGCCTT TGAGGCCCAG CGCACGCAGA TTGCCGAGGT GCTACGCAAG GTGTCCGGCC GGCTCGAAAT CAAGAATGCC GACGACCGCC GTGCGGTCCT GCTGCGCAAG GGGGCGGACG GGAAGGCCGT GTACCTGTCC GAGGACTTCA AGGCGCTGTG GGACCGCATC AAGCACCGGA CCACCTACCG GGTGCAGTTC GACAACGCCA AGCTGCTGCA AGACTGCACC ACCGAACTGA AGAAGGCGCC GGCCATCGCG AAGGCCCGGC TGCAGTGGCG CAAGGCCGAC ATTGCCATCG GCAAGGCCGG CGTGCAGGCC ACGGAAAAGG AAGGCGCTGC CACCATAGTA CTGGATGAGA CCGACATCGA GCTTCCGGAC CTCTTGACCG ACTTGCAGGA TCGCACCCAG CTCACCCGTC GCAGCATCGT CACCATCCTG ACCGAAAGCG GCCGGCTGAA TGAGTTCAGG CGCAACCCGC AGCAGTTCAT CGAGCAGACC ACCGAGGTCA TCAACCGCTG CAAGCGCCTG GCGCTGGTGG ACGGCATCAA ATACCAGAAA ATGGGCGATG ACCATTTCTA CGCCCAAGAG CTGTTCGCCC AGGATGAGCT GACCGGCTAT CTGCGCAACA TGCTGCTGGA CACTGGCCGG TCGATCTACG AGCACGTGAT CTACGACTCG GATACCGAAC GTGGCTTCGC TGACGCCCTG GAGAAAAACG ACGGCGTGGT GCTCTACGCC AAGCTGCCTA GCTGGTTCAA GGTACCCACG CCGCTGGGCC CATACAACCC GGACTGGGCC ATCCTGTTCG ACCAGGACGG CACCCAGCGT CTATATTTCG TCGTTGAGAC GAAGAGTAGC CTGTTCGCAG ACGATCTGCG CGACAAGGAA AGTGCCAAGA TCGAATGCGG CAGAGCGCAC TTTGCGGCAC TCGGTGTGGG TGAGAATCCA GCGAAGTACC TCGTCGCCAC ATCCTTGGGC GATGTACTAA AGACTATGGC GTAG
|
Protein sequence | MKLHFEPNLD YQLQAIEAVC DLFRGQEACR TEFTVTMKLP DQQLTLGVAE TDLGVGNRLT LLDDQLLQNL RDVQLRGGLA PSSMLASGDF TVEMETGTGK TYVYLRTIFE LNKRYGFTKF VIVVPSVAIK EGVYKSLQIT EEHFKGIYAG VPVDFFLYDS GKLGQVRNFA TSSTIQVMVV TVGAINKKDV NNLYKDSEKT GGEKPIDLIK ATRPIVIVDE PQSVDGGLTG AGKTALDAMN PLCTLRYSAT HANKHHMVFR LDALDAYERK LVKQIEVAAA TIEDAYNKAF VRLVSVTNKR GAISAKVELH VKTASGVKPQ EVTVGDGDDL QQSTKRAVYA DFRVGEINTA KGEEFLELRY PGGEVMLAVG QAYGDVDALA VQREMIRRTI REHLEKEKLL GPKGIKVLSL FFIDAVERYR QYDTEGNPVK GDYARIFEEE YRRAAKLPAY QSLFAEIDLA SAAEEVHNGY FSIDKKGGWS DTAENNAGNR ENAERAYILI MKEKEKLLDI KTPLKFIFSH SALKEGWDNP NVFQICTLRE IGTERERRQT IGRGLRLCVD QHGERVRGFE VNRLTVIATE SYEQFAENLQ KEIEADTGIR FGILERHQFA AIAIKAADGT LAPLGVDQSR ALWDHLRAAG HLDAKGKVQD SLKVALKNGT LALPPAFEAQ RTQIAEVLRK VSGRLEIKNA DDRRAVLLRK GADGKAVYLS EDFKALWDRI KHRTTYRVQF DNAKLLQDCT TELKKAPAIA KARLQWRKAD IAIGKAGVQA TEKEGAATIV LDETDIELPD LLTDLQDRTQ LTRRSIVTIL TESGRLNEFR RNPQQFIEQT TEVINRCKRL ALVDGIKYQK MGDDHFYAQE LFAQDELTGY LRNMLLDTGR SIYEHVIYDS DTERGFADAL EKNDGVVLYA KLPSWFKVPT PLGPYNPDWA ILFDQDGTQR LYFVVETKSS LFADDLRDKE SAKIECGRAH FAALGVGENP AKYLVATSLG DVLKTMA
|
| |