Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_2344 |
Symbol | |
ID | 5456649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | - |
Start bp | 2525176 |
End bp | 2528313 |
Gene Length | 3138 bp |
Protein Length | 1045 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640877922 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_001413613 |
Protein GI | 154252789 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.0184878 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCTCAGG CGGACGCTGC ACGCTTGCGG GGTGGCCGCT TGCGCAATGT GGTGCTGGAC GATGTTCTTG CGGATCAGCT GCTGAAGATA AATAGCTTCA CCCATCGCGG CCGGGAGTAT CCCTTCGATC TCGAGGATGC ACATGAAGCC ATTCGGCGGC TTAAGCCTAC GCCTGATAGA CAGAAGGGTC TTCGCGGTAC CAACCAGGAC ATTTACGACA CGCTTGTTCT TGGGACGACC ATCACCAAGA CAATTCAAGG CGACTCGAAG TCCTACTCTT TCCGGTTCAT CGACTGGGAT ACGCCGTCAA ACAATATCTT TCACGTCACC GCCGAACTTG CCGTCGAGCG AACGGCCAGC ACGCAAACGA AGCGGTGCGA CATTGTTGGC TATGTGAATG GCATTCCCTT TCTCGTTGTC GAAAACAAGC GATCGACCGA AAGCCTGAAG AAAGCCAAAA GCCAGTTGAT CGGCTATCAG AACGAAGACA ACATCCCGCA ACTGTTTCAT TTCGCCCAGC TTTTGATGGT GATGAATCGA GCGGAGGCGC GCTATGCGAC TGTCGGCGCT ACCCAGCAGT ATTGGCAGAC GTGGCGCGAC GATGAAGACA GGGACGAAGA TATTGCCAAA CAAGCAAACC GCCCCCTGTC GCGGGAAGAG GCGGATGCGA TCTTTTCCGG AGATTTTTCC GATGCCCGTG CATTCTTCGA AGCCATGGCG GCCGAAGGAC CTCGTGCGAT AACAGCGCAG GACAGGGCGA TCCATGCCCT GTGCCGACCG GAGCGGCTGC TCGATCTGGT GCGCCGGTTC ACTGTGTTTG ACGGAGGCGT GCGAAAAATT GCCCGTCACC AGCAATTCTT CGGCATCCGG GAGACGGTGG AGCGCATTCG TCACTATGAC ATTGGCGGCC GGCGAAAGGG GGGCGTGATA TGGCATACCC AGGGATCCGG CAAGTCGTTG ACGATGGTCA TGCTCGGCCG GGCGCTGGCA CTCGACAAGG CGATACCGAA CCCGCGGATC ATCATCGTCA CGGACCGCGA AGATCTGGAC AAGCAGATCA AGGACACGTT CAAGTCGTGC GACATGAAGC CGATCCGCGC GACGAGCGGG AGTGATCTCG TCGAACACAT CCACAACAAG ACGGCCCTCG TCACGACGAT CATCAACAAG TTCGACAGCG CGGCAAAGAA TGCCGCGGAA GTCGATGAGG ATAGCAATAT CTTCGTGCTG GTTGACGAGA GCCACAGGTC TCAAACCGGC CGCTACGGTG GCCATAGCCA GTTCGCCACG AAGATGCGCC GCCTGTTGCC CAAGGCTTGC TATATCGGCT TCACCGGCAC GCCGTTGCTG AAGAAGGAAA AGAACACTCT CGCGACCTTT GGCGGTCTGA TTCACAAGTA TCCGATCGAC GAAGCTGTGG CAGACCGTGC GGTTGTGCCT CTGCTCTATG AAGGGCGGCT TGTCGAGCAG CAGGTGAATG CGGGTGTGAT TGATCGCTGG TTCGAAAAGA TCAGCGAGGG CCTCAGCAAG AGTCAAAAGG CCGATCTAAA GCGCAAATTC TCTCGCACCG ACGCGCTGTC GAAGACCGGC CAGGCGATCC GGGCCAAGGC TTTCGACATT TCAGAGCACT ACCGTCAGCA CTGGCAGGGC ACCGGGTTCA AGGCGCAGCT CGTTGCTCCC TCAAAGGCGG CCGCCGTTCG CTTCAAGGAA GTGCTGGACG AAATCGGCCA TGTGACGAGC GAAATCATTA TCTCGGCGCC TGATGAAAAC GAAGGCAACG AGGAGGTGGA TCGCGAGTCG AAGGATATCG TTCGTGCGTT CTGGACACGC ATGATGGCGC GCTACAGGAA CGAGGACGAG TATAACAGGC AGATTACCGA AGCCTTCAAA GGATCCGGGG ATCCGGAAAT CCTGATAGTT GTGTCGAAGC TCCTCACCGG CTTCGATGCA CCTCGCAACA CAGTACTCTA TGTCTGCAAG ACGCTTCGCG AGCATAACCT CCTGCAGGCT ATAGCGCGTG TTAACCGGCT CTATGAAGAG GATGGGACTG AGAAGGAATT CGGATTCATC ATCGACTATG AGGGCCTGCT AGGCGAACTC GATGCTGCGC TGACGACCTA TAGCGCCTTC GAGGGATATG AAGACGCCGA TCTCATTGGT GCCGTTCATG ACGTCCGCGA ACAGATCCGC AAGCTTCCGC ACGTGCATGA CCGGCTCTGG GACTTGTTCA AGTCGGTGAG AAACAAGAAG GATATGGAGC AATTCGAGCA GTTCCTCGGT GACGAAGCTA TCAGGAACGA GTTTTACGAA AGGTTGCGTG CCTTCAGCCG GTGTCTTCAC ATGTCGCTGT CATCGGACAA GCTTTTCGAC GTCTTCGACG AAGCCAGGGT CGAGACCATG AAGCGGGACT GGAGGCAGTT CACGGAATTG CGGCGGTCGG TCCAGCTGCG CTATCAGGAG ACAGTCGACG TCACCGAATT CGAGCCGAAG ATACAGAAGC TCCTCGACGA TCATGTCGTC GCCATGCCGG CAGAGACCAT CATCGAAATG GTCAATATCA ACGATCCCGA CGCACTAAAG GCTGTTGTCG ACGATAAGGG TGTATCCGAG GCGTCGAGGG CTGATCGGAT TGCGAGCGCC ACTCGCCGGG TTATCACCGA AAAGATGGAG GAAGACCCGA CCTTCTACAT GCGGTTCTCG GAACTCCTGG AAGAAACCAT TCGGGACTAT CGGGCGAGGC GAATATCGGA GCGCGAGTAT CTCGCTAGTG TGGTCGATCT TTCAAGCAAG ATTGCCCGCA AATATCGCGG ACGCACGGTT CCGGAGGTGA TCAAGGGCAA CGACGATGGG CAGGCGTTCT TCGGTGTCCT TGAGGGAACC ATGATATTCG CCGATGGCCG TGCGATGGAG GAAGATGAAG TGGCCGGCAT CGCCCTTGCA ATCATCGATA TCATCAAGGA GCACCACATC GTCGATGTCT GGTCCAATGA CCTGGCCCAG AACAAGATAC GCAATGCGAT CGACGATTAT TTCTTCGATG TTGTCCGCGA TGAGAAGGGG ATCAACCTTT CCGTCGGAAC TCTCGATGAC CTCGAAAGTC GTATTATGAA CCTCGCAAAG GCGCGGTTTC CTGGATGA
|
Protein sequence | MSQADAARLR GGRLRNVVLD DVLADQLLKI NSFTHRGREY PFDLEDAHEA IRRLKPTPDR QKGLRGTNQD IYDTLVLGTT ITKTIQGDSK SYSFRFIDWD TPSNNIFHVT AELAVERTAS TQTKRCDIVG YVNGIPFLVV ENKRSTESLK KAKSQLIGYQ NEDNIPQLFH FAQLLMVMNR AEARYATVGA TQQYWQTWRD DEDRDEDIAK QANRPLSREE ADAIFSGDFS DARAFFEAMA AEGPRAITAQ DRAIHALCRP ERLLDLVRRF TVFDGGVRKI ARHQQFFGIR ETVERIRHYD IGGRRKGGVI WHTQGSGKSL TMVMLGRALA LDKAIPNPRI IIVTDREDLD KQIKDTFKSC DMKPIRATSG SDLVEHIHNK TALVTTIINK FDSAAKNAAE VDEDSNIFVL VDESHRSQTG RYGGHSQFAT KMRRLLPKAC YIGFTGTPLL KKEKNTLATF GGLIHKYPID EAVADRAVVP LLYEGRLVEQ QVNAGVIDRW FEKISEGLSK SQKADLKRKF SRTDALSKTG QAIRAKAFDI SEHYRQHWQG TGFKAQLVAP SKAAAVRFKE VLDEIGHVTS EIIISAPDEN EGNEEVDRES KDIVRAFWTR MMARYRNEDE YNRQITEAFK GSGDPEILIV VSKLLTGFDA PRNTVLYVCK TLREHNLLQA IARVNRLYEE DGTEKEFGFI IDYEGLLGEL DAALTTYSAF EGYEDADLIG AVHDVREQIR KLPHVHDRLW DLFKSVRNKK DMEQFEQFLG DEAIRNEFYE RLRAFSRCLH MSLSSDKLFD VFDEARVETM KRDWRQFTEL RRSVQLRYQE TVDVTEFEPK IQKLLDDHVV AMPAETIIEM VNINDPDALK AVVDDKGVSE ASRADRIASA TRRVITEKME EDPTFYMRFS ELLEETIRDY RARRISEREY LASVVDLSSK IARKYRGRTV PEVIKGNDDG QAFFGVLEGT MIFADGRAME EDEVAGIALA IIDIIKEHHI VDVWSNDLAQ NKIRNAIDDY FFDVVRDEKG INLSVGTLDD LESRIMNLAK ARFPG
|
| |