Gene Plav_2344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_2344 
Symbol 
ID5456649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp2525176 
End bp2528313 
Gene Length3138 bp 
Protein Length1045 aa 
Translation table11 
GC content56% 
IMG OID640877922 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_001413613 
Protein GI154252789 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.0184878 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCTCAGG CGGACGCTGC ACGCTTGCGG GGTGGCCGCT TGCGCAATGT GGTGCTGGAC 
GATGTTCTTG CGGATCAGCT GCTGAAGATA AATAGCTTCA CCCATCGCGG CCGGGAGTAT
CCCTTCGATC TCGAGGATGC ACATGAAGCC ATTCGGCGGC TTAAGCCTAC GCCTGATAGA
CAGAAGGGTC TTCGCGGTAC CAACCAGGAC ATTTACGACA CGCTTGTTCT TGGGACGACC
ATCACCAAGA CAATTCAAGG CGACTCGAAG TCCTACTCTT TCCGGTTCAT CGACTGGGAT
ACGCCGTCAA ACAATATCTT TCACGTCACC GCCGAACTTG CCGTCGAGCG AACGGCCAGC
ACGCAAACGA AGCGGTGCGA CATTGTTGGC TATGTGAATG GCATTCCCTT TCTCGTTGTC
GAAAACAAGC GATCGACCGA AAGCCTGAAG AAAGCCAAAA GCCAGTTGAT CGGCTATCAG
AACGAAGACA ACATCCCGCA ACTGTTTCAT TTCGCCCAGC TTTTGATGGT GATGAATCGA
GCGGAGGCGC GCTATGCGAC TGTCGGCGCT ACCCAGCAGT ATTGGCAGAC GTGGCGCGAC
GATGAAGACA GGGACGAAGA TATTGCCAAA CAAGCAAACC GCCCCCTGTC GCGGGAAGAG
GCGGATGCGA TCTTTTCCGG AGATTTTTCC GATGCCCGTG CATTCTTCGA AGCCATGGCG
GCCGAAGGAC CTCGTGCGAT AACAGCGCAG GACAGGGCGA TCCATGCCCT GTGCCGACCG
GAGCGGCTGC TCGATCTGGT GCGCCGGTTC ACTGTGTTTG ACGGAGGCGT GCGAAAAATT
GCCCGTCACC AGCAATTCTT CGGCATCCGG GAGACGGTGG AGCGCATTCG TCACTATGAC
ATTGGCGGCC GGCGAAAGGG GGGCGTGATA TGGCATACCC AGGGATCCGG CAAGTCGTTG
ACGATGGTCA TGCTCGGCCG GGCGCTGGCA CTCGACAAGG CGATACCGAA CCCGCGGATC
ATCATCGTCA CGGACCGCGA AGATCTGGAC AAGCAGATCA AGGACACGTT CAAGTCGTGC
GACATGAAGC CGATCCGCGC GACGAGCGGG AGTGATCTCG TCGAACACAT CCACAACAAG
ACGGCCCTCG TCACGACGAT CATCAACAAG TTCGACAGCG CGGCAAAGAA TGCCGCGGAA
GTCGATGAGG ATAGCAATAT CTTCGTGCTG GTTGACGAGA GCCACAGGTC TCAAACCGGC
CGCTACGGTG GCCATAGCCA GTTCGCCACG AAGATGCGCC GCCTGTTGCC CAAGGCTTGC
TATATCGGCT TCACCGGCAC GCCGTTGCTG AAGAAGGAAA AGAACACTCT CGCGACCTTT
GGCGGTCTGA TTCACAAGTA TCCGATCGAC GAAGCTGTGG CAGACCGTGC GGTTGTGCCT
CTGCTCTATG AAGGGCGGCT TGTCGAGCAG CAGGTGAATG CGGGTGTGAT TGATCGCTGG
TTCGAAAAGA TCAGCGAGGG CCTCAGCAAG AGTCAAAAGG CCGATCTAAA GCGCAAATTC
TCTCGCACCG ACGCGCTGTC GAAGACCGGC CAGGCGATCC GGGCCAAGGC TTTCGACATT
TCAGAGCACT ACCGTCAGCA CTGGCAGGGC ACCGGGTTCA AGGCGCAGCT CGTTGCTCCC
TCAAAGGCGG CCGCCGTTCG CTTCAAGGAA GTGCTGGACG AAATCGGCCA TGTGACGAGC
GAAATCATTA TCTCGGCGCC TGATGAAAAC GAAGGCAACG AGGAGGTGGA TCGCGAGTCG
AAGGATATCG TTCGTGCGTT CTGGACACGC ATGATGGCGC GCTACAGGAA CGAGGACGAG
TATAACAGGC AGATTACCGA AGCCTTCAAA GGATCCGGGG ATCCGGAAAT CCTGATAGTT
GTGTCGAAGC TCCTCACCGG CTTCGATGCA CCTCGCAACA CAGTACTCTA TGTCTGCAAG
ACGCTTCGCG AGCATAACCT CCTGCAGGCT ATAGCGCGTG TTAACCGGCT CTATGAAGAG
GATGGGACTG AGAAGGAATT CGGATTCATC ATCGACTATG AGGGCCTGCT AGGCGAACTC
GATGCTGCGC TGACGACCTA TAGCGCCTTC GAGGGATATG AAGACGCCGA TCTCATTGGT
GCCGTTCATG ACGTCCGCGA ACAGATCCGC AAGCTTCCGC ACGTGCATGA CCGGCTCTGG
GACTTGTTCA AGTCGGTGAG AAACAAGAAG GATATGGAGC AATTCGAGCA GTTCCTCGGT
GACGAAGCTA TCAGGAACGA GTTTTACGAA AGGTTGCGTG CCTTCAGCCG GTGTCTTCAC
ATGTCGCTGT CATCGGACAA GCTTTTCGAC GTCTTCGACG AAGCCAGGGT CGAGACCATG
AAGCGGGACT GGAGGCAGTT CACGGAATTG CGGCGGTCGG TCCAGCTGCG CTATCAGGAG
ACAGTCGACG TCACCGAATT CGAGCCGAAG ATACAGAAGC TCCTCGACGA TCATGTCGTC
GCCATGCCGG CAGAGACCAT CATCGAAATG GTCAATATCA ACGATCCCGA CGCACTAAAG
GCTGTTGTCG ACGATAAGGG TGTATCCGAG GCGTCGAGGG CTGATCGGAT TGCGAGCGCC
ACTCGCCGGG TTATCACCGA AAAGATGGAG GAAGACCCGA CCTTCTACAT GCGGTTCTCG
GAACTCCTGG AAGAAACCAT TCGGGACTAT CGGGCGAGGC GAATATCGGA GCGCGAGTAT
CTCGCTAGTG TGGTCGATCT TTCAAGCAAG ATTGCCCGCA AATATCGCGG ACGCACGGTT
CCGGAGGTGA TCAAGGGCAA CGACGATGGG CAGGCGTTCT TCGGTGTCCT TGAGGGAACC
ATGATATTCG CCGATGGCCG TGCGATGGAG GAAGATGAAG TGGCCGGCAT CGCCCTTGCA
ATCATCGATA TCATCAAGGA GCACCACATC GTCGATGTCT GGTCCAATGA CCTGGCCCAG
AACAAGATAC GCAATGCGAT CGACGATTAT TTCTTCGATG TTGTCCGCGA TGAGAAGGGG
ATCAACCTTT CCGTCGGAAC TCTCGATGAC CTCGAAAGTC GTATTATGAA CCTCGCAAAG
GCGCGGTTTC CTGGATGA
 
Protein sequence
MSQADAARLR GGRLRNVVLD DVLADQLLKI NSFTHRGREY PFDLEDAHEA IRRLKPTPDR 
QKGLRGTNQD IYDTLVLGTT ITKTIQGDSK SYSFRFIDWD TPSNNIFHVT AELAVERTAS
TQTKRCDIVG YVNGIPFLVV ENKRSTESLK KAKSQLIGYQ NEDNIPQLFH FAQLLMVMNR
AEARYATVGA TQQYWQTWRD DEDRDEDIAK QANRPLSREE ADAIFSGDFS DARAFFEAMA
AEGPRAITAQ DRAIHALCRP ERLLDLVRRF TVFDGGVRKI ARHQQFFGIR ETVERIRHYD
IGGRRKGGVI WHTQGSGKSL TMVMLGRALA LDKAIPNPRI IIVTDREDLD KQIKDTFKSC
DMKPIRATSG SDLVEHIHNK TALVTTIINK FDSAAKNAAE VDEDSNIFVL VDESHRSQTG
RYGGHSQFAT KMRRLLPKAC YIGFTGTPLL KKEKNTLATF GGLIHKYPID EAVADRAVVP
LLYEGRLVEQ QVNAGVIDRW FEKISEGLSK SQKADLKRKF SRTDALSKTG QAIRAKAFDI
SEHYRQHWQG TGFKAQLVAP SKAAAVRFKE VLDEIGHVTS EIIISAPDEN EGNEEVDRES
KDIVRAFWTR MMARYRNEDE YNRQITEAFK GSGDPEILIV VSKLLTGFDA PRNTVLYVCK
TLREHNLLQA IARVNRLYEE DGTEKEFGFI IDYEGLLGEL DAALTTYSAF EGYEDADLIG
AVHDVREQIR KLPHVHDRLW DLFKSVRNKK DMEQFEQFLG DEAIRNEFYE RLRAFSRCLH
MSLSSDKLFD VFDEARVETM KRDWRQFTEL RRSVQLRYQE TVDVTEFEPK IQKLLDDHVV
AMPAETIIEM VNINDPDALK AVVDDKGVSE ASRADRIASA TRRVITEKME EDPTFYMRFS
ELLEETIRDY RARRISEREY LASVVDLSSK IARKYRGRTV PEVIKGNDDG QAFFGVLEGT
MIFADGRAME EDEVAGIALA IIDIIKEHHI VDVWSNDLAQ NKIRNAIDDY FFDVVRDEKG
INLSVGTLDD LESRIMNLAK ARFPG