Gene PP_4742 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPP_4742 
SymbolhsdS 
ID1042031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas putida KT2440 
KingdomBacteria 
Replicon accessionNC_002947 
Strand
Start bp5396987 
End bp5398717 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content52% 
IMG OID637148146 
Producttype I restriction-modification system, S subunit 
Protein accessionNP_746850 
Protein GI26991425 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.266617 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00441093 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGGCGC TACTCATCGA CAACCTGCCG CTGCTGGCCG GTGCACCCAA TGGCATCAGA 
AAGTTGCGCG AGCTGATTCT GGAACTGGCG GTGCGCGGTA AGTTGGTACC GCAAGAGCCT
AATGATGAGC CGGCCAGTGA GTTGCTAGCG CGAATTGCCT TAGAGAGTGC ACAGCTAGTG
GCGGGGGGTA AGGCAAAGAA AGGAAATGTT AAAGCTGGCG TAGTAGCGCA AGGTCTAGAC
AGCGCTGAAC TACCCACAAC TTGGATATGG ACATCGTTCG ATGATCTAAT AAATCCAGAA
TACCCAATCG CTTATGGGGT CCTTGTACCG GGGCCTGATG TAGCAGATGG AGTTCCTTTT
GTTCGAATTG CTGATCTTGA TTTGGTGGCA CCGCCACATA AGCCCGAAAA GTCAATAAGC
CCAGAAGTTG ACCGCCAATA TGAACGGACT CGCATTCGCG GCGGCGAAAT TCTTATGGGG
GTAGTTGGCA GCATCGGGAA ACTGGGTATT GCTCCTGAAA GCTGGGCCGG CGCCAACATT
GCCAGAGCTA TTTGCCGTGT AGTCCCCAGT GTGCATGTCT CGAAGGACTA CATAATTTGG
TTGCTACAAA GTGACCTTAT GCGAAAGCAG TTCTTGGGAG ATACTCGAAC CCTCGCCCAG
CCAACATTAA ATGTTGGCCT GATTCGTAGT GCCGCGGCAC CTCTGCCTCC CCTCGCCGAA
CAACACCGCA TCGTTGCCAA AGTCGAAGAA CTAATGGCCC TATGCGACCG CTTGGAAGCC
CAGCAGGCCG ACGCCGAGTC CGCCCACGTC CAACTGGTGC AGGCGATGCT CGACAGCCTG
ACCCAAGCCA TCGACGCCGC CGACTTCGCG ACCAGTTGGC AGCGCCTAGC CGAGCACTTT
CACACCCTGT TCACAAACGA GTTTGCCATC GACGCCCTCA AGAAAACCCT CTTGCAACTT
GCCGTGATGG GAAAGCTTGT CCCGCAAGAC GTTACTGACG AATCCGCCAG CGAACTGTTA
AAACGCATCG AGGGAGAAAA ACAGCGGTTG GTGAACGAAG GATTGATGAA AAAACAAAAG
CCATTGGTGG AAAGTACAAG CGGGCAAATC AAACCCGCAC TGCCTTCTTC GTGGAAATGG
GTACCTCTTT TAGACATAAC AACGGGTATG GATTCTGGAT GGAGCCCTGC ATGTCTTGGT
AATAGTTCGC CTTCTGATGA TGTGTGGGGG GTCCTCAAAA CAACAGCAGT TCAGGTAATG
AGCTATCTAC AACACGAAAA TAAAGAGCTA CCTAGCCACC TTGAACCTCG CCCTGAGGCA
GAGACTAAGG TCGGTGATAT ACTATTCACT CGGGCTGGCC CTATGAACCG CGTGGGAATT
TCTTGCCTCG TTGAAAGTAC GCGCCCGAAA TTAATGATTT CCGACAAGAT CATCAGGTTT
CATCCTGTTG AGCTGGGTGT TTATGGAAGG TTTGTCGCTC TTTGCTTGAA CGCTGGAGAA
ACTGCCAAAT ATCTCGAACA GGCCAAGTCC GGCATGGCTG CAAGTCAGGT CAATATCTCG
CAAGAAAAGC TGAGATTGGC TCCGATCCCG TTAGCACCGC TCCGCGAGCA ACACCGCATT
GTCACAAAGG TCGATCAGCT TATGAAACTA TGCGATACGC TAAAGCAACA GATAAATGTA
GCTCGCAGCA AGCAAACAGA GCTATTAGAT ACCCTTATGG CTCAGGTGTA G
 
Protein sequence
MTALLIDNLP LLAGAPNGIR KLRELILELA VRGKLVPQEP NDEPASELLA RIALESAQLV 
AGGKAKKGNV KAGVVAQGLD SAELPTTWIW TSFDDLINPE YPIAYGVLVP GPDVADGVPF
VRIADLDLVA PPHKPEKSIS PEVDRQYERT RIRGGEILMG VVGSIGKLGI APESWAGANI
ARAICRVVPS VHVSKDYIIW LLQSDLMRKQ FLGDTRTLAQ PTLNVGLIRS AAAPLPPLAE
QHRIVAKVEE LMALCDRLEA QQADAESAHV QLVQAMLDSL TQAIDAADFA TSWQRLAEHF
HTLFTNEFAI DALKKTLLQL AVMGKLVPQD VTDESASELL KRIEGEKQRL VNEGLMKKQK
PLVESTSGQI KPALPSSWKW VPLLDITTGM DSGWSPACLG NSSPSDDVWG VLKTTAVQVM
SYLQHENKEL PSHLEPRPEA ETKVGDILFT RAGPMNRVGI SCLVESTRPK LMISDKIIRF
HPVELGVYGR FVALCLNAGE TAKYLEQAKS GMAASQVNIS QEKLRLAPIP LAPLREQHRI
VTKVDQLMKL CDTLKQQINV ARSKQTELLD TLMAQV