Gene YPK_3674 details


Gene Information

Locus tag: YPK_3674
Symbol:
ID: 6089590
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis YPIII
Kingdom: Bacteria
Replicon accession: NC_010465
Strand:
Start bp: 4056282
End bp: 4059509
Gene Length: 3228 bp
Protein Length: 1075 aa
Translation table: 11
GC content: 45%
IMG OID: 641598761
Product: HsdR family type I site-specific deoxyribonuclease
Protein accession: YP_001722394
Protein GI: 170025889
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
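
The coordinate and length fields above are mutually consistent. The following minimal sketch checks that arithmetic, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (both are assumptions, not stated in the record). The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 4056282
end_bp = 4059509

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# which does not contribute an amino acid to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (3228 bp) and Protein Length (1075 aa).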


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCACCT TTAAAACCGA AGCGCAATTT GAGCAGGCCT TTATTGAAGT ACTGACCCAT 
AAGGGCTGGG AAACAGAGAT ACTCAAAAAT AAAACCGAAG CGGATTTACT GCAAAATTGG
GCGTCCATTT TGTTTGAAAA TAATCGCCAG CGGGATCGCT TAAACGATGT GCCGTTAACC
GATACGGAAA TGCAGCAAAT TATTGAGCAA ATCAAAGAGC TTAAAACCCC GCTCAAGCTG
AATGGTTTAA TTAACGGCAA AACCGTGGCG ATTAAGCGCG ATAACCCAGC CGATACCTTG
CATATGGGCA AAGAAGTCAG CTTAAAAATA TACGATCGCC AAGAAATCGC CGCAGGCCAA
AGCCGTTACC AAATTGTGCA GCAACCCAAA TTTGAACGCG GCAGCCCTTT GCGCAATGAC
CGCCGTGGCG ATGTGCTCTT ACTGATCAAC GGTATGCCGG TTATCCATGT CGAGCTAAAG
CGCAGCGGCA TTCCGGTTAG CCAGGCAGTC AACCAAATTG AAAAATACAG TAAAGAGGGG
GTATTTAGCG GCCTGTTTTC GCTCATCCAA GTGTTTGTGG CCATGGAGCC AAACGAGGCC
AAATACTTTG CCAACCCCGG GCTAGACGGC AGATTTAACC CCGACTATCA ATTTAACTGG
GCCGATTTTA ATAACGAACC CATGAACCAC TGGAAAGACA TCGCCTCTAC CTTGCTTTCT
ATCCCTATGG CACACCAGTT GATTGGCTTT TATACCGTCG CCGACGATAC CGACGGCGTG
CTTAAAGTGA TGCGCAGTTA TCAGTATTAC GCCGCCAATG CGATATCTGA CAAAGTGGCC
AAAACCAACT GGCAGCAACT GGGGAGTGCG GCCAATAACC CTGACCGCCT CGGTGGTTAT
GTGTGGCATA CCACCGGTTC GGGTAAAACC ATGACCAGCT TTAAATCGGC ACAATTGATT
GCACAATCGA AAGATGCCGA TAAAGTGATT TTTTTAATGG ACAGGATCGA GCTGGGCACC
CAGTCGCTCG CGGAATACCG CAATTTTGCT GGCGATGGCG AAGACGTGCA AGCCACCGAA
AATACTCATG TACTCATTAC CAAATTAAAA AGCACCGCTC CCGCCGATAC GTTAATTGTT
AGCTCTATTC AAAAAATGAG TAATATTTTT GAAGAAGTTG ATGATGAGGG AACAGCAACA
AATTCGGCCG ACATAGAAAA AATCCGCGCT AAGCGCTTGG TGTTTATTAT CGATGAGGCG
CACCGCTCTA CCTTTGGCGA CATGCTGATT ATTATTAAAC GTACCTTTCC GCGCGCCTTG
TTTTTTGGCT TTACCGGTAC GCCGATTCAA GAAGAAAACG AAAAAAACGG CAATACCACC
AGTACCGTAT TTGGCAACGA GCTACACCGC TACAGCATTG CTGACGGTAT TCGCGACGGC
AATGTTTTGG GCTTTGACCC TTACAAAGTC TGTACCTTTA AAGATAAAGA TCTACGCCAG
GCCGTCGCTC TTGAACAGGC TAAGGCCGAT TCGGTTGCAG ATGCAATGTC TACGGCAGCC
AAAAAGAAAA AATTTAATTA TTTTATGAAT GATGTGCCAA TGGCTGGCCA TAAAGATGCA
ACAGGTAAGT ATCTCAAAGG CATAGAAGAT TATGTGCCAA AAGAGCAATA TCTCACTGAA
ACCCATCAAG AAAAAGTAGT AGAAGATATT CTGGCAGAGT GGGATGTGCT CAGCCAGGCC
AATAAGTTTC ACGCTATTTT AGCCACTAGC AGCATTGCCG AAGCCATTGA CTATTACCGC
CGTTTAAAAG CCGCCAAACC TGAACTTAAA GTATCGGCCC TATTTGACCC AAATATTGAT
AACGACGGCA GTGGCGACCG TGGCCCCACC TTTAAAGGCG ATGGCCTGGA CGAAATTATG
GCCGACTATA ACGCGCGTTA TGGCCAGGAC TTTGATTTTG CCCGCCACGC GGCCTTTAAA
AAAGATTTAG CGGCACGACT TGCCCATAAA AAGCCTTACG AGCGCATCCA TACCGAGCCT
TCGAAGCAAT TAGATTTACT GATTGTGGTG GATCAAATGC TCACTGGCTT TGACTCTAAA
TGGCTCAATA CCTTGTATTT AGACAAGGTG ATTAAATACC AAAATATTAT TCAAGCGTTC
TCGCGTACCA ATCGCTTGTT TGGCCCCGAC AAACCCCATG GCATCATCCG TTATTACCGT
TATCCACACA CGATGGAGCA ACATATTAAT GATGCGGTAA AACTCTATTC CGGCGACAGA
CCTATCGGCT TATTTGTTGA TAGGTTAGAA AGCAACCTTA AAGCCATAAA TGAATTAGTG
GCGGACATTA CCGAGCTATT CGTCAGTGCG GGTGTTGAGA ACTTTGAAAA ACTGCCAGAC
GATATAGAAA CCTGTGCCCA ATTCGCCAAA TTATTTAACA CCTTTAGCCA ACACCTGGCA
GCGGCTAAAG TACAAGGTTT TCATTGGGAA CAGTCGATCT ATTCCTTTAC TGAAAATGAT
GTAGAACATG AGGTAACGCT GGCCATAGAC GAACAAACTT ACCTGAGCCT AGTCCTGCGC
TATAAAGAGT TGGTAGGCAA AGGTGATGGT GGTGGCGCAG GTGAGGGCGA TGTGCCCTTT
GATATCAGTG GTTATTTAAC TGAAATAGAT ACCGGCAAAA TCGATGCCGA CTACATGAAC
AGCCGCTTTG ATAAATTTTT AAAAGAGCTG AACCAACACC AAGACTCTGC GAATATTGAA
ACCACATTAA ATGAGCTACA CAAGTCGTTT GCATCGCTCA CCCAAAGCGA GCAAAAGTAC
GCCAAGCTCT TCTTGCACGA CTTGCAGCGC GGCGATGCGC AGTTAATTGA AGGCCATACT
TTTAGAGACT ACATCAACAC CTACAAAGAT AACGCTGAAA ATGCGCAATT AAACGCCGTT
GTTAATGTTC TTGGTTTAGA TAAAGAACGG CTCATAGCAT TAATGGCTGA TAGTGTTAAT
GACAAGAATC TCAACGACTT TGGTCGCTTC GACGCATTAA AAGATTCGGT AGATAAAGAA
AAAGCCAAGG TCTATTTTGA AAAACAAGAC AGTGTAACCC TACCTCTATT TAAGCTGAAT
ATTCGCATTG ATCAGTTTTT AAAGCAGTTT ATTTTGGCAC AAACGGATGA TTTATTAAGT
GACAGAGATG TTGTTGGTGA TGTGATGGAC GTCCCTCCCT CAGCGTAA
 
Protein sequence
MTTFKTEAQF EQAFIEVLTH KGWETEILKN KTEADLLQNW ASILFENNRQ RDRLNDVPLT 
DTEMQQIIEQ IKELKTPLKL NGLINGKTVA IKRDNPADTL HMGKEVSLKI YDRQEIAAGQ
SRYQIVQQPK FERGSPLRND RRGDVLLLIN GMPVIHVELK RSGIPVSQAV NQIEKYSKEG
VFSGLFSLIQ VFVAMEPNEA KYFANPGLDG RFNPDYQFNW ADFNNEPMNH WKDIASTLLS
IPMAHQLIGF YTVADDTDGV LKVMRSYQYY AANAISDKVA KTNWQQLGSA ANNPDRLGGY
VWHTTGSGKT MTSFKSAQLI AQSKDADKVI FLMDRIELGT QSLAEYRNFA GDGEDVQATE
NTHVLITKLK STAPADTLIV SSIQKMSNIF EEVDDEGTAT NSADIEKIRA KRLVFIIDEA
HRSTFGDMLI IIKRTFPRAL FFGFTGTPIQ EENEKNGNTT STVFGNELHR YSIADGIRDG
NVLGFDPYKV CTFKDKDLRQ AVALEQAKAD SVADAMSTAA KKKKFNYFMN DVPMAGHKDA
TGKYLKGIED YVPKEQYLTE THQEKVVEDI LAEWDVLSQA NKFHAILATS SIAEAIDYYR
RLKAAKPELK VSALFDPNID NDGSGDRGPT FKGDGLDEIM ADYNARYGQD FDFARHAAFK
KDLAARLAHK KPYERIHTEP SKQLDLLIVV DQMLTGFDSK WLNTLYLDKV IKYQNIIQAF
SRTNRLFGPD KPHGIIRYYR YPHTMEQHIN DAVKLYSGDR PIGLFVDRLE SNLKAINELV
ADITELFVSA GVENFEKLPD DIETCAQFAK LFNTFSQHLA AAKVQGFHWE QSIYSFTEND
VEHEVTLAID EQTYLSLVLR YKELVGKGDG GGAGEGDVPF DISGYLTEID TGKIDADYMN
SRFDKFLKEL NQHQDSANIE TTLNELHKSF ASLTQSEQKY AKLFLHDLQR GDAQLIEGHT
FRDYINTYKD NAENAQLNAV VNVLGLDKER LIALMADSVN DKNLNDFGRF DALKDSVDKE
KAKVYFEKQD SVTLPLFKLN IRIDQFLKQF ILAQTDDLLS DRDVVGDVMD VPPSA
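
The sequences above are printed in blocks of ten characters, so the spacing has to be removed before any analysis. The sketch below shows one way to do that, to translate a DNA string with the amino acid assignments of translation table 11, and to compute a GC fraction. The function names (`clean`, `translate`, `gc_fraction`) are hypothetical helpers, not part of any database tool. The sketch does not handle the alternative start codons that table 11 allows, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11,
# with codons ordered TTT, TTC, TTA, TTG, TCT, ... (bases in T, C, A, G order).
AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product("TCAG", repeat=3), AAS)}


def clean(seq_text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def translate(dna):
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in an already cleaned sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGACCACCT TTAAAACCGA AGCGCAATTT"
print(translate(clean(first_blocks)))  # MTTFKTEAQF
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence through `clean` and then `gc_fraction` would give a value to compare against the listed GC content.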