Gene YpsIP31758_3538 details


Gene Information

Locus tag: YpsIP31758_3538
Symbol:
ID: 5387516
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 3995615
End bp: 3998863
Gene Length: 3249 bp
Protein Length: 1082 aa
Translation table: 11
GC content: 44%
IMG OID: 640866553
Product: HsdR family type I site-specific deoxyribonuclease
Protein accession: YP_001402492
Protein GI: 153947414
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
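
The coordinate, gene length, and protein length fields above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3995615
end_bp = 3998863

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed gene length of 3249 bp and protein length of 1082 aa.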


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0228194
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCACCT TTAAAACCGA AGCGCAATTT GAGCAGGCCT TTATTGAGGT TCTCACCAAC 
AAAGGTTGGG AACAGAAGAT ACTCAAAAAC AAAACCGAAG CGGATTTACT GCAAAACTGG
GCAAACATTT TGTTTGAAAA TAATCGCCAG CGGGATCGCT TAAACGATGT GCCGTTAACC
GATACGGAAA TGCAGCAAAT TATTGAGCAA ATCAAAGAAC TTAAAACACC GCTCAAGCTC
AACGGTTTAA TTAACGGCAA AACCGTGGCC ATTAAGCGCG ATAACCCAGC CGATACTTTG
CATATGGGCA AAGAAGTCAG CTTAAAAATA TACGATCGCC AAGAAATTGC AGCCGGCCAA
AGCCGTTACC AAATTGTGCA ACAACCCAAA TTTGAACGTG GCAGCCCCTT GCGTAACGAC
CGACGCGGCG ATGTGCTATT ACTGATCAAC GGTATGCCGG TGATCCATTT AGAGCTGAAG
CGCAGCGGCA CTCCGGTTAG CCAGGCAGTC AACCAAATTG AAAAGTACAG CAAAGAGGGC
GTCTTTAGCG GCCTGTTTTC GCTCATCCAA GTGTTTGTAG CCATGGAGCC AAACGAGGCC
AAATACTTTG CCAACCCCGG GCTAGACGGT AAGTTTAACC CCGACTATCA ATTTAACTGG
GCCGATTTTA ATAACGAACC CATGAACCAC TGGAAAGACA TCGCCTCTAC CTTGCTTTCT
ATCCCTATGG CGCACCAGTT GATTGGCTTT TATACCGTCG CCGACGATAC CGACGGCGTG
CTTAAAGTGA TGCGCAGTTA TCAGTATTAC GCTGCCAATG CAATATCTGA CAAAGTGGCC
AAAACCAACT GGCAGCAACT GGGTAGCGCG GCCAATAACC CCGATCGCCT CGGGGGTTAT
GTGTGGCATA CCACCGGTTC GGGTAAAACC ATGACCAGCT TTAAATCGGC GCAGTTGATC
GCACAATCCA AAGATGCCGA TAAAGTGATT TTTTTAATGG ACAGGATTGA ACTGGGCACC
CAGTCGCTCG CGGAATATCG CAATTTTGCT GGCGATGGTG AAGACGTGCA AGCCACCGAA
AATACTCATG TACTCATTAC CAAATTAAAA AGCAGTGCAA CTGCCGATAC GTTAATTGTT
AGCTCTATTC AAAAAATGAG TAATATTTTT GAAGAAGTTG ATGATGAGGG AACAGCAACA
AATTCGGCCG ACATAGAAAA AATCCGCGCT AAGCGCTTGG TGTTTATTAT CGATGAGGCC
CATCGCTCGA CGATGAGTGG AGGCAAAGAA AATAAAGAAG GCATGTTGGT AAGAATTAAA
AAAACATTCC CCAAGGCACT CTTTTTTGGT TTTACCGGCA CGCCGATTCA TGATGAAAAT
CAAGTTAATA GCAATACTAC CGCTGATGTG TTTGGCAACG AGCTACACCG CTACAGCATT
GCCGATGGTA TTCGCGACGG CAATGTTTTG GGCTTTGACC CTTACAAAAT CTGTACCTTT
AAAGATAAAG GTTTACGCCA GGCCGTCGCC CTTGAACAGG CTAAGGCCGA TTCGGTTGCA
GATGCAATGT CTACGGCTGC CAAAAAGAAA AAATTTAATT ATTTTATGAA TGATGTGCTA
ATGGCTGGCC ATAAAGATGC AACAGGCAAG TATATCAAAG GCATAGAAGA TTATGTGCCA
AAAGAACAAT ATCTCACTGC AGACCATCAA GAAAAGGTGG TAGAAGATAT TTTGGCCGAG
TGGGATGTGC TCAGCCAAGG CAATAAATTT CACGCTATTT TAGCCACTAA CAGCATTGCC
GAAGCCATTG ACTATTACCG CCGTTTAAAA GCCGCCAAAC CTGAACTTAA AGTATCGGCC
CTATTTGACC CAAATATTGA TAACGACGGC AGTGGCGACC GAGACCCCAC CTTTAAAGGC
GATGCTCTGG ACGAAATTAT GGCCGACTAT AATGCGCGTT ATGGCCAGGA TTTTGATTTT
GCCCGCCACG CGGCCTTTAA AAAAGATTTA GCGGCACGAC TTGCCCATAA AAAGCCCTAC
GAGCGCATCC ATACCGAGCC TTCGAAGCAA TTAGATTTAC TGATTGTTGT AGATCAAATG
CTCACTGGCT TTGACTCTAA ATGGCTCAAT ACCTTGTATT TAGACAAGGT GATTAAATAC
CAAAATATTA TTCAAGCGTT CTCGCGCACC AATCGCTTGT TTGGCCCCGA CAAACCCCAC
GGTATCATCC GTTATTATCG TTATCCACAC ACCATGGAGC AGTACATTAA CGATGCGGTA
AAACTCTATT CCGGCGACAG ACCTATCGGC TTATTTGTTG ACAAGTTAGA AAGCAACCTT
AAAGCCATGA ATGAATTAGT CGCGGACATT ACCGAGCTAT TCGTCAGTGC GGGTGTTGAG
AACTTTGAAA AACTGCCAGA CGATATAGAA ACCTGTGCCC AATTCGCCAA ATTATTTAAC
ACCTTTAGCC AACACCTGGA AGCGGCTAAA GTACAAGGTT TGCATTGGGA ACAGTCGGTC
TATTCCCCTA CTGAAAATGA TGTAGAACAT GAGGTAACGC TGGCTATAGA CGAACAAACT
TACCTGAGCC TGGTTCTGCG TTACAAAGAG TTAGTCGCCA AAGGTGATGG TGGTGGCGCA
GGTGGCGGCG ATGTGCCTTT TGATATCAGT GGTTATTTAA CTGAAATAGA TACCGGCAAA
ATCGATGCCG ACTATATGAA CAGCCGCTTT GATAAATTTT TAAAAGAGCT GAACCAACAC
CAAGACCCTG CGAGCATTGA AAGTACATTA AATGAGCTGC ACAAGTCGTT TGCATCGCTC
ACCCAAAGCG AGCAAAAGTA CGCCAAGCTC TTCTTGCACG ACTTGCAGCG CGGCGATGCG
CAGTTAATTG AAGGCCATAC TTTTAGAGAC TATATCAACA CCTACAAAGA TAATGCTGAA
AATGCGCAAT TAAACGCCGT TGTTAATGCT CTTGGTTTAG ATAAAGAACG GCTCATAGCA
TTAATGGCTG ATAGTGTTAA TGATAAAAAT CTCAATGATT TTGGTCGCTT CGACGCATTA
AAAGATTCGG TAAATAAATC GAAAGCCAAG ATCTATTTTG AAAAACAAGA CGGCGTAATC
ATACCTCCAT TTAAATTGAA TATACGTATT GATCAGTTTT TAAAGCAGTT TATTTTGGCA
CAAACGGATG ATTTCTTAAG TGATAGAGAT GTTGTTGGTG ATGTGATGGA CATCCCTCCC
TCAGCGTAA
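
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be cleaned before any analysis. The sketch below shows one way to strip the layout and compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here to keep the example short.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as displayed in the record.
first_line = "ATGACCACCT TTAAAACCGA AGCGCAATTT GAGCAGGCCT TTATTGAGGT TCTCACCAAC"

seq = clean_sequence(first_line)
print(len(seq), gc_fraction(seq))
```

Passing the complete sequence text to `clean_sequence` and then to `gc_fraction` gives a value that can be compared with the GC content listed in the Gene Information section.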
 
Protein sequence
MTTFKTEAQF EQAFIEVLTN KGWEQKILKN KTEADLLQNW ANILFENNRQ RDRLNDVPLT 
DTEMQQIIEQ IKELKTPLKL NGLINGKTVA IKRDNPADTL HMGKEVSLKI YDRQEIAAGQ
SRYQIVQQPK FERGSPLRND RRGDVLLLIN GMPVIHLELK RSGTPVSQAV NQIEKYSKEG
VFSGLFSLIQ VFVAMEPNEA KYFANPGLDG KFNPDYQFNW ADFNNEPMNH WKDIASTLLS
IPMAHQLIGF YTVADDTDGV LKVMRSYQYY AANAISDKVA KTNWQQLGSA ANNPDRLGGY
VWHTTGSGKT MTSFKSAQLI AQSKDADKVI FLMDRIELGT QSLAEYRNFA GDGEDVQATE
NTHVLITKLK SSATADTLIV SSIQKMSNIF EEVDDEGTAT NSADIEKIRA KRLVFIIDEA
HRSTMSGGKE NKEGMLVRIK KTFPKALFFG FTGTPIHDEN QVNSNTTADV FGNELHRYSI
ADGIRDGNVL GFDPYKICTF KDKGLRQAVA LEQAKADSVA DAMSTAAKKK KFNYFMNDVL
MAGHKDATGK YIKGIEDYVP KEQYLTADHQ EKVVEDILAE WDVLSQGNKF HAILATNSIA
EAIDYYRRLK AAKPELKVSA LFDPNIDNDG SGDRDPTFKG DALDEIMADY NARYGQDFDF
ARHAAFKKDL AARLAHKKPY ERIHTEPSKQ LDLLIVVDQM LTGFDSKWLN TLYLDKVIKY
QNIIQAFSRT NRLFGPDKPH GIIRYYRYPH TMEQYINDAV KLYSGDRPIG LFVDKLESNL
KAMNELVADI TELFVSAGVE NFEKLPDDIE TCAQFAKLFN TFSQHLEAAK VQGLHWEQSV
YSPTENDVEH EVTLAIDEQT YLSLVLRYKE LVAKGDGGGA GGGDVPFDIS GYLTEIDTGK
IDADYMNSRF DKFLKELNQH QDPASIESTL NELHKSFASL TQSEQKYAKL FLHDLQRGDA
QLIEGHTFRD YINTYKDNAE NAQLNAVVNA LGLDKERLIA LMADSVNDKN LNDFGRFDAL
KDSVNKSKAK IYFEKQDGVI IPPFKLNIRI DQFLKQFILA QTDDFLSDRD VVGDVMDIPP
SA
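
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own method: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the set of permitted start codons, which this sketch does not handle). The function name `translate` is hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence codon by codon; '*' marks a stop codon.

    Whitespace in the input is ignored and a trailing partial codon is dropped.
    """
    seq = "".join(raw.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First line and last line of the gene sequence as displayed in the record.
print(translate("ATGACCACCT TTAAAACCGA AGCGCAATTT GAGCAGGCCT TTATTGAGGT TCTCACCAAC"))
print(translate("TCAGCGTAA"))
```

The first call reproduces the opening 20 residues of the protein sequence (MTTFKTEAQF EQAFIEVLTN), and the second yields `SA*`, matching the final two residues followed by the stop codon.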