Gene YpsIP31758_2416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_2416 
Symbol 
ID5385940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp2719762 
End bp2722989 
Gene Length3228 bp 
Protein Length1075 aa 
Translation table11 
GC content46% 
IMG OID640865407 
Productputative intimin 
Protein accessionYP_001401385 
Protein GI153950341 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCTCT ATCGCATATC ATCACTACAT CAAGCCAAGC AGTTAAATAA AAACAAGCAG 
TTAAATAAAA CCCGAATTTC AAAATCAGTC GTCTGGGCAA ATATTGTAAT CCAAGCCATA
TTTCCTTTGA GTATTGCTTT TACCCCAGCG GTAATGGCGG CTGAAACCGT CGGAGCCTCG
GATGAAAAAC CGCGTTCGGC CTCACAGGCT GAACAGTCTA CCGCCAATGC AGCAACACGG
TTGGCATCAA TATTGACAAA TGATGACTCT GCCAAACAAG CGAGTTCCAT TGCGCGCGGC
ACCGCTGCCA ATGCAGGTAA TGAAGCATTG CAAAAGTGGT TTAATCAGTT TGGCAGTGCG
AAAGTACAAC TAAATCTAGA TGAGAAATTG AGCCTTAAAG GCAGCCAACT GGATGTATTG
CTGCCGCTGA CCGATAGCCC AGATCTACTC ACCTTTACTC AGTTGGGTGG CCGCTATATT
GATGACCGAG TGACATTGAA CGTTGGTTTG GGTCAGCGTC ATTTCTTTGC ACAGCAAATG
CTGGGCTACA ACCTGTTTGT TGATCATGAT GCCAGCTATA GCCATACCCG TATTGGCGTC
GGTGCTGAAT ATGGTCGTGA TTTTATTAAT CTGGCAGCTA ACGGCTACTT TGGTGTCAGT
GGTTGGAAAA ATTCGCCAGA TCTTGATAAG TATGATGAAA AAGTCGCAAA CGGCTTTGAT
TTACGCAGTG AAGCTTATCT GCCAACGTTG CCACAATTGG GGGGGAAACT GATATATGAA
CAATATTTTG GTGATGAAGT TGGCTTGTTT GGTGTGGATA ACCGTCAGAA AAACCCTCTT
GCGGTCACTC TGGGTGTGAA TTATACCCCA ATTCCTTTAT TTACTGTTGG TGTCGACCAT
AAAATGGGGC GCGCAGGAAT GAATGACACC CGGTTCAACC TTGGTTTTAA CTATGCATTT
GGCACTCCTC TGGCACATCA GCTCGATTCG GATGCCGTCG CAATTAAACG TAGCTTAATG
GGTAGCCGCT ATAATCTGGT CGACCGTAAT AATCAGATTG TGATGAAATA CCGTAAGCAG
AATCGGGTTA CCCTAGAGCT GCCAGCACGT GTTAGTGGTG CGGCAAGACA AACAATGCCA
TTAGTGGCAA ATGCCACAGC ACAACAAGGT ATTGATCGTC TTGAATGGGA AGCCAGTGCC
TTAACGCTAG CGGGTGGAAA AATAACCGGT AGCGGCAATA ATTGGCAGAT AACATTGCCA
AGCTATCTGT CTGGTGGTGA GGGTAATAAC ACTTATCGTA TTAGTGCTAT CGCATACGAC
ACCCTTGGTA ATGCTTCTCC CGTTGCTTAC AGCGATCTCG TGGTAGATAG CCATGGTGTG
AATACTAACG CCTCAGGCTT GACTGCTGCG CCAGAAATTC TGCCGGCAAA TGCGAGTGCA
AGCAGTGTTA TTGAATTCAA TATTAAAGAT AATGCTAACC AGCCCATCAC GGGGATTGCC
GATGAACTGG CATTTTCTCT CGAACTGGTA GAGTTACCTG AAGAATTGGC TAAGGCTAAA
GCACGTTCAG TGCCATTGAA GACGGTGTCT CATACTCTAA CGAAGATTAC TGAATCTGCT
CCAGGTATTT ATCAGGCAAC ACTCACATCG GGAAGTAAGC CACAACTGAT TAATATTACC
GCCCAGATTA ATGGTGTGCC ATTAGCCGAT GTGCAAACCA AGGTGACGTT GATTGCCGAT
CAAAGCACGG CAACGTTACA AACGAGCAGC CTGCAAATCA TCACCAATGG TAGCCTGGCA
GATGATACTG ATGCTAACCA AATACGCGCC GTGGTGGTTG ATGCTTATGG CAATAAATTG
TCTGGTGTTC AAGTCAATTT CACTGTGGGC AATAATGCCA AAATAACAGA AACCACCTTG
AGCGACAAAC AAGGGGGAGT AACCGCAGCA ATCACCAGTA CCAAAGCGGG TACATATACG
GTCACAGCCG AACTTAATGG GGTAACACAA CAGATCGATG TTAACTTTAT CCCAGATGCT
GGCACTGCAA CACTGGATGA CAGTGATGAG TATAAATTGC AGTGGGTCAC TAATGGTCAG
GTAGCCGATG GTGAAAGCAC CAATAGCGTT CAACTGACGG TAGTCGATAA GTTTGGTAAC
ACCGTACCTG GTGTGGATGT CGCCTTTACC ACGGATATCG GGGCGATAAT TAGTGAAGTC
ACCCCAACGG ATGCTAATGG TGTCGCAACA GCAAAAATCA TCAGTAGTCA GGCTAAAAGC
CATACAGTGA AAGCAACGCT CAACCGCAAG GAACAAACCG TAGAGGTTAA CTTTATTGCC
GATACTGCCA CGGCAGAAAT TACGGCTAAT AACTTTACGG TAGAAGTCGA TGGTCAAGTC
GCTGGGAGCG GAACTAACCA AGTACAGGCC CTTGTTGTGG ATAAAAAGGG AAATCCTGTT
GCTAATATGA CCGTGAATTT TACCGCCACT AACGGCGTGG TCGTAGAGAC AACCTCAGCC
AAAACAGATG AGAATGGTAA AGTAACGACT AACCTCTCTA TGACCAATGT TGGTGGGACT
ATCAGTACGG TGACGGCAAC GATGATCAAT TCAGCGAACG TGACCAGTAC ACAAGATAAA
CCCGTCATCT TCTATCCAGA TTTCACTAAA GCCACGTTGA ATACGCCAGC GAATACTTAT
AGTGGCTTTA ATATCAACAG TGGTTTCCCA ACAACAGGAT TTAAAAATAC TCACTTCCAA
TTATCGCCAC ATGGTATTAC CGGCGCTAAC AGTGACTATG GTTGGGTAAG TAGTCATCCT
AACGTGAGTG TCAGTAACAC AGGTGCAATT ACGCTTCAGG ATAATCCTGG AGGGAAAGTG
ACCATTACGG CAACCTGGAA ACATGACAGC AGCAAAGTGT TCACTTATGA CTTTACGCTA
AATTATTGGG TAGGCCTCTA TAGCTCGACT AATCTGAGCT GGGCGCAGGC CAATGCGTCA
TGTATCAACG CCGGAATGAG ATTACCGACC AATAGCGAAG TATCGGCGGG TCAAGATGTT
CGTGGTGTAG GTTCGTTATT CGGTGAGTGG GGTAACTTGA ATGCCTATCC AAGTTTCCCA
ACCGCCCAGA TCATTTGGAC GTCAGTAGAT ACTAATGATT TCCATATCGA TACTGGTCTT
ACTCATTCGG CTTCGAATGT CACTCTTGCT TACATGTGTA TAAAATAA
 
Protein sequence
MSLYRISSLH QAKQLNKNKQ LNKTRISKSV VWANIVIQAI FPLSIAFTPA VMAAETVGAS 
DEKPRSASQA EQSTANAATR LASILTNDDS AKQASSIARG TAANAGNEAL QKWFNQFGSA
KVQLNLDEKL SLKGSQLDVL LPLTDSPDLL TFTQLGGRYI DDRVTLNVGL GQRHFFAQQM
LGYNLFVDHD ASYSHTRIGV GAEYGRDFIN LAANGYFGVS GWKNSPDLDK YDEKVANGFD
LRSEAYLPTL PQLGGKLIYE QYFGDEVGLF GVDNRQKNPL AVTLGVNYTP IPLFTVGVDH
KMGRAGMNDT RFNLGFNYAF GTPLAHQLDS DAVAIKRSLM GSRYNLVDRN NQIVMKYRKQ
NRVTLELPAR VSGAARQTMP LVANATAQQG IDRLEWEASA LTLAGGKITG SGNNWQITLP
SYLSGGEGNN TYRISAIAYD TLGNASPVAY SDLVVDSHGV NTNASGLTAA PEILPANASA
SSVIEFNIKD NANQPITGIA DELAFSLELV ELPEELAKAK ARSVPLKTVS HTLTKITESA
PGIYQATLTS GSKPQLINIT AQINGVPLAD VQTKVTLIAD QSTATLQTSS LQIITNGSLA
DDTDANQIRA VVVDAYGNKL SGVQVNFTVG NNAKITETTL SDKQGGVTAA ITSTKAGTYT
VTAELNGVTQ QIDVNFIPDA GTATLDDSDE YKLQWVTNGQ VADGESTNSV QLTVVDKFGN
TVPGVDVAFT TDIGAIISEV TPTDANGVAT AKIISSQAKS HTVKATLNRK EQTVEVNFIA
DTATAEITAN NFTVEVDGQV AGSGTNQVQA LVVDKKGNPV ANMTVNFTAT NGVVVETTSA
KTDENGKVTT NLSMTNVGGT ISTVTATMIN SANVTSTQDK PVIFYPDFTK ATLNTPANTY
SGFNINSGFP TTGFKNTHFQ LSPHGITGAN SDYGWVSSHP NVSVSNTGAI TLQDNPGGKV
TITATWKHDS SKVFTYDFTL NYWVGLYSST NLSWAQANAS CINAGMRLPT NSEVSAGQDV
RGVGSLFGEW GNLNAYPSFP TAQIIWTSVD TNDFHIDTGL THSASNVTLA YMCIK