Gene Information |
Locus tag | YpsIP31758_2416 |
Symbol | |
ID | 5385940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | + |
Start bp | 2719762 |
End bp | 2722989 |
Gene Length | 3228 bp |
Protein Length | 1075 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640865407 |
Product | putative intimin |
Protein accession | YP_001401385 |
Protein GI | 153950341 |
COG category | |
COG ID | |
TIGRFAM ID | |
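
The coordinate and length fields above agree with one another, and the arithmetic can be sketched in Python. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (which the listed Gene Length implies) and that the Protein Length excludes a single terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one stop codon is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2719762, 2722989)
print(length_bp)                  # 3228, matching Gene Length
print(protein_length(length_bp))  # 1075, matching Protein Length
```

The same check applies unchanged to a gene on the minus strand, since only the span between the two coordinates matters.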
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCTCT ATCGCATATC ATCACTACAT CAAGCCAAGC AGTTAAATAA AAACAAGCAG TTAAATAAAA CCCGAATTTC AAAATCAGTC GTCTGGGCAA ATATTGTAAT CCAAGCCATA TTTCCTTTGA GTATTGCTTT TACCCCAGCG GTAATGGCGG CTGAAACCGT CGGAGCCTCG GATGAAAAAC CGCGTTCGGC CTCACAGGCT GAACAGTCTA CCGCCAATGC AGCAACACGG TTGGCATCAA TATTGACAAA TGATGACTCT GCCAAACAAG CGAGTTCCAT TGCGCGCGGC ACCGCTGCCA ATGCAGGTAA TGAAGCATTG CAAAAGTGGT TTAATCAGTT TGGCAGTGCG AAAGTACAAC TAAATCTAGA TGAGAAATTG AGCCTTAAAG GCAGCCAACT GGATGTATTG CTGCCGCTGA CCGATAGCCC AGATCTACTC ACCTTTACTC AGTTGGGTGG CCGCTATATT GATGACCGAG TGACATTGAA CGTTGGTTTG GGTCAGCGTC ATTTCTTTGC ACAGCAAATG CTGGGCTACA ACCTGTTTGT TGATCATGAT GCCAGCTATA GCCATACCCG TATTGGCGTC GGTGCTGAAT ATGGTCGTGA TTTTATTAAT CTGGCAGCTA ACGGCTACTT TGGTGTCAGT GGTTGGAAAA ATTCGCCAGA TCTTGATAAG TATGATGAAA AAGTCGCAAA CGGCTTTGAT TTACGCAGTG AAGCTTATCT GCCAACGTTG CCACAATTGG GGGGGAAACT GATATATGAA CAATATTTTG GTGATGAAGT TGGCTTGTTT GGTGTGGATA ACCGTCAGAA AAACCCTCTT GCGGTCACTC TGGGTGTGAA TTATACCCCA ATTCCTTTAT TTACTGTTGG TGTCGACCAT AAAATGGGGC GCGCAGGAAT GAATGACACC CGGTTCAACC TTGGTTTTAA CTATGCATTT GGCACTCCTC TGGCACATCA GCTCGATTCG GATGCCGTCG CAATTAAACG TAGCTTAATG GGTAGCCGCT ATAATCTGGT CGACCGTAAT AATCAGATTG TGATGAAATA CCGTAAGCAG AATCGGGTTA CCCTAGAGCT GCCAGCACGT GTTAGTGGTG CGGCAAGACA AACAATGCCA TTAGTGGCAA ATGCCACAGC ACAACAAGGT ATTGATCGTC TTGAATGGGA AGCCAGTGCC TTAACGCTAG CGGGTGGAAA AATAACCGGT AGCGGCAATA ATTGGCAGAT AACATTGCCA AGCTATCTGT CTGGTGGTGA GGGTAATAAC ACTTATCGTA TTAGTGCTAT CGCATACGAC ACCCTTGGTA ATGCTTCTCC CGTTGCTTAC AGCGATCTCG TGGTAGATAG CCATGGTGTG AATACTAACG CCTCAGGCTT GACTGCTGCG CCAGAAATTC TGCCGGCAAA TGCGAGTGCA AGCAGTGTTA TTGAATTCAA TATTAAAGAT AATGCTAACC AGCCCATCAC GGGGATTGCC GATGAACTGG CATTTTCTCT CGAACTGGTA GAGTTACCTG AAGAATTGGC TAAGGCTAAA GCACGTTCAG TGCCATTGAA GACGGTGTCT CATACTCTAA CGAAGATTAC TGAATCTGCT CCAGGTATTT ATCAGGCAAC ACTCACATCG GGAAGTAAGC CACAACTGAT TAATATTACC GCCCAGATTA ATGGTGTGCC ATTAGCCGAT GTGCAAACCA AGGTGACGTT GATTGCCGAT CAAAGCACGG CAACGTTACA AACGAGCAGC CTGCAAATCA TCACCAATGG TAGCCTGGCA 
GATGATACTG ATGCTAACCA AATACGCGCC GTGGTGGTTG ATGCTTATGG CAATAAATTG TCTGGTGTTC AAGTCAATTT CACTGTGGGC AATAATGCCA AAATAACAGA AACCACCTTG AGCGACAAAC AAGGGGGAGT AACCGCAGCA ATCACCAGTA CCAAAGCGGG TACATATACG GTCACAGCCG AACTTAATGG GGTAACACAA CAGATCGATG TTAACTTTAT CCCAGATGCT GGCACTGCAA CACTGGATGA CAGTGATGAG TATAAATTGC AGTGGGTCAC TAATGGTCAG GTAGCCGATG GTGAAAGCAC CAATAGCGTT CAACTGACGG TAGTCGATAA GTTTGGTAAC ACCGTACCTG GTGTGGATGT CGCCTTTACC ACGGATATCG GGGCGATAAT TAGTGAAGTC ACCCCAACGG ATGCTAATGG TGTCGCAACA GCAAAAATCA TCAGTAGTCA GGCTAAAAGC CATACAGTGA AAGCAACGCT CAACCGCAAG GAACAAACCG TAGAGGTTAA CTTTATTGCC GATACTGCCA CGGCAGAAAT TACGGCTAAT AACTTTACGG TAGAAGTCGA TGGTCAAGTC GCTGGGAGCG GAACTAACCA AGTACAGGCC CTTGTTGTGG ATAAAAAGGG AAATCCTGTT GCTAATATGA CCGTGAATTT TACCGCCACT AACGGCGTGG TCGTAGAGAC AACCTCAGCC AAAACAGATG AGAATGGTAA AGTAACGACT AACCTCTCTA TGACCAATGT TGGTGGGACT ATCAGTACGG TGACGGCAAC GATGATCAAT TCAGCGAACG TGACCAGTAC ACAAGATAAA CCCGTCATCT TCTATCCAGA TTTCACTAAA GCCACGTTGA ATACGCCAGC GAATACTTAT AGTGGCTTTA ATATCAACAG TGGTTTCCCA ACAACAGGAT TTAAAAATAC TCACTTCCAA TTATCGCCAC ATGGTATTAC CGGCGCTAAC AGTGACTATG GTTGGGTAAG TAGTCATCCT AACGTGAGTG TCAGTAACAC AGGTGCAATT ACGCTTCAGG ATAATCCTGG AGGGAAAGTG ACCATTACGG CAACCTGGAA ACATGACAGC AGCAAAGTGT TCACTTATGA CTTTACGCTA AATTATTGGG TAGGCCTCTA TAGCTCGACT AATCTGAGCT GGGCGCAGGC CAATGCGTCA TGTATCAACG CCGGAATGAG ATTACCGACC AATAGCGAAG TATCGGCGGG TCAAGATGTT CGTGGTGTAG GTTCGTTATT CGGTGAGTGG GGTAACTTGA ATGCCTATCC AAGTTTCCCA ACCGCCCAGA TCATTTGGAC GTCAGTAGAT ACTAATGATT TCCATATCGA TACTGGTCTT ACTCATTCGG CTTCGAATGT CACTCTTGCT TACATGTGTA TAAAATAA
|
Protein sequence | MSLYRISSLH QAKQLNKNKQ LNKTRISKSV VWANIVIQAI FPLSIAFTPA VMAAETVGAS DEKPRSASQA EQSTANAATR LASILTNDDS AKQASSIARG TAANAGNEAL QKWFNQFGSA KVQLNLDEKL SLKGSQLDVL LPLTDSPDLL TFTQLGGRYI DDRVTLNVGL GQRHFFAQQM LGYNLFVDHD ASYSHTRIGV GAEYGRDFIN LAANGYFGVS GWKNSPDLDK YDEKVANGFD LRSEAYLPTL PQLGGKLIYE QYFGDEVGLF GVDNRQKNPL AVTLGVNYTP IPLFTVGVDH KMGRAGMNDT RFNLGFNYAF GTPLAHQLDS DAVAIKRSLM GSRYNLVDRN NQIVMKYRKQ NRVTLELPAR VSGAARQTMP LVANATAQQG IDRLEWEASA LTLAGGKITG SGNNWQITLP SYLSGGEGNN TYRISAIAYD TLGNASPVAY SDLVVDSHGV NTNASGLTAA PEILPANASA SSVIEFNIKD NANQPITGIA DELAFSLELV ELPEELAKAK ARSVPLKTVS HTLTKITESA PGIYQATLTS GSKPQLINIT AQINGVPLAD VQTKVTLIAD QSTATLQTSS LQIITNGSLA DDTDANQIRA VVVDAYGNKL SGVQVNFTVG NNAKITETTL SDKQGGVTAA ITSTKAGTYT VTAELNGVTQ QIDVNFIPDA GTATLDDSDE YKLQWVTNGQ VADGESTNSV QLTVVDKFGN TVPGVDVAFT TDIGAIISEV TPTDANGVAT AKIISSQAKS HTVKATLNRK EQTVEVNFIA DTATAEITAN NFTVEVDGQV AGSGTNQVQA LVVDKKGNPV ANMTVNFTAT NGVVVETTSA KTDENGKVTT NLSMTNVGGT ISTVTATMIN SANVTSTQDK PVIFYPDFTK ATLNTPANTY SGFNINSGFP TTGFKNTHFQ LSPHGITGAN SDYGWVSSHP NVSVSNTGAI TLQDNPGGKV TITATWKHDS SKVFTYDFTL NYWVGLYSST NLSWAQANAS CINAGMRLPT NSEVSAGQDV RGVGSLFGEW GNLNAYPSFP TAQIIWTSVD TNDFHIDTGL THSASNVTLA YMCIK
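
The relationship between the gene sequence and the protein sequence can be sketched as a plain codon-by-codon translation. This is a minimal sketch, not the pipeline that produced the record: the helper names `clean` and `translate` are hypothetical, and the codon table is written out by hand. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts; this sketch does not handle alternative start codons, which is harmless here because the gene begins with ATG.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 18 bases of the gene sequence shown above.
print(translate("ATGAGTCTCT ATCGCATATC ATCACTACAT"))  # MSLYRISSLH
print(translate("TACATGTGTA TAAAATAA"))               # YMCIK
```

The two outputs match the first ten and last five residues of the listed protein sequence. Translating the tail in isolation works only because the 18-base fragment starts on a codon boundary of the 3228 bp gene.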