Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpsIP31758_0409 |
Symbol | |
ID | 5385735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | - |
Start bp | 478956 |
End bp | 481832 |
Gene Length | 2877 bp |
Protein Length | 958 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640863378 |
Product | putative insecticial toxin complex protein |
Protein accession | YP_001399402 |
Protein GI | 153950831 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTAATA CGTCACCCAC TGATCTGTGC ACCAACACCC CCAGTATCAC CATCCACGAT AACCGGGGGG GCGCTATCAT CGCACTGGCC TATAATCGTA CCGATACCCA CCAAACACCC GAACAACTCA TTAACTGCAA CCGCTATAAC GCCTCCGGTC AGTTAATCGC CAGCCGCGAC CCGCGCCTTG AGGTGGATAA TTTTCGCTAT CAATACAGCC TCAGCGGTGT TCCCCTGCGT ACCGACAGCG TCGATAGCGG CAGTACGCTG CAACTGGCAG ATAGTGCTGG CCGCACGGTG CTCACGCGGG ATGCACACCA CACCCGCCGC TGGGTAGAGT ATGAGACCGG TGAGCACAGT TTAGGCCGCC CGCTAAGTTA CCGCGAGCAA GCCAAAGGCG GCCTGAAAAC GGTTACCGAC CGCTTTTTCT ATGGCGAAAA TAGCGAGCAG GATAAAGGCT GCAATCTGAA CGGCCAGTGT GTCCGCCATT ACGACAGCGC CGGTTTACAG GCACTGATTA GCCAGTCAAT TATTGATGTG CCGCTGCAAC AGCAGCGCCG TCTACTGACG GAGACTAAAG GCCCGGTTGA CTGGTTTGGC GAAAAGGAGA ACTGGGGCGT TCGCCTGAGT GAATCGCCGT TTGTTAGCCA TTGCACCACC AATGCTCTCG GCCAGTTAAT CACGCAAACC GATGCCAAAG GCCATATCCA GCGCATGGCC TATAACCGTG CCGGACAACT TATCGGTTCG TGGCTAACGG TAAAAAATGG CGTGGAGCAG CCCATTGTAT ACTCACTCAC CTATTCGGCT GCCGGGCAAA AACTGCGCGA AGAGAGCGGC AACGGGGTCA TTACCGAATA CCGTTATGAA CCCCAGACTC AGCGCTTAAT CGGCATTAAA ACCACCCGCC CGGCGAAGAA AGACCGCCCG ACCCGGTTAC AAGACCTGCG TTACGATTAT GACCCAGTCG GGAATATTCT CGCCATCCAT AATGACGCCG AAGCCACCCG CTTCTACCGT AATCAGAAAA TCGTGCCGGA AACCACTTAC CGCTACGATG CACTGTACCA GCTTATCGAG GCCACTGGCC GTGAAGCCGA TACCAACGGC ATACAAAACA GCCAGTTGCC CACGCTGGCG TCACTGAACG ACAGCAACCA GTTCGTCAAT TACACCCGCC ACTACCACTA TGACCGCGCC GGTAACCTGC TAAAAATTCA GCACACAGGA GCCAGCCAAT ACAGTACCTA TATCACGGTG TCCGATTCGT CCAATCATGG CATTCAGCAA CAAGATGGCA TCACCGCCCA TGATGTCCGC TCCCAGTTTG ATGCGGCGGG TAATCAGCGA CAACTGCAAC CCGGTCAACT CCTTCACTGG AACAGCCGCA ATCAGTTACA GCAAGTGGAG CCCGTGCCCC GCAACGACGG CATCAGTGAC AGCGAAAGTT ATCTCTATGA TGGCAGCGGT AGGCGGGTGG TCAAAATCAG TCTCCATAAA ACCCATAACG CCATCCAAAC CCGTTCAGTC ATTTATTTAG CGGGACTGGA ACTGCGTAGC CAATATAATG GCAATAATCT GACAGAAGAT TTTCAGGTGA TGACCGTGGG TGCCGCGGGC CGTGCTCAGG TACGGGTATT ACACTGGGAG CGCGGCCAAC CCGCTGATAT CGTCAATGAC CAACTGCGTT ACAGTTTCGA TAACCACATT GGCTCGGCGT TAATCGAATT AGACAGCGAC GGCGATATTA TCAGCCAGGA AGAATATTAC CCATTTGGTG GCACCGCGGT TTTGGTCTCC CGTAATACCG TGGAAGCCAA ATATAAAACC GTTCGTTATT CCGGTAAAGA GCGCGATGCC ACCGGGCTGT ATTATTACGG TTACCGTTAT TACCAGCCGT GGCTGGGCCG ATGGTTAAGC GCTGACCCCG CAGGCACTAT AGACGGACTG AATTTATATC GGATGGTGAG GAATAACCCG GTGGGGTTGA TGGATGGGGA TGGGTTAATG ACCGATAAGC TTCTTGCTAA ACATGAAGCA AATTTTGCAA AAAAAAACAT ATCATCTATG GCTGAATTAA AATCAGAAAT AGAAAAACTC GGTCTGCTCC CTGCTGATAG TAAGCAATTA TTCTTGCATT TAAATGGTGG TGAATCTGAC GATGAACCCT CTGATTCTTC CGGTTCTTCC GGTTCTTCTG GTTCTTCTGG TTCTTCTGGT TCTTCTGGTT CTTCTGGTTC TTCTGAAATA TTAGAGAATA CTTCACCCCA CCAAATAAAA AACTTCCATT TCATTAGCGA AATAAATTTA GCCACCATGC CCAGACCTTA TTATAAAGAC TTTTCCAGCA CAGAGGATAT GTTGGAGTCT GCAGAGAGAC TTAAAGCATA TGGTTCAATA GATACGCTGT TGACACTAGA TTTAACCAGT GAAGATATTC CTGAATTTAC GTCAATATTA GCCGATAAAG GAATAAATTA TATAGCAGAA AAACAATATG AAATAATTGA TTACTTTAGC GAAGATGAAT TACCCTCAGA AAATATAGAC CGCATCGTCA ACATGATAAA AACAATTCAA AATAACAACC ACAAAGTAGG TATACATTGC GCAGCAGGAA ATGGCCGCAG TGGACTTATT GCCACCGCCA TGATAATAAA CAAAAGATAT ACACAAAGTC GTATAAATAG CTTTGAAGAA ACAAATAAAT TAAAGGAAAT AATTGATAAA AATAAAAATG CAATAAATGT TGACGATATT ACTTATGATG CAATGAAATT AGTAAGAAAA ACAAACCCCT TTGCCGGAGA ACGAACAACT GACATAAAAG CAGCACGTGA GTACTCACGT TATTTATATA GCAAACAGAA TCGATAA
|
Protein sequence | MPNTSPTDLC TNTPSITIHD NRGGAIIALA YNRTDTHQTP EQLINCNRYN ASGQLIASRD PRLEVDNFRY QYSLSGVPLR TDSVDSGSTL QLADSAGRTV LTRDAHHTRR WVEYETGEHS LGRPLSYREQ AKGGLKTVTD RFFYGENSEQ DKGCNLNGQC VRHYDSAGLQ ALISQSIIDV PLQQQRRLLT ETKGPVDWFG EKENWGVRLS ESPFVSHCTT NALGQLITQT DAKGHIQRMA YNRAGQLIGS WLTVKNGVEQ PIVYSLTYSA AGQKLREESG NGVITEYRYE PQTQRLIGIK TTRPAKKDRP TRLQDLRYDY DPVGNILAIH NDAEATRFYR NQKIVPETTY RYDALYQLIE ATGREADTNG IQNSQLPTLA SLNDSNQFVN YTRHYHYDRA GNLLKIQHTG ASQYSTYITV SDSSNHGIQQ QDGITAHDVR SQFDAAGNQR QLQPGQLLHW NSRNQLQQVE PVPRNDGISD SESYLYDGSG RRVVKISLHK THNAIQTRSV IYLAGLELRS QYNGNNLTED FQVMTVGAAG RAQVRVLHWE RGQPADIVND QLRYSFDNHI GSALIELDSD GDIISQEEYY PFGGTAVLVS RNTVEAKYKT VRYSGKERDA TGLYYYGYRY YQPWLGRWLS ADPAGTIDGL NLYRMVRNNP VGLMDGDGLM TDKLLAKHEA NFAKKNISSM AELKSEIEKL GLLPADSKQL FLHLNGGESD DEPSDSSGSS GSSGSSGSSG SSGSSGSSEI LENTSPHQIK NFHFISEINL ATMPRPYYKD FSSTEDMLES AERLKAYGSI DTLLTLDLTS EDIPEFTSIL ADKGINYIAE KQYEIIDYFS EDELPSENID RIVNMIKTIQ NNNHKVGIHC AAGNGRSGLI ATAMIINKRY TQSRINSFEE TNKLKEIIDK NKNAINVDDI TYDAMKLVRK TNPFAGERTT DIKAAREYSR YLYSKQNR
|
| |