Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpsIP31758_0406 |
Symbol | |
ID | 5388276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | - |
Start bp | 470313 |
End bp | 473102 |
Gene Length | 2790 bp |
Protein Length | 929 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640863375 |
Product | putative insecticidal toxin complex protein |
Protein accession | YP_001399399 |
Protein GI | 153950752 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCCAA GTTTACTAAC CCAGCTTTGC ACCAACACCC CCAGTATTGC CGTTCACGAT AACCGGGGGT TAGCCATTCG CACACTGGCT TATAACCGCC GCAGTCATAA TGAAACCGTT GACGAACTGA GCAGCCGCAA CCGCTATAAC GCCTCCGGTC AGTTAATCGC CAGCCGCGAC CCGCGCCTTG AGGTGGATAA TTTCCGCTAT CAATACAGCC TCAGCGGTGT TCCACTGCGT ACCGACAGCG TCGATAGCGG CAGTACGCTG CAACTGGCAG ATAGTGCTGG CCGCACGGTG CTCACGCGGG ATGCACACCA CACCCGCCGC TGGGTAGAGT ATGAGACCGG TGAGCACAGT TTAGATCGCC CACTAAGTTA CCGCGAGCAA GCCAAAGGCG GCCTGAAAAC GGTTACCGAC CGCTTTTTCT ATGGCGAAAA TAGCGAGCAG GATAAAAGCT GTAATCTCAA TGGTCAGTGT GTCCGCCATT ACGACAGCGC CGGTTTACAG GCACTGATTA GCCAGTCAAT TATTGATGTG CCGCTGCAAC AGCAGCGCCG TCTACTGACG GAGACTAAAG GCCCGGTTGA CTGGTTTGGC GAGCAGGAGA ACTGGGGCGC TCGCCTGAGT GAATCGCCGT TTGTTAGCCA TTGCACCACC AATGCTCTCG GCCAGTTAAT CACGCAAACC GATGCCAAAG GCCATATCCA GCGCATGGCC TATAACCGTG CCGGACAGCT TATCGGTTCG TGGTTAACGG TAAAAAATGG CATGGAGCAG CCCATTGTAT ACTCACTCAC CTATTCGGCT GCTGGGCAAA AACTGCGCGA AGAGAGCGGC AACGGGGTCA TTACCGAATA CCGTTATGAG CCGCAAACCC AGCGTTTAAT CGGCATTAAA ACCACCCGTC CGGCGAAGAA AGACCGCCCG ACCCGGTTAC AAGACCTGCG TTACGATTAT GACCCGGTCG GGAATATTCT CGCCATCCAT AACGACGCCG AAGCCACCCG CTTCTACCGT AATCAGAAAA TCGTGCCGGA AACCACTTAC CGCTACGATG CACTGTATCA GCTTATTGAG GCCACTGGCC GTGAAGCCGA TACCAACGGC ATACAAAACA GCCAGTTGCC CACGCTGGCG TCACTGAACG ACAGCAACCA GTTCGTCAAT TACACCCGCC ACTACCACTA TGACCGCGCC GGTAACCTGC TAAAAATTCA GCACACAGGA GCCAGCCAAT ACAGTACCTA TATCACGGTG TCCGATTCGT CCAATCATGG CATTCAGCAA CAAGATGGCA TCACCGCCCA TGATGTCCGC TCCCAGTTTG ATGCGGCGGG TAATCAGCGA CAACTGCAAC CCGGTCAACT CCTTCACTGG AACAGCCGCA ATCAGTTACA GCAAGTGGAG CCCGTGCCCC GCAACGACGG CATCAGTGAC AGCGAAAACT ATCTCTATGA TGGCAGCGGT AGGCGGGTGG TCAAAATCAG TCTCCATAAA ACCCATAACG CCATCCAAAC CCGTTCAGTC ATTTATTTAG CGGGACTGGA ACTGCGTAGC CAATATAATG GCAATAATCT GACAGAAGAT TTTCAGGTGA TGACCGTGGG TGCCGCGGGC CGTGCTCAGG TACGGGTATT ACACTGGGAG CGCGGCCAAC CCGCTGATAT CGTCAATGAC CAACTGCGTT ACAGTTTCGA TAACCACATT GGCTCGGCAT TAATCGAATT AGACAGCGAC GGCGATATTA TCAGCCAGGA AGAATATTAC CCATTTGGCG GCACCGCGGT GTTAGCCTCC CGTAATACCG TGGAAGCTAA ATATAAAACC GTTCGTTACT CCGGTAAAGA GCGTGACACC ACCGGATTGT ATTATTACGG TTACCGTTAT TACCAGCCGT GGCTGGGCCG CTGGTTAAGC GCTGACCCCG CAGGCACTAT AGACGGACTG AATTTATACC GAATGGTGAG GAATAACCCA GTGGGGTTGA TGGATGGGGA TGGGTTAATG ACCGATGAAC AATTAAAGGA AAATATAAAT ATGCTAAAAA AAATAGGGTT ATCAACAATA GAAGATTTAA AGCAAACACT CTCAATGTTT AATTACAGCA AGGAAGATAA TGAGAAGCTA TTTTATTTGA TGCAAGAGCA AATATTGAAT CAAAGCGTTT CAATTTCAGA TGATGAGATT ATATTTACAG AAGAAAATAG AATGTCTGAT TCTGAATATT CTGATGATGA TGATGATGAT GACACGTTAG AAAATGAAGT GGAAATAAGT ATGGAAAATT ATCGAAAAGT GAGTCATAAA CTGGCACTAA GTACGACAGG AATGGGGGAC TGCACATCGA TTGCAATATT TTCTTCAACT GAAAAAAGCC TAATGCATAT CAGTGGATCA AATTTGGAAA CACCAATGAA AATATATAAG GAACGTGAAA AACTTGCTTA TTCACACTCG ACTGCCGGAT CCGTTGTTGT TGATTTAACC GAAAAAATTG AAAATTACAA CAATAAGAAA ATAGCAATTA TATTTGGAAT TAACAATGGT TCCATGGGCT TCGAGATTTT TTTAGACCAA TATTACAAAG GAAGTAAACC ACTACTTGAC CTGCTCTATA ATTTTAAAAA AGAAAACATT AGTTTTTATA AAAATATAAA GGTTGGAATA AGTCAAGAGG GAGAAATATT TAGTGATTTA AATACAAGAA CGCGACTTAA ATCCATTTCT ATTAACGATA GAGAAGAGCT ACTTTCAATG ATATTTAATC GAGAGTATAG CGATTGTTAA
|
Protein sequence | MSPSLLTQLC TNTPSIAVHD NRGLAIRTLA YNRRSHNETV DELSSRNRYN ASGQLIASRD PRLEVDNFRY QYSLSGVPLR TDSVDSGSTL QLADSAGRTV LTRDAHHTRR WVEYETGEHS LDRPLSYREQ AKGGLKTVTD RFFYGENSEQ DKSCNLNGQC VRHYDSAGLQ ALISQSIIDV PLQQQRRLLT ETKGPVDWFG EQENWGARLS ESPFVSHCTT NALGQLITQT DAKGHIQRMA YNRAGQLIGS WLTVKNGMEQ PIVYSLTYSA AGQKLREESG NGVITEYRYE PQTQRLIGIK TTRPAKKDRP TRLQDLRYDY DPVGNILAIH NDAEATRFYR NQKIVPETTY RYDALYQLIE ATGREADTNG IQNSQLPTLA SLNDSNQFVN YTRHYHYDRA GNLLKIQHTG ASQYSTYITV SDSSNHGIQQ QDGITAHDVR SQFDAAGNQR QLQPGQLLHW NSRNQLQQVE PVPRNDGISD SENYLYDGSG RRVVKISLHK THNAIQTRSV IYLAGLELRS QYNGNNLTED FQVMTVGAAG RAQVRVLHWE RGQPADIVND QLRYSFDNHI GSALIELDSD GDIISQEEYY PFGGTAVLAS RNTVEAKYKT VRYSGKERDT TGLYYYGYRY YQPWLGRWLS ADPAGTIDGL NLYRMVRNNP VGLMDGDGLM TDEQLKENIN MLKKIGLSTI EDLKQTLSMF NYSKEDNEKL FYLMQEQILN QSVSISDDEI IFTEENRMSD SEYSDDDDDD DTLENEVEIS MENYRKVSHK LALSTTGMGD CTSIAIFSST EKSLMHISGS NLETPMKIYK EREKLAYSHS TAGSVVVDLT EKIENYNNKK IAIIFGINNG SMGFEIFLDQ YYKGSKPLLD LLYNFKKENI SFYKNIKVGI SQEGEIFSDL NTRTRLKSIS INDREELLSM IFNREYSDC
|
| |