Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpsIP31758_0407 |
Symbol | |
ID | 5384534 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | - |
Start bp | 473406 |
End bp | 476171 |
Gene Length | 2766 bp |
Protein Length | 921 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640863376 |
Product | putative insecticidal toxin complex protein |
Protein accession | YP_001399400 |
Protein GI | 153949316 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAAT ATTTTATTCC ATCACTTACT GCCAACACCC CCAGTATTGC CGTTCACGAT AACCGGGGGT TAGCCATTCG CACACTGGCT TATAACCGCC GCAGTCATGA TGAAACCGTT GACGAACTGA GCAGCCGCAA CCGCTATAAC GCCTCCGGTC AGTTAATCGC CAGCCGCGAC CCGCGCCTTG AGGTGGATAA TTTTCGCTAT CAATACAGCC TCAGCGGTGT TCCCCTGCGT ACCGACAGCG TCGATAGCGG CAGTACGCTG CAACTGGCAG ATAGTGCTGG CCGCACGGTG CTCACGCTGG ATGCACACCA CACCCGCCGC TGGGTAGAGT ATGAGACCGG TGAGCACAGT TTAGATCGCC CACTAAGTTA CCGCGAGCAA GCCAAAGGCG GCCTGAAAAC GGTTACCGAC CGCTTTTTCT ATGGCGAAAA TAGCGAACAG GATAAAGGCT GTAATCTCAA CGGTACATGT GTGCGCCATT ACGACAGCGC CGGTTTACAG GCACTGATTA GCCAGTCGAT TATTGGTATA CCACTGCAGC AACAGCGCCG TCTGCTAACG GATACCAAAG GCCCGGTTGA CTGGTTTGGC GAAAAGGAGA ACTGGGGCAC TCGCCTGAGT GAATCGCCGT TTGTTAGCCA TAGCACCACC AATGCTCTCG GCCAGTTAAT CACGCAAACC GATGCCAAAG GCCATATCCA GCGCATGGCC TACAACCGTG CCGGGCAACT TATCGGTTCG TGGTTAACGG TAAAAAATGG CGTGGAGCAG CCCATTGTAT ACTCACTCAC CTATTCAGCC GCCGGGCAAA AACTGCGCGA AGAGAGTGGC AACGGGGTCA TTACCGAATA CCGTTATGAA CCCCAGACCC AGCGTTTAAT CGCCATTAAA ACCACCCGGC CAGCGAAGAA AGACCGCCCG ACCCTGTTAC AAGACCTGCG TTATGATTAT GACCCGGTCG GGAATATTCT CGCCATCCAT AACGACGCCG AAGCCACCCG CTTCTACCGT AATCAGAAAA TCGTGCCGGA AACCACTTAC CGCTACGATG CACTGTATCA GCTTATTGAG GCCACCGGCC GTGAAGCCGA TACCAACGGC ATACAAAACA GCCAGTTGCC CACGCTGGCG TCACTGAACG ACAGCAACCA GTTCGTCAAC TATACCCGCC GCTACCACTA TGACCGCGCC GGTAACCTGC TAAAAATTCA GCATACCGGT GCCAGCCAAT ACAGTACCCA TATCACGGTG TCCGATTCGT CCAATCACGG CATTCAGCAA CAAGATGGCA TCACTGCCCG TGATATTCGC TCCCAGTTTG ATGCGGCGGG TAATCAGCGA CAACTGCAAC CCGGTCAACC CCTGCGCTGG AACAGCCGCA ATCAGTTACA GCAGGTAGAG CCCGTGCCCC GCAACGACGG CATCAGTGAC AGCGAAAGTT ATCTCTATGA TGGCAGCGGT AGCCGGGTGG TCAAAATCAG TCTCCATAAA ACCCATAACG CCATCCAAAC CCGTTCAGTC ATTTATTTAG CGGGACTGGA ACTGCGTAGC CAATATAACG GCAATAATCT GACAGAAGAT TTTCAGGTGA TGACCGTGGG TGCCGCGGGC CGTGCTCAGG TACGGGTATT ACACTGGGAG CGCGGCCAAC CCGCTGATAT CGTCAATGAC CAACTGCGTT ACAGTTTCGA TAACCACATT GGCTCGGCGT TAATCGAATT AGACAGCGAC GGCGATATTA TCAGCCAGGA AGAATATTAC CCATTTGGCG GCACCGCGGT GTTAGCCTCC CGTAATACCA TGGAAGCTAA ATATAAAACC GTTCGTTACT CCGGTAAAGA GCGTGACACC ACCGGACTGT ATTATTACGG TTACCGTTAT TACCAGCCAT GGCTGGGCCG ATGGTTAAGC GCCGACCCCG CAGGCACTAT AGACGGGCTG AATTTATACC GAATGGTGAG GAATAACCCT ATAAAATTGG TGGACAGAGA TGGGCTTCAG CCTGATAAAA TAATAGGGCT AGAAAATTAT GCAGAATATC TAGAAGAAAC AATAGAAGAT GAAAGTGAAT TTTTAGAAAT AGCACAAGGA ATAGGCGCTG CTTTTTTAGA TGCATCATTA ATATTACCAG AAGCAATCGC CAGGTTAAGG GATGAGAGCC TAGCAGATGA AAATGAAATC CTGATAAGAA AATTTTTACC CGATCAGGAA GTTGATTTCT CACGAATAAA AGACGAGCTA CTTGATAGAT TTGACAATAT GGAAGACATG ATTAATGAAG TAATAGAAAA TAGAAATACA AAAATTATCT TTGATTTGAA AAGCAACACA AACTCAATAG CTTATGTTAA TTATAAAGAT GAATTACATA GAATAAACGT TACAAATTTA TTTATAAAAA ATGTTGGAAC AGTAAGTAAT ATCCATGCAA TATTACATGA GTTATCTCAT ATGGAGCTTC CAACAAGAAG AGTAACAAAA GATTATTATT ACATTAGCGA ATTTAGTACC GAAGAAATAT TTCCAGACAC TGAAGATTTT TTATCTATAG CACAAGAATC TTTCGAGAAT TATCATATTA TAATGAGAAC TGCACTTGAT AAAAATATGG ATGAAAGTCC CTCTTCCATG TTAACAAGAG AACTTTATGA CACATCAGAT GATATTAGTA TTGCTCTAGA GAATGCTGAT CACATTGCAA TATTAGCCTT AGCATTAGGG AGATCCGAAA TACAACACAG CCAATATGCA TTATAA
|
Protein sequence | MSQYFIPSLT ANTPSIAVHD NRGLAIRTLA YNRRSHDETV DELSSRNRYN ASGQLIASRD PRLEVDNFRY QYSLSGVPLR TDSVDSGSTL QLADSAGRTV LTLDAHHTRR WVEYETGEHS LDRPLSYREQ AKGGLKTVTD RFFYGENSEQ DKGCNLNGTC VRHYDSAGLQ ALISQSIIGI PLQQQRRLLT DTKGPVDWFG EKENWGTRLS ESPFVSHSTT NALGQLITQT DAKGHIQRMA YNRAGQLIGS WLTVKNGVEQ PIVYSLTYSA AGQKLREESG NGVITEYRYE PQTQRLIAIK TTRPAKKDRP TLLQDLRYDY DPVGNILAIH NDAEATRFYR NQKIVPETTY RYDALYQLIE ATGREADTNG IQNSQLPTLA SLNDSNQFVN YTRRYHYDRA GNLLKIQHTG ASQYSTHITV SDSSNHGIQQ QDGITARDIR SQFDAAGNQR QLQPGQPLRW NSRNQLQQVE PVPRNDGISD SESYLYDGSG SRVVKISLHK THNAIQTRSV IYLAGLELRS QYNGNNLTED FQVMTVGAAG RAQVRVLHWE RGQPADIVND QLRYSFDNHI GSALIELDSD GDIISQEEYY PFGGTAVLAS RNTMEAKYKT VRYSGKERDT TGLYYYGYRY YQPWLGRWLS ADPAGTIDGL NLYRMVRNNP IKLVDRDGLQ PDKIIGLENY AEYLEETIED ESEFLEIAQG IGAAFLDASL ILPEAIARLR DESLADENEI LIRKFLPDQE VDFSRIKDEL LDRFDNMEDM INEVIENRNT KIIFDLKSNT NSIAYVNYKD ELHRINVTNL FIKNVGTVSN IHAILHELSH MELPTRRVTK DYYYISEFST EEIFPDTEDF LSIAQESFEN YHIIMRTALD KNMDESPSSM LTRELYDTSD DISIALENAD HIAILALALG RSEIQHSQYA L
|
| |