Gene YpsIP31758_0409 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_0409 
Symbol 
ID5385735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp478956 
End bp481832 
Gene Length2877 bp 
Protein Length958 aa 
Translation table11 
GC content48% 
IMG OID640863378 
Productputative insecticial toxin complex protein 
Protein accessionYP_001399402 
Protein GI153950831 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTAATA CGTCACCCAC TGATCTGTGC ACCAACACCC CCAGTATCAC CATCCACGAT 
AACCGGGGGG GCGCTATCAT CGCACTGGCC TATAATCGTA CCGATACCCA CCAAACACCC
GAACAACTCA TTAACTGCAA CCGCTATAAC GCCTCCGGTC AGTTAATCGC CAGCCGCGAC
CCGCGCCTTG AGGTGGATAA TTTTCGCTAT CAATACAGCC TCAGCGGTGT TCCCCTGCGT
ACCGACAGCG TCGATAGCGG CAGTACGCTG CAACTGGCAG ATAGTGCTGG CCGCACGGTG
CTCACGCGGG ATGCACACCA CACCCGCCGC TGGGTAGAGT ATGAGACCGG TGAGCACAGT
TTAGGCCGCC CGCTAAGTTA CCGCGAGCAA GCCAAAGGCG GCCTGAAAAC GGTTACCGAC
CGCTTTTTCT ATGGCGAAAA TAGCGAGCAG GATAAAGGCT GCAATCTGAA CGGCCAGTGT
GTCCGCCATT ACGACAGCGC CGGTTTACAG GCACTGATTA GCCAGTCAAT TATTGATGTG
CCGCTGCAAC AGCAGCGCCG TCTACTGACG GAGACTAAAG GCCCGGTTGA CTGGTTTGGC
GAAAAGGAGA ACTGGGGCGT TCGCCTGAGT GAATCGCCGT TTGTTAGCCA TTGCACCACC
AATGCTCTCG GCCAGTTAAT CACGCAAACC GATGCCAAAG GCCATATCCA GCGCATGGCC
TATAACCGTG CCGGACAACT TATCGGTTCG TGGCTAACGG TAAAAAATGG CGTGGAGCAG
CCCATTGTAT ACTCACTCAC CTATTCGGCT GCCGGGCAAA AACTGCGCGA AGAGAGCGGC
AACGGGGTCA TTACCGAATA CCGTTATGAA CCCCAGACTC AGCGCTTAAT CGGCATTAAA
ACCACCCGCC CGGCGAAGAA AGACCGCCCG ACCCGGTTAC AAGACCTGCG TTACGATTAT
GACCCAGTCG GGAATATTCT CGCCATCCAT AATGACGCCG AAGCCACCCG CTTCTACCGT
AATCAGAAAA TCGTGCCGGA AACCACTTAC CGCTACGATG CACTGTACCA GCTTATCGAG
GCCACTGGCC GTGAAGCCGA TACCAACGGC ATACAAAACA GCCAGTTGCC CACGCTGGCG
TCACTGAACG ACAGCAACCA GTTCGTCAAT TACACCCGCC ACTACCACTA TGACCGCGCC
GGTAACCTGC TAAAAATTCA GCACACAGGA GCCAGCCAAT ACAGTACCTA TATCACGGTG
TCCGATTCGT CCAATCATGG CATTCAGCAA CAAGATGGCA TCACCGCCCA TGATGTCCGC
TCCCAGTTTG ATGCGGCGGG TAATCAGCGA CAACTGCAAC CCGGTCAACT CCTTCACTGG
AACAGCCGCA ATCAGTTACA GCAAGTGGAG CCCGTGCCCC GCAACGACGG CATCAGTGAC
AGCGAAAGTT ATCTCTATGA TGGCAGCGGT AGGCGGGTGG TCAAAATCAG TCTCCATAAA
ACCCATAACG CCATCCAAAC CCGTTCAGTC ATTTATTTAG CGGGACTGGA ACTGCGTAGC
CAATATAATG GCAATAATCT GACAGAAGAT TTTCAGGTGA TGACCGTGGG TGCCGCGGGC
CGTGCTCAGG TACGGGTATT ACACTGGGAG CGCGGCCAAC CCGCTGATAT CGTCAATGAC
CAACTGCGTT ACAGTTTCGA TAACCACATT GGCTCGGCGT TAATCGAATT AGACAGCGAC
GGCGATATTA TCAGCCAGGA AGAATATTAC CCATTTGGTG GCACCGCGGT TTTGGTCTCC
CGTAATACCG TGGAAGCCAA ATATAAAACC GTTCGTTATT CCGGTAAAGA GCGCGATGCC
ACCGGGCTGT ATTATTACGG TTACCGTTAT TACCAGCCGT GGCTGGGCCG ATGGTTAAGC
GCTGACCCCG CAGGCACTAT AGACGGACTG AATTTATATC GGATGGTGAG GAATAACCCG
GTGGGGTTGA TGGATGGGGA TGGGTTAATG ACCGATAAGC TTCTTGCTAA ACATGAAGCA
AATTTTGCAA AAAAAAACAT ATCATCTATG GCTGAATTAA AATCAGAAAT AGAAAAACTC
GGTCTGCTCC CTGCTGATAG TAAGCAATTA TTCTTGCATT TAAATGGTGG TGAATCTGAC
GATGAACCCT CTGATTCTTC CGGTTCTTCC GGTTCTTCTG GTTCTTCTGG TTCTTCTGGT
TCTTCTGGTT CTTCTGGTTC TTCTGAAATA TTAGAGAATA CTTCACCCCA CCAAATAAAA
AACTTCCATT TCATTAGCGA AATAAATTTA GCCACCATGC CCAGACCTTA TTATAAAGAC
TTTTCCAGCA CAGAGGATAT GTTGGAGTCT GCAGAGAGAC TTAAAGCATA TGGTTCAATA
GATACGCTGT TGACACTAGA TTTAACCAGT GAAGATATTC CTGAATTTAC GTCAATATTA
GCCGATAAAG GAATAAATTA TATAGCAGAA AAACAATATG AAATAATTGA TTACTTTAGC
GAAGATGAAT TACCCTCAGA AAATATAGAC CGCATCGTCA ACATGATAAA AACAATTCAA
AATAACAACC ACAAAGTAGG TATACATTGC GCAGCAGGAA ATGGCCGCAG TGGACTTATT
GCCACCGCCA TGATAATAAA CAAAAGATAT ACACAAAGTC GTATAAATAG CTTTGAAGAA
ACAAATAAAT TAAAGGAAAT AATTGATAAA AATAAAAATG CAATAAATGT TGACGATATT
ACTTATGATG CAATGAAATT AGTAAGAAAA ACAAACCCCT TTGCCGGAGA ACGAACAACT
GACATAAAAG CAGCACGTGA GTACTCACGT TATTTATATA GCAAACAGAA TCGATAA
 
Protein sequence
MPNTSPTDLC TNTPSITIHD NRGGAIIALA YNRTDTHQTP EQLINCNRYN ASGQLIASRD 
PRLEVDNFRY QYSLSGVPLR TDSVDSGSTL QLADSAGRTV LTRDAHHTRR WVEYETGEHS
LGRPLSYREQ AKGGLKTVTD RFFYGENSEQ DKGCNLNGQC VRHYDSAGLQ ALISQSIIDV
PLQQQRRLLT ETKGPVDWFG EKENWGVRLS ESPFVSHCTT NALGQLITQT DAKGHIQRMA
YNRAGQLIGS WLTVKNGVEQ PIVYSLTYSA AGQKLREESG NGVITEYRYE PQTQRLIGIK
TTRPAKKDRP TRLQDLRYDY DPVGNILAIH NDAEATRFYR NQKIVPETTY RYDALYQLIE
ATGREADTNG IQNSQLPTLA SLNDSNQFVN YTRHYHYDRA GNLLKIQHTG ASQYSTYITV
SDSSNHGIQQ QDGITAHDVR SQFDAAGNQR QLQPGQLLHW NSRNQLQQVE PVPRNDGISD
SESYLYDGSG RRVVKISLHK THNAIQTRSV IYLAGLELRS QYNGNNLTED FQVMTVGAAG
RAQVRVLHWE RGQPADIVND QLRYSFDNHI GSALIELDSD GDIISQEEYY PFGGTAVLVS
RNTVEAKYKT VRYSGKERDA TGLYYYGYRY YQPWLGRWLS ADPAGTIDGL NLYRMVRNNP
VGLMDGDGLM TDKLLAKHEA NFAKKNISSM AELKSEIEKL GLLPADSKQL FLHLNGGESD
DEPSDSSGSS GSSGSSGSSG SSGSSGSSEI LENTSPHQIK NFHFISEINL ATMPRPYYKD
FSSTEDMLES AERLKAYGSI DTLLTLDLTS EDIPEFTSIL ADKGINYIAE KQYEIIDYFS
EDELPSENID RIVNMIKTIQ NNNHKVGIHC AAGNGRSGLI ATAMIINKRY TQSRINSFEE
TNKLKEIIDK NKNAINVDDI TYDAMKLVRK TNPFAGERTT DIKAAREYSR YLYSKQNR