Gene YpsIP31758_0407 details


Gene Information

Locus tag: YpsIP31758_0407
Symbol:
ID: 5384534
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 473406
End bp: 476171
Gene Length: 2766 bp
Protein Length: 921 aa
Translation table: 11
GC content: 47%
IMG OID: 640863376
Product: putative insecticidal toxin complex protein
Protein accession: YP_001399400
Protein GI: 153949316
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID:
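
The length fields in the Gene Information table agree with the coordinates under two conventions that this record does not state explicitly: coordinates are assumed to be 1-based and inclusive, and the protein length is assumed to exclude the terminal stop codon. A minimal Python check under those assumptions (variable names are illustrative only):

```python
# Values copied from the Gene Information table.
start_bp = 473406
end_bp = 476171

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2766 921
```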


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCAAT ATTTTATTCC ATCACTTACT GCCAACACCC CCAGTATTGC CGTTCACGAT 
AACCGGGGGT TAGCCATTCG CACACTGGCT TATAACCGCC GCAGTCATGA TGAAACCGTT
GACGAACTGA GCAGCCGCAA CCGCTATAAC GCCTCCGGTC AGTTAATCGC CAGCCGCGAC
CCGCGCCTTG AGGTGGATAA TTTTCGCTAT CAATACAGCC TCAGCGGTGT TCCCCTGCGT
ACCGACAGCG TCGATAGCGG CAGTACGCTG CAACTGGCAG ATAGTGCTGG CCGCACGGTG
CTCACGCTGG ATGCACACCA CACCCGCCGC TGGGTAGAGT ATGAGACCGG TGAGCACAGT
TTAGATCGCC CACTAAGTTA CCGCGAGCAA GCCAAAGGCG GCCTGAAAAC GGTTACCGAC
CGCTTTTTCT ATGGCGAAAA TAGCGAACAG GATAAAGGCT GTAATCTCAA CGGTACATGT
GTGCGCCATT ACGACAGCGC CGGTTTACAG GCACTGATTA GCCAGTCGAT TATTGGTATA
CCACTGCAGC AACAGCGCCG TCTGCTAACG GATACCAAAG GCCCGGTTGA CTGGTTTGGC
GAAAAGGAGA ACTGGGGCAC TCGCCTGAGT GAATCGCCGT TTGTTAGCCA TAGCACCACC
AATGCTCTCG GCCAGTTAAT CACGCAAACC GATGCCAAAG GCCATATCCA GCGCATGGCC
TACAACCGTG CCGGGCAACT TATCGGTTCG TGGTTAACGG TAAAAAATGG CGTGGAGCAG
CCCATTGTAT ACTCACTCAC CTATTCAGCC GCCGGGCAAA AACTGCGCGA AGAGAGTGGC
AACGGGGTCA TTACCGAATA CCGTTATGAA CCCCAGACCC AGCGTTTAAT CGCCATTAAA
ACCACCCGGC CAGCGAAGAA AGACCGCCCG ACCCTGTTAC AAGACCTGCG TTATGATTAT
GACCCGGTCG GGAATATTCT CGCCATCCAT AACGACGCCG AAGCCACCCG CTTCTACCGT
AATCAGAAAA TCGTGCCGGA AACCACTTAC CGCTACGATG CACTGTATCA GCTTATTGAG
GCCACCGGCC GTGAAGCCGA TACCAACGGC ATACAAAACA GCCAGTTGCC CACGCTGGCG
TCACTGAACG ACAGCAACCA GTTCGTCAAC TATACCCGCC GCTACCACTA TGACCGCGCC
GGTAACCTGC TAAAAATTCA GCATACCGGT GCCAGCCAAT ACAGTACCCA TATCACGGTG
TCCGATTCGT CCAATCACGG CATTCAGCAA CAAGATGGCA TCACTGCCCG TGATATTCGC
TCCCAGTTTG ATGCGGCGGG TAATCAGCGA CAACTGCAAC CCGGTCAACC CCTGCGCTGG
AACAGCCGCA ATCAGTTACA GCAGGTAGAG CCCGTGCCCC GCAACGACGG CATCAGTGAC
AGCGAAAGTT ATCTCTATGA TGGCAGCGGT AGCCGGGTGG TCAAAATCAG TCTCCATAAA
ACCCATAACG CCATCCAAAC CCGTTCAGTC ATTTATTTAG CGGGACTGGA ACTGCGTAGC
CAATATAACG GCAATAATCT GACAGAAGAT TTTCAGGTGA TGACCGTGGG TGCCGCGGGC
CGTGCTCAGG TACGGGTATT ACACTGGGAG CGCGGCCAAC CCGCTGATAT CGTCAATGAC
CAACTGCGTT ACAGTTTCGA TAACCACATT GGCTCGGCGT TAATCGAATT AGACAGCGAC
GGCGATATTA TCAGCCAGGA AGAATATTAC CCATTTGGCG GCACCGCGGT GTTAGCCTCC
CGTAATACCA TGGAAGCTAA ATATAAAACC GTTCGTTACT CCGGTAAAGA GCGTGACACC
ACCGGACTGT ATTATTACGG TTACCGTTAT TACCAGCCAT GGCTGGGCCG ATGGTTAAGC
GCCGACCCCG CAGGCACTAT AGACGGGCTG AATTTATACC GAATGGTGAG GAATAACCCT
ATAAAATTGG TGGACAGAGA TGGGCTTCAG CCTGATAAAA TAATAGGGCT AGAAAATTAT
GCAGAATATC TAGAAGAAAC AATAGAAGAT GAAAGTGAAT TTTTAGAAAT AGCACAAGGA
ATAGGCGCTG CTTTTTTAGA TGCATCATTA ATATTACCAG AAGCAATCGC CAGGTTAAGG
GATGAGAGCC TAGCAGATGA AAATGAAATC CTGATAAGAA AATTTTTACC CGATCAGGAA
GTTGATTTCT CACGAATAAA AGACGAGCTA CTTGATAGAT TTGACAATAT GGAAGACATG
ATTAATGAAG TAATAGAAAA TAGAAATACA AAAATTATCT TTGATTTGAA AAGCAACACA
AACTCAATAG CTTATGTTAA TTATAAAGAT GAATTACATA GAATAAACGT TACAAATTTA
TTTATAAAAA ATGTTGGAAC AGTAAGTAAT ATCCATGCAA TATTACATGA GTTATCTCAT
ATGGAGCTTC CAACAAGAAG AGTAACAAAA GATTATTATT ACATTAGCGA ATTTAGTACC
GAAGAAATAT TTCCAGACAC TGAAGATTTT TTATCTATAG CACAAGAATC TTTCGAGAAT
TATCATATTA TAATGAGAAC TGCACTTGAT AAAAATATGG ATGAAAGTCC CTCTTCCATG
TTAACAAGAG AACTTTATGA CACATCAGAT GATATTAGTA TTGCTCTAGA GAATGCTGAT
CACATTGCAA TATTAGCCTT AGCATTAGGG AGATCCGAAA TACAACACAG CCAATATGCA
TTATAA
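
A GC content figure like the one reported for this gene can be recomputed from the block-formatted sequence. The sketch below is one plausible way to do that, not necessarily how the database derived its value. The function name `gc_percent` is hypothetical, and the input is assumed to contain only A, C, G and T separated by whitespace.

```python
def gc_percent(blocks: str) -> float:
    """Return the G+C percentage of a sequence written in spaced blocks."""
    # Drop the spaces and line breaks used for 10-base block formatting.
    seq = "".join(blocks.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above; paste the full sequence to check the whole gene.
first_line = "ATGAGCCAAT ATTTTATTCC ATCACTTACT GCCAACACCC CCAGTATTGC CGTTCACGAT"
print(round(gc_percent(first_line), 1))
```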
 
Protein sequence
MSQYFIPSLT ANTPSIAVHD NRGLAIRTLA YNRRSHDETV DELSSRNRYN ASGQLIASRD 
PRLEVDNFRY QYSLSGVPLR TDSVDSGSTL QLADSAGRTV LTLDAHHTRR WVEYETGEHS
LDRPLSYREQ AKGGLKTVTD RFFYGENSEQ DKGCNLNGTC VRHYDSAGLQ ALISQSIIGI
PLQQQRRLLT DTKGPVDWFG EKENWGTRLS ESPFVSHSTT NALGQLITQT DAKGHIQRMA
YNRAGQLIGS WLTVKNGVEQ PIVYSLTYSA AGQKLREESG NGVITEYRYE PQTQRLIAIK
TTRPAKKDRP TLLQDLRYDY DPVGNILAIH NDAEATRFYR NQKIVPETTY RYDALYQLIE
ATGREADTNG IQNSQLPTLA SLNDSNQFVN YTRRYHYDRA GNLLKIQHTG ASQYSTHITV
SDSSNHGIQQ QDGITARDIR SQFDAAGNQR QLQPGQPLRW NSRNQLQQVE PVPRNDGISD
SESYLYDGSG SRVVKISLHK THNAIQTRSV IYLAGLELRS QYNGNNLTED FQVMTVGAAG
RAQVRVLHWE RGQPADIVND QLRYSFDNHI GSALIELDSD GDIISQEEYY PFGGTAVLAS
RNTMEAKYKT VRYSGKERDT TGLYYYGYRY YQPWLGRWLS ADPAGTIDGL NLYRMVRNNP
IKLVDRDGLQ PDKIIGLENY AEYLEETIED ESEFLEIAQG IGAAFLDASL ILPEAIARLR
DESLADENEI LIRKFLPDQE VDFSRIKDEL LDRFDNMEDM INEVIENRNT KIIFDLKSNT
NSIAYVNYKD ELHRINVTNL FIKNVGTVSN IHAILHELSH MELPTRRVTK DYYYISEFST
EEIFPDTEDF LSIAQESFEN YHIIMRTALD KNMDESPSSM LTRELYDTSD DISIALENAD
HIAILALALG RSEIQHSQYA L
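
The protein sequence corresponds to the gene sequence translated with translation table 11, as listed in the gene information. As a minimal sketch, the Python below builds the codon table from the NCBI amino-acid string for that table and translates the first three blocks of the gene. The helper name `translate` is hypothetical, and the sketch ignores alternative start codons, which do not matter here because the gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocks: str) -> str:
    """Translate a block-formatted DNA sequence; '*' marks a stop codon."""
    seq = "".join(blocks.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


print(translate("ATGAGCCAAT ATTTTATTCC ATCACTTACT"))  # MSQYFIPSLT
```

The output matches the first block of the protein sequence. The final six bases of the gene, TTATAA, translate to `L*`, which is consistent with the protein ending in L followed by a stop codon.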