Gene YpsIP31758_0918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_0918 
Symbol 
ID5385494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp1101637 
End bp1103697 
Gene Length2061 bp 
Protein Length686 aa 
Translation table11 
GC content51% 
IMG OID640863884 
Productbeta-galactosidase 
Protein accessionYP_001399902 
Protein GI153949001 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1874] Beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value0.995798 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAAGT TTCCGCCCTT AAGCGCGAAA GTTTCTGCGT TATTGCATGG CGCTGACTAT 
AACCCAGAGC AATGGGAAAA TTATCCTGAC ATTATTGATA AAGATATTGC CATGATGAAA
CAGGCAAAAT GCAATGTCAT GTCAGTGGGA ATATTCAGTT GGGTAAAACT TGAGCCAAGT
GAAGGAGAAT ATAATTTCTC TTGGTTAGAT GAATTAATTG AAAAACTTTA TGCTGCCGGT
ATCCATATAT TCCTGGCGAC GCCAAGTGGC GCACGTCCGG CTTGGATGTC GCAGAAATAT
CCTGAAGTTT TACGTGTCGG GCGTGATCGG GTGCCTGCGC TGCATGGTGG GCGTCATAAC
CACTGTATGA CCTCGCCGGT ATATCGCCAG AAAGTGCGGC AGATAAACCA GAAGTTAGCA
GAGCGTTACG CCCATCATCC TGCGGTTATT GGCTGGCACA TCTCTAATGA ATACGGCGGC
GAATGTCACT GTCATTCCTG TCAGCAAAAA TTCCGCCTCT GGCTTCAGGA TCGTTATCAA
ACGCTGGATA ACCTTAACGA AGCCTGGTGG AGTGCTTTCT GGAGCCACAC CTACAGCGAT
TGGTCGCAAA TTGAGTCTCC AGCACCGCAA GGTGAAGTTT CAATTCATGG TCTGAACCTT
GACTGGCGGC GTTTTAATAC CGCTCAGGTT ACGGAGTTTT GTTCTGAAGA AGCGAAGCCA
CTAAAGGCGG CAAACCCAGA GTTGCCTGTT ACGACTAACT TTATGGAGTA CTTCTACGAT
TACGACTACT GGAAGTTGGC GCAGGTGATT GATTTTATCT CCTGGGACAG CTACCCGATG
TGGCATCGTG AGAAAGATGA AACGCAGCTT GCCTGTTATA CCGCGATGTA TCATGACCTC
ATGCGTACCC TGAAACAGGG CCGTCCATTT GTCTTAATGG AATCAACGCC GAGCGCGACC
AACTGGCAAC CTACCAGTAA GTTGAAGAAG CCGGGGATGC ATATCCTCTC TTCTTTACAG
GCTGTCGCGC ACGGTGCGGA TGCGGTGCAG TATTTCCAAT GGCGTAAGAG CCGTGGCTCC
GTTGAGAAAT TCCACGGTGC GGTTGTCGAT CATGTTGGTC ACATTGATAC CCGTGTTGGG
CGGGAGGTGA GCGAACTGGG TCGTATACTC GAGGCCATGT CGCCGGTGAT GGGAAGTAAA
GTAGACGCTG ATGTTGCCAT CATTTTTGAT TGGGAAAGTC GTTGGGCTAT GGACGACGCC
GAAGGCCCAC GTAACTGTGG CTTGGAATAT GAGAAAACCG TTGCCGAGCA TTACCGTCCA
TTCTGGGAAC GGGGCATTGC GGTCGATATC ATTAATGCTG ACTGCGATCT GTCGGGTTAC
AAGCTGGTCA TTGCGCCGAT GCTCTATATG GTTCGTGAAG GGTTTGCTGA AAGGGCGACG
CGTTTTGTAG AGCAAGGTGG GCAGTTTGTC GCGACCTACT GGAGCGGTAT TGTCAACGAA
TCTGACCTGT GTCATCTCGG TGGCTTCCCT GGCCCTCTGC GGCCGCTGTT GGGTATCTGG
TCCGAAGAAA TTGACTGTCT GGCCGATGGC GAGAGCAATC AAGTCCAAGG GCTTGCAGGC
AATAAAGCCG GTCTACAGGG GCCTTATCAG GCCATCCATC TCTGCGATTT GATTCATCTT
GAAGGGGCGA CAGCCGTCGC GCGCTATAGA GATGACTTCT ATGCTGACCG TGCGGCGGTG
ACCGTAAACT TTGTTGGCGA AGGTAAAGCC TGGTATGTCG CTTCACGTAA CGATGCGGCT
TTCCAGCGTG ATTTCTTCAT GAACATTGCC GAGGAGCTAA ACTTGGCACG TGCGCTAGAC
ACCCAGTTCC CTTATGGGGT TACCGCTCAC CGTCGTACCG ATGGTGAGAG TGAATTTATC
GTGGTAGAAA ACTACAGTAA TGACAGCAAA TCGCTGGTGT TACCTGCGGT ATACCGCGAT
ATGGTCGACC AGCAGCCGGT ACAAGGTAGC CTGACGCTTG CCCCATGGGG AAGCCGAGTG
CTAACTCGCT ATCTCGAATA A
 
Protein sequence
MSKFPPLSAK VSALLHGADY NPEQWENYPD IIDKDIAMMK QAKCNVMSVG IFSWVKLEPS 
EGEYNFSWLD ELIEKLYAAG IHIFLATPSG ARPAWMSQKY PEVLRVGRDR VPALHGGRHN
HCMTSPVYRQ KVRQINQKLA ERYAHHPAVI GWHISNEYGG ECHCHSCQQK FRLWLQDRYQ
TLDNLNEAWW SAFWSHTYSD WSQIESPAPQ GEVSIHGLNL DWRRFNTAQV TEFCSEEAKP
LKAANPELPV TTNFMEYFYD YDYWKLAQVI DFISWDSYPM WHREKDETQL ACYTAMYHDL
MRTLKQGRPF VLMESTPSAT NWQPTSKLKK PGMHILSSLQ AVAHGADAVQ YFQWRKSRGS
VEKFHGAVVD HVGHIDTRVG REVSELGRIL EAMSPVMGSK VDADVAIIFD WESRWAMDDA
EGPRNCGLEY EKTVAEHYRP FWERGIAVDI INADCDLSGY KLVIAPMLYM VREGFAERAT
RFVEQGGQFV ATYWSGIVNE SDLCHLGGFP GPLRPLLGIW SEEIDCLADG ESNQVQGLAG
NKAGLQGPYQ AIHLCDLIHL EGATAVARYR DDFYADRAAV TVNFVGEGKA WYVASRNDAA
FQRDFFMNIA EELNLARALD TQFPYGVTAH RRTDGESEFI VVENYSNDSK SLVLPAVYRD
MVDQQPVQGS LTLAPWGSRV LTRYLE