Gene YpsIP31758_1629 details


Gene Information

Locus tag: YpsIP31758_1629
Symbol: lacZ
ID: 5387098
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 1887718
End bp: 1890918
Gene Length: 3201 bp
Protein Length: 1066 aa
Translation table: 11
GC content: 50%
IMG OID: 640864610
Product: beta-D-galactosidase
Protein accession: YP_001400606
Protein GI: 153949016
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
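
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1887718
END_BP = 1890918
GENE_LENGTH_BP = 3201
PROTEIN_LENGTH_AA = 1066


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1890918 - 1887718 + 1 gives 3201 bp, and 3201 / 3 - 1 gives 1066 aa, matching the reported values.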


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.893337
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGTCAC AGGAAAAGGT ACCATTCCAG GTGCAACTGA GTCTTCCCCA GATTTTGTCC 
CGCCGAGATT GGGAAAACCC GCAGATCACA CAGTATCATC GCCTGGAGGC CCACCCGCCT
TTTCACAGTT GGCGTGATGT AGAATCTGCC CAGAAGGATC GTCCTTCACC ACAGCAACAA
ACACTCAATG GGCTATGGTC ATTCAGCTAT TTCACACAAC CTGAAGCGGT ACCCGAGCAC
TGGGTGAGGT GCGATTTAGC CGAGGCAAAG CCGCTCCCTG TACCGGCTAA CTGGCAACTT
CATGGTTATG ACGCACCAAT TTACACCAAT ATACAATACC CTATTCCCGT CAACCCACCA
CGGGTCCCGG ATCTAAATCC AACGGGTTGC TATTCCCGTG ATTTCACGTT AGAGCCAAGC
TGGTTGGCAT CGGGTAAGAC TCGCATTATT TTTGACGGTG TCAGTTCTGC ATTTTATCTG
TGGTGTAATG GGCAATGGGT AGGTTATTCA CAAGACAGCC GCCTACCTGC TGAGTTCGAT
CTCACCCCCT ATTTGCAGGC TGGCAGTAAC CGTATCGCAG TTTTAGTTCT GCGCTGGAGT
GATGGGAGTT ATCTTGAAGA TCAAGATATG TGGCGTATGA GCGGAATTTT TCGTGATGTG
AAATTGTTGC ATAAACCCGA GATTCACTTA CGGGATATCC ACATCATGAC GCATCTATCC
CCTGAATTCA CCTCTGCAAA TTTAGAGGTA ATGGCGGCCG TCAATATCCC CTCTCTACAG
CTCAATGATC CGCAGGTGAC CGGATCCTAT CAGCTCCGTG TACAACTTTG GTTAGCCGAT
AAATTGGTCG CCAGTTTACA ACAGCCTTTA GGCACCCAAG CCATTGATGA ACGAGGTCCT
TATACTGATC GTACCCAGCT AGTATTGCGA ATAGATCAGC CTCTGCTCTG GAGTGCCGAG
CAGCCGACGC TATACCGAGC CGTGGTTTCC TTGCTCAATC ATCAGCAAGA ATTGATTGAG
GCCGAAGCCT ATGACGTGGG TTTCAGGCAA GTGGCAATCC ATCAAGGCTT GCTTAAAATC
AATGGCAAAG CGGTGCTGAT CAGAGGGGTG AATCGACATG AACATCACCC GCAAACAGGT
CAGGCCATTG ATGAAGAGAG TCTGTTGCAA GACATTTTAT TAATGAAACA GCATAATTTT
AATGCTGTGC GCTGCTCCCA CTATCCCAAT CATCCTTTAT GGTACCGCCT TTGTGACCGC
TATGGTTTGT ATGTGGTTGA TGAAGCGAAT ATTGAGACAC ACGGTATGCA GCCCATGAGC
AGGCTGTCCG ATGACCCAAG CTGGTTTTCA GCTTTCAGTG AACGGGTGAC GCGGATGGTT
CAGCGAGATC GCAACCATCC ATGCATTATT ATCTGGTCAC TGGGCAATGA ATCAGGCCAT
GGCGCAACCC ATGATGCCCT CTATCGTTGG ATAAAAACCA ATGACCCCAC CCGCCCTGTG
CAATATGAAG GGGGCGGTGC CAACACCTTA GCGACCGACA TTCTGTGTCC GATGTATGCC
CGTGTTGATG AAGACCAGCC CTTTCCTGCC GTCCCCAAGT GGTCAATCAA AAAATGGGTT
GGCTTACCGA ATGAATCTCG CCCCTTGATC CTATGTGAAT ACGCCCATGC GATGGGCAAT
AGCTTCGGTG GATTTGCCCG CTATTGGCAG GCATTTCGTC AGTACCCGCG CTTACAGGGC
GGGTTTATTT GGGACTGGGT AGACCAAAGT CTGACTCATC ATAATGACCA TGGTCAGCCT
TATTGGGCGT ATGGGGGTGA TTTTGGTGAT ACCCCCAATG ACCGCCAGTT CTGCATGAAC
GGATTAGTCT TCCCTGACCG CAGCCCGCAC CCGAGCCTTT ATGAAGCGCA GTGCGCACAG
CAATTCTTCC AATTTTCGTT GCTGAGTACG ACCCCGTTGG TGATCAACAT TACCAGTGAA
TATTTGTTCC GAGAGAGTGA TAACGAACAA TTATATTGGC GGATAATGTT AGAGGGAGAA
TCCATGTTGG AGGGTAGCCA ACCCCTGAAT TTGTCGCCTG AAAGCTCACA GTGCTACAGG
TTGGCAGAGA AATTACCCAC GCTTAATAAA CCTGGGCAGC TATGGCTAAA TGTTGAGATA
AGGCAACCAA AAGAAACCCC GTGGTCCCCT GCTCAACATC GCAGTGCCTG GCATCAATGG
CGCTTACCAC AACCACTCTT TTCGCCGTCC AGTGATCTGA CCAATGCTAC AGCGCATTAT
GCCCCTCAAC TGCAACATAA CCTTCAACTA CAACATAACC GTCAACTACA ACATGACCTT
CAACTGCAGC AAGATGAACA GCATATTAAG GTGACTTATC AGCAACAATG CTGGCAATTC
AGTCGTCAAA CGGGGCGGTT GGCGCAATGG TGGGTGGCGG ATAAACCGAT GCTACTGCGC
CCACTACAAG ATCAATTTGT GCGTGCGCCG CTGGATAACG ATATCGGTAT CAGCGAAGCT
ACGCATATTG ACCCCAATGC TTGGGTTGAG CGCTGGAAGA AAGCCGGAAT GTATCAACTC
CAGCAACGCT GCCTCTCTCT ACACGTAGAT CATTTATCCC ATTCAGTACA AATCAGTGCC
GAATACGGTT ATGAATTCGA GCAAGAGCCC TTGCTACACA GCCATTGGGT ATACCGTTTT
GACCGACATG GCCGTATGAC CATTGATGTT AACGTCCGTA TCGCTACCTC ACTCCCTGCG
CCAGCCAGAA TTGGCATGTG TTGCCAACTG GCTGATATCT CACCTACGGT TGACTGGCTA
GGGTTGGGGC CACATGAAAA CTACCCTGAT CGGCAGCTTG CAGCACAATA TGGGCACTGG
TCCCTGCCAT TAGAGCAGAT GCACACCGCG TATATTTTCC CCAGTGAGAA TGGCTTGCGC
TGCAATACCC ATACGCTGAA TTATGGCCGC TGGACGTTAA CGGGCGATTT CCACTTTGGT
ATAAGTCGCT ACAGCACCCA GCAACTGATG GTGACCTCCC ATCAACATCT ATTGGAACCC
GAAGAGGGCA CCTGGCTCAA TATTGATGGC TTCCATATGG GGGTCGGCGG TGATGATTCA
TGGAGCCCGA GTGTTCACAT TGATGACATA CTCACCCGTG AAACCTATCA GTACCAAATC
TGTTGGCAAT ACAAGGTGTA A
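
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before anything can be computed from it. The sketch below shows one way to do that and to recompute a GC fraction and basic CDS checks. The helper names are hypothetical. The stop codons are those of translation table 11, and only ATG is checked as a start codon because that is how this sequence begins (table 11 permits alternative start codons as well).

```python
def clean_sequence(text):
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_cds(seq, stop_codons=("TAA", "TAG", "TGA")):
    """Check length is a multiple of 3, starts with ATG, ends with a stop."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stop_codons


if __name__ == "__main__":
    # Toy input in the same block layout; paste the full sequence to check it.
    toy = clean_sequence("ATGACG TCATAA")
    print(len(toy), looks_like_cds(toy), round(gc_fraction(toy), 3))
```

Applied to the full sequence, `len(clean_sequence(...))` can be compared with the reported gene length of 3201 bp and `gc_fraction` with the reported GC content of 50%.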
 
Protein sequence
MTSQEKVPFQ VQLSLPQILS RRDWENPQIT QYHRLEAHPP FHSWRDVESA QKDRPSPQQQ 
TLNGLWSFSY FTQPEAVPEH WVRCDLAEAK PLPVPANWQL HGYDAPIYTN IQYPIPVNPP
RVPDLNPTGC YSRDFTLEPS WLASGKTRII FDGVSSAFYL WCNGQWVGYS QDSRLPAEFD
LTPYLQAGSN RIAVLVLRWS DGSYLEDQDM WRMSGIFRDV KLLHKPEIHL RDIHIMTHLS
PEFTSANLEV MAAVNIPSLQ LNDPQVTGSY QLRVQLWLAD KLVASLQQPL GTQAIDERGP
YTDRTQLVLR IDQPLLWSAE QPTLYRAVVS LLNHQQELIE AEAYDVGFRQ VAIHQGLLKI
NGKAVLIRGV NRHEHHPQTG QAIDEESLLQ DILLMKQHNF NAVRCSHYPN HPLWYRLCDR
YGLYVVDEAN IETHGMQPMS RLSDDPSWFS AFSERVTRMV QRDRNHPCII IWSLGNESGH
GATHDALYRW IKTNDPTRPV QYEGGGANTL ATDILCPMYA RVDEDQPFPA VPKWSIKKWV
GLPNESRPLI LCEYAHAMGN SFGGFARYWQ AFRQYPRLQG GFIWDWVDQS LTHHNDHGQP
YWAYGGDFGD TPNDRQFCMN GLVFPDRSPH PSLYEAQCAQ QFFQFSLLST TPLVINITSE
YLFRESDNEQ LYWRIMLEGE SMLEGSQPLN LSPESSQCYR LAEKLPTLNK PGQLWLNVEI
RQPKETPWSP AQHRSAWHQW RLPQPLFSPS SDLTNATAHY APQLQHNLQL QHNRQLQHDL
QLQQDEQHIK VTYQQQCWQF SRQTGRLAQW WVADKPMLLR PLQDQFVRAP LDNDIGISEA
THIDPNAWVE RWKKAGMYQL QQRCLSLHVD HLSHSVQISA EYGYEFEQEP LLHSHWVYRF
DRHGRMTIDV NVRIATSLPA PARIGMCCQL ADISPTVDWL GLGPHENYPD RQLAAQYGHW
SLPLEQMHTA YIFPSENGLR CNTHTLNYGR WTLTGDFHFG ISRYSTQQLM VTSHQHLLEP
EEGTWLNIDG FHMGVGGDDS WSPSVHIDDI LTRETYQYQI CWQYKV
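
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal sketch of that translation is given below, using only the Python standard library. The codon table is built from the standard amino acid string in TCAG order, which is also the assignment used by translation table 11. The function name is hypothetical, and the sketch does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a nucleotide sequence up to the first stop codon."""
    seq = "".join(seq.split()).upper()  # tolerate the block layout
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    print(translate("ATGACGTCAC AGGAAAAGGT ACCATTCCAG"))
```

The first three blocks of the gene translate to MTSQEKVPFQ, which matches the first block of the protein sequence. The final codons CAA TAC AAG GTG TAA give QYKV followed by the stop.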