Gene YPK_1739 details


Gene Information

Locus tag: YPK_1739
Symbol: lacZ
ID: 6087222
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis YPIII
Kingdom: Bacteria
Replicon accession: NC_010465
Strand:
Start bp: 1924556
End bp: 1927756
Gene Length: 3201 bp
Protein Length: 1066 aa
Translation table: 11
GC content: 50%
IMG OID: 641596809
Product: beta-D-galactosidase
Protein accession: YP_001720485
Protein GI: 170023980
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
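
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(1924556, 1927756)
print(length)                  # 3201
print(protein_length(length))  # 1066
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields in the record.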


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000361485
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGTCAC AGGAAAAGGT ACCATTCCAG GTGCAACTGA GTCTTCCCCA GATTTTGTCC 
CGCCGAGATT GGGAAAACCC GCAGATCACA CAGTATCATC GCCTGGAGGC CCACCCGCCT
TTTCACAGTT GGCGTGATGT AGAATCTGCC CAGAAGGATC GTCCTTCACC ACAGCAACAA
ACACTCAATG GGCTATGGTC ATTCAGCTAT TTCACACAAC CTGAAGCGGT ACCCGAGCAC
TGGGTGAGGT GCGATTTAGC CGAGGCAAAG CCGCTCCCTG TACCGGCTAA CTGGCAACTT
CATGGTTATG ACGCACCAAT TTACACCAAT ATACAATACC CTATTCCCGT CAACCCACCA
CGGGTCCCGG ATCTAAATCC AACGGGTTGC TATTCCCGTG ATTTCACGTT AGAGCCAAGC
TGGTTGGCAT CGGGTAAGAC TCGCATTATT TTTGACGGTG TCAGTTCTGC ATTTTATCTG
TGGTGTAATG GGCAATGGGT AGGTTATTCA CAAGACAGCC GCCTACCTGC TGAGTTCGAT
CTCACCCCCT ATTTGCAGGC TGGCAGTAAC CGTATCGCAG TTTTAGTTCT GCGCTGGAGT
GATGGGAGTT ATCTTGAAGA TCAAGATATG TGGCGTATGA GCGGAATTTT TCGTGATGTG
AAATTGTTGC ATAAACCCGA GATTCACTTA CGGGATATCC ACATCATGAC GCATCTATCC
CCTGAATTCA CCTCTGCAAA TTTAGAGGTA ATGGCGGCCG TCAATATCCC CTCTCTACAG
CTCAATGATC CGCAGGTGAC CGGATCCTAT CAGCTCCGTG TACAACTTTG GTTAGCCGAT
AAATTGGTCG CCAGTTTACA ACAGCCTTTA GGCACCCAAG CCATTGATGA ACGAGGTCCT
TATACTGATC GTACCCAGCT AGTATTGCGA ATAGATCAGC CTCTGCTCTG GAGTGCCGAG
CAGCCGACGC TATACCGAGC CGTGGTTTCC TTGCTCAATC ATCAGCAAGA ATTGATTGAG
GCCGAAGCCT ATGACGTGGG TTTCAGGCAA GTGGCAATCC ATCAAGGCTT GCTTAAAATC
AATGGCAAAG CGGTGCTGAT CAGAGGGGTG AATCGACATG AACATCACCC GCAAACAGGT
CAGGCCATTG ATGAAGAGAG TCTGTTGCAA GACATTTTAT TAATGAAACA GCATAATTTT
AATGCTGTGC GCTGCTCCCA CTATCCCAAT CATCCTTTAT GGTACCGCCT TTGTGACCGC
TATGGTTTGT ATGTGGTTGA TGAAGCGAAT ATTGAGACAC ACGGTATGCA GCCCATGAGC
AGGCTGTCCG ATGACCCAAG CTGGTTTTCA GCTTTCAGTG AACGGGTGAC GCGGATGGTT
CAGCGAGATC GCAACCATCC ATGCATTATT ATCTGGTCAC TGGGCAATGA ATCAGGCCAT
GGCGCAACCC ATGATGCCCT CTATCGTTGG ATAAAAACCA ATGACCCCAC CCGCCCTGTG
CAATATGAAG GGGGCGGTGC CAACACCTTA GCGACCGACA TTCTGTGTCC GATGTATGCC
CGTGTTGATG AAGACCAGCC CTTTCCTGCC GTCCCCAAGT GGTCAATCAA AAAATGGATT
GGCTTACCGA ATGAATCTCG CCCCTTGATC CTATGTGAAT ACGCCCATGC GATGGGCAAT
AGCTTCGGTG GATTTGCCCG CTATTGGCAG GCATTTCGTC AGTACCCGCG CTTACAGGGC
GGGTTTATTT GGGACTGGGT AGACCAAAGT CTGACTCATC ATAATGACCA TGGTCAGCCT
TATTGGGCGT ATGGGGGTGA TTTTGGTGAT ACTCCCAATG ACCGCCAGTT CTGCATGAAC
GGATTAGTCT TCCCTGACCG CAGCCCGCAC CCGAGCCTTT ATGAAGCGCA GTGCGCACAG
CAATTCTTCC AATTTTCGTT GCTGAGTACG ACCCCGTTGG TGATCAACAT TACCAGTGAA
TATTTGTTCC GAGAGAGTGA TAACGAACAA TTATATTGGC GGATAATGTT AGAGGGAGAA
TCCGTGTTGG AGGGTAGCCA ACCCCTGAAT TTGTCGCCTG AAAGCTCACA GTGCTACAGG
TTGGCAGAGA AATTACCCAC GCTTAATAAA CCTGGGCAGC TATGGCTGAA TGTTGAGATA
AGGCAACCAA AAGAAACCCC GTGGTCCCCT GCTCAACATC GCAGTGCCTG GCATCAATGG
CGCTTACCAC AACCACTCTT TTCGCCGTCC AGTGATCTGA CCAATGCTAC AGCGCATTAT
GCCCCTCAAC TGCAACATAA CCTTCAACTA CAACATAACC GTCAACTACA ACATGACCTT
CAACTGCAGC AAGATGAACA GCATATTAAG GTGACTTATC AGCAACAATG CTGGCAATTC
AGTCGTCAAA CGGGGCGGTT GGATCAATGG TGGGTGGCGG ATAAACCGAT GCTACTGCGC
CCACTACAAG ATCAATTTGT GCGTGCGCCG CTGGATAACG ATATCGGTAT CAGCGAAGCT
ACGCATATTG ACCCCAATGC TTGGGTTGAG CGCTGGAAGA AAGCCGGAAT GTATCAACTC
CAGCAACGCT GCCTCTCTCT ACACGTAGAT CATTTATCCC ATTCAGTACA AATCAGTGCC
GAATACGGTT ATGAATTCGA GCAAGAGCCC TTGCTACACA GCCATTGGGT ATACCGTTTT
GACCGACATG GCCGTATGAC CATTGATGTT AACGTCCGTA TCGCTACCTC ACTCCCTGCG
CCAGCCAGAA TTGGCATGTG TTGCCAACTG GCTGATATCT CACCTACGGT TGAATGGCTA
GGGTTGGGGC CACATGAAAA CTACCCTGAT CGGCAGCTTG CAGCACAATA TGGGCACTGG
TCCCTGCCAT TAGAGCAGAT GCACACCGCG TATATTTTCC CCAGTGAGAA TGGCTTGCGC
TGCAATACCC ATACGCTGAA TTATGGCCGC TGGACGTTAA CGGGCGATTT CCACTTTGGT
ATAAGTCGCT ACAGCACCCA GCAACTGATG GTGACCTCCC ATCAACATCT ATTGGAACCC
GAAGAGGGCA CCTGGCTCAA TATTGATGGT TTCCATATGG GGGTGGGCGG TGATGATTCA
TGGAGCCCGA GTGTTCACAT TGATGACATA CTCACCCGTG AAACCTATCA GTACCAAATC
TGTTGGCAAT ACAAGGTGTA A
 
Protein sequence
MTSQEKVPFQ VQLSLPQILS RRDWENPQIT QYHRLEAHPP FHSWRDVESA QKDRPSPQQQ 
TLNGLWSFSY FTQPEAVPEH WVRCDLAEAK PLPVPANWQL HGYDAPIYTN IQYPIPVNPP
RVPDLNPTGC YSRDFTLEPS WLASGKTRII FDGVSSAFYL WCNGQWVGYS QDSRLPAEFD
LTPYLQAGSN RIAVLVLRWS DGSYLEDQDM WRMSGIFRDV KLLHKPEIHL RDIHIMTHLS
PEFTSANLEV MAAVNIPSLQ LNDPQVTGSY QLRVQLWLAD KLVASLQQPL GTQAIDERGP
YTDRTQLVLR IDQPLLWSAE QPTLYRAVVS LLNHQQELIE AEAYDVGFRQ VAIHQGLLKI
NGKAVLIRGV NRHEHHPQTG QAIDEESLLQ DILLMKQHNF NAVRCSHYPN HPLWYRLCDR
YGLYVVDEAN IETHGMQPMS RLSDDPSWFS AFSERVTRMV QRDRNHPCII IWSLGNESGH
GATHDALYRW IKTNDPTRPV QYEGGGANTL ATDILCPMYA RVDEDQPFPA VPKWSIKKWI
GLPNESRPLI LCEYAHAMGN SFGGFARYWQ AFRQYPRLQG GFIWDWVDQS LTHHNDHGQP
YWAYGGDFGD TPNDRQFCMN GLVFPDRSPH PSLYEAQCAQ QFFQFSLLST TPLVINITSE
YLFRESDNEQ LYWRIMLEGE SVLEGSQPLN LSPESSQCYR LAEKLPTLNK PGQLWLNVEI
RQPKETPWSP AQHRSAWHQW RLPQPLFSPS SDLTNATAHY APQLQHNLQL QHNRQLQHDL
QLQQDEQHIK VTYQQQCWQF SRQTGRLDQW WVADKPMLLR PLQDQFVRAP LDNDIGISEA
THIDPNAWVE RWKKAGMYQL QQRCLSLHVD HLSHSVQISA EYGYEFEQEP LLHSHWVYRF
DRHGRMTIDV NVRIATSLPA PARIGMCCQL ADISPTVEWL GLGPHENYPD RQLAAQYGHW
SLPLEQMHTA YIFPSENGLR CNTHTLNYGR WTLTGDFHFG ISRYSTQQLM VTSHQHLLEP
EEGTWLNIDG FHMGVGGDDS WSPSVHIDDI LTRETYQYQI CWQYKV
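
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. This is not the tool that produced the record: `translate` and `CODON_TABLE` are hypothetical names, and the codon table is written out by hand. It relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ in which codons may act as start codons, which does not matter here because the gene begins with ATG).

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    Ambiguous bases such as N are not handled and raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
print(translate("ATGACGTCAC AGGAAAAGGT ACCATTCCAG GTGCAACTGA GTCTTCCCCA GATTTTGTCC"))
# MTSQEKVPFQVQLSLPQILS
print(translate("TGTTGGCAAT ACAAGGTGTA A"))
# CWQYKV
```

The two outputs match the first 20 and last 6 residues of the protein sequence listed above.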