Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YPK_1739 |
Symbol | lacZ |
ID | 6087222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis YPIII |
Kingdom | Bacteria |
Replicon accession | NC_010465 |
Strand | - |
Start bp | 1924556 |
End bp | 1927756 |
Gene Length | 3201 bp |
Protein Length | 1066 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641596809 |
Product | beta-D-galactosidase |
Protein accession | YP_001720485 |
Protein GI | 170023980 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000361485 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGTCAC AGGAAAAGGT ACCATTCCAG GTGCAACTGA GTCTTCCCCA GATTTTGTCC CGCCGAGATT GGGAAAACCC GCAGATCACA CAGTATCATC GCCTGGAGGC CCACCCGCCT TTTCACAGTT GGCGTGATGT AGAATCTGCC CAGAAGGATC GTCCTTCACC ACAGCAACAA ACACTCAATG GGCTATGGTC ATTCAGCTAT TTCACACAAC CTGAAGCGGT ACCCGAGCAC TGGGTGAGGT GCGATTTAGC CGAGGCAAAG CCGCTCCCTG TACCGGCTAA CTGGCAACTT CATGGTTATG ACGCACCAAT TTACACCAAT ATACAATACC CTATTCCCGT CAACCCACCA CGGGTCCCGG ATCTAAATCC AACGGGTTGC TATTCCCGTG ATTTCACGTT AGAGCCAAGC TGGTTGGCAT CGGGTAAGAC TCGCATTATT TTTGACGGTG TCAGTTCTGC ATTTTATCTG TGGTGTAATG GGCAATGGGT AGGTTATTCA CAAGACAGCC GCCTACCTGC TGAGTTCGAT CTCACCCCCT ATTTGCAGGC TGGCAGTAAC CGTATCGCAG TTTTAGTTCT GCGCTGGAGT GATGGGAGTT ATCTTGAAGA TCAAGATATG TGGCGTATGA GCGGAATTTT TCGTGATGTG AAATTGTTGC ATAAACCCGA GATTCACTTA CGGGATATCC ACATCATGAC GCATCTATCC CCTGAATTCA CCTCTGCAAA TTTAGAGGTA ATGGCGGCCG TCAATATCCC CTCTCTACAG CTCAATGATC CGCAGGTGAC CGGATCCTAT CAGCTCCGTG TACAACTTTG GTTAGCCGAT AAATTGGTCG CCAGTTTACA ACAGCCTTTA GGCACCCAAG CCATTGATGA ACGAGGTCCT TATACTGATC GTACCCAGCT AGTATTGCGA ATAGATCAGC CTCTGCTCTG GAGTGCCGAG CAGCCGACGC TATACCGAGC CGTGGTTTCC TTGCTCAATC ATCAGCAAGA ATTGATTGAG GCCGAAGCCT ATGACGTGGG TTTCAGGCAA GTGGCAATCC ATCAAGGCTT GCTTAAAATC AATGGCAAAG CGGTGCTGAT CAGAGGGGTG AATCGACATG AACATCACCC GCAAACAGGT CAGGCCATTG ATGAAGAGAG TCTGTTGCAA GACATTTTAT TAATGAAACA GCATAATTTT AATGCTGTGC GCTGCTCCCA CTATCCCAAT CATCCTTTAT GGTACCGCCT TTGTGACCGC TATGGTTTGT ATGTGGTTGA TGAAGCGAAT ATTGAGACAC ACGGTATGCA GCCCATGAGC AGGCTGTCCG ATGACCCAAG CTGGTTTTCA GCTTTCAGTG AACGGGTGAC GCGGATGGTT CAGCGAGATC GCAACCATCC ATGCATTATT ATCTGGTCAC TGGGCAATGA ATCAGGCCAT GGCGCAACCC ATGATGCCCT CTATCGTTGG ATAAAAACCA ATGACCCCAC CCGCCCTGTG CAATATGAAG GGGGCGGTGC CAACACCTTA GCGACCGACA TTCTGTGTCC GATGTATGCC CGTGTTGATG AAGACCAGCC CTTTCCTGCC GTCCCCAAGT GGTCAATCAA AAAATGGATT GGCTTACCGA ATGAATCTCG CCCCTTGATC CTATGTGAAT ACGCCCATGC GATGGGCAAT AGCTTCGGTG GATTTGCCCG CTATTGGCAG GCATTTCGTC AGTACCCGCG CTTACAGGGC GGGTTTATTT GGGACTGGGT AGACCAAAGT CTGACTCATC ATAATGACCA TGGTCAGCCT TATTGGGCGT ATGGGGGTGA TTTTGGTGAT ACTCCCAATG ACCGCCAGTT CTGCATGAAC GGATTAGTCT TCCCTGACCG CAGCCCGCAC CCGAGCCTTT ATGAAGCGCA GTGCGCACAG CAATTCTTCC AATTTTCGTT GCTGAGTACG ACCCCGTTGG TGATCAACAT TACCAGTGAA TATTTGTTCC GAGAGAGTGA TAACGAACAA TTATATTGGC GGATAATGTT AGAGGGAGAA TCCGTGTTGG AGGGTAGCCA ACCCCTGAAT TTGTCGCCTG AAAGCTCACA GTGCTACAGG TTGGCAGAGA AATTACCCAC GCTTAATAAA CCTGGGCAGC TATGGCTGAA TGTTGAGATA AGGCAACCAA AAGAAACCCC GTGGTCCCCT GCTCAACATC GCAGTGCCTG GCATCAATGG CGCTTACCAC AACCACTCTT TTCGCCGTCC AGTGATCTGA CCAATGCTAC AGCGCATTAT GCCCCTCAAC TGCAACATAA CCTTCAACTA CAACATAACC GTCAACTACA ACATGACCTT CAACTGCAGC AAGATGAACA GCATATTAAG GTGACTTATC AGCAACAATG CTGGCAATTC AGTCGTCAAA CGGGGCGGTT GGATCAATGG TGGGTGGCGG ATAAACCGAT GCTACTGCGC CCACTACAAG ATCAATTTGT GCGTGCGCCG CTGGATAACG ATATCGGTAT CAGCGAAGCT ACGCATATTG ACCCCAATGC TTGGGTTGAG CGCTGGAAGA AAGCCGGAAT GTATCAACTC CAGCAACGCT GCCTCTCTCT ACACGTAGAT CATTTATCCC ATTCAGTACA AATCAGTGCC GAATACGGTT ATGAATTCGA GCAAGAGCCC TTGCTACACA GCCATTGGGT ATACCGTTTT GACCGACATG GCCGTATGAC CATTGATGTT AACGTCCGTA TCGCTACCTC ACTCCCTGCG CCAGCCAGAA TTGGCATGTG TTGCCAACTG GCTGATATCT CACCTACGGT TGAATGGCTA GGGTTGGGGC CACATGAAAA CTACCCTGAT CGGCAGCTTG CAGCACAATA TGGGCACTGG TCCCTGCCAT TAGAGCAGAT GCACACCGCG TATATTTTCC CCAGTGAGAA TGGCTTGCGC TGCAATACCC ATACGCTGAA TTATGGCCGC TGGACGTTAA CGGGCGATTT CCACTTTGGT ATAAGTCGCT ACAGCACCCA GCAACTGATG GTGACCTCCC ATCAACATCT ATTGGAACCC GAAGAGGGCA CCTGGCTCAA TATTGATGGT TTCCATATGG GGGTGGGCGG TGATGATTCA TGGAGCCCGA GTGTTCACAT TGATGACATA CTCACCCGTG AAACCTATCA GTACCAAATC TGTTGGCAAT ACAAGGTGTA A
|
Protein sequence | MTSQEKVPFQ VQLSLPQILS RRDWENPQIT QYHRLEAHPP FHSWRDVESA QKDRPSPQQQ TLNGLWSFSY FTQPEAVPEH WVRCDLAEAK PLPVPANWQL HGYDAPIYTN IQYPIPVNPP RVPDLNPTGC YSRDFTLEPS WLASGKTRII FDGVSSAFYL WCNGQWVGYS QDSRLPAEFD LTPYLQAGSN RIAVLVLRWS DGSYLEDQDM WRMSGIFRDV KLLHKPEIHL RDIHIMTHLS PEFTSANLEV MAAVNIPSLQ LNDPQVTGSY QLRVQLWLAD KLVASLQQPL GTQAIDERGP YTDRTQLVLR IDQPLLWSAE QPTLYRAVVS LLNHQQELIE AEAYDVGFRQ VAIHQGLLKI NGKAVLIRGV NRHEHHPQTG QAIDEESLLQ DILLMKQHNF NAVRCSHYPN HPLWYRLCDR YGLYVVDEAN IETHGMQPMS RLSDDPSWFS AFSERVTRMV QRDRNHPCII IWSLGNESGH GATHDALYRW IKTNDPTRPV QYEGGGANTL ATDILCPMYA RVDEDQPFPA VPKWSIKKWI GLPNESRPLI LCEYAHAMGN SFGGFARYWQ AFRQYPRLQG GFIWDWVDQS LTHHNDHGQP YWAYGGDFGD TPNDRQFCMN GLVFPDRSPH PSLYEAQCAQ QFFQFSLLST TPLVINITSE YLFRESDNEQ LYWRIMLEGE SVLEGSQPLN LSPESSQCYR LAEKLPTLNK PGQLWLNVEI RQPKETPWSP AQHRSAWHQW RLPQPLFSPS SDLTNATAHY APQLQHNLQL QHNRQLQHDL QLQQDEQHIK VTYQQQCWQF SRQTGRLDQW WVADKPMLLR PLQDQFVRAP LDNDIGISEA THIDPNAWVE RWKKAGMYQL QQRCLSLHVD HLSHSVQISA EYGYEFEQEP LLHSHWVYRF DRHGRMTIDV NVRIATSLPA PARIGMCCQL ADISPTVEWL GLGPHENYPD RQLAAQYGHW SLPLEQMHTA YIFPSENGLR CNTHTLNYGR WTLTGDFHFG ISRYSTQQLM VTSHQHLLEP EEGTWLNIDG FHMGVGGDDS WSPSVHIDDI LTRETYQYQI CWQYKV
|
| |