Gene Information |
Locus tag | YpAngola_A2834 |
Symbol | lacZ |
ID | 5801306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | + |
Start bp | 2973072 |
End bp | 2976224 |
Gene Length | 3153 bp |
Protein Length | 1050 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641340686 |
Product | beta-D-galactosidase |
Protein accession | YP_001607216 |
Protein GI | 162421202 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
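The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene span includes the terminal stop codon; neither convention is stated in the record, but both are consistent with the listed lengths. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2973072
END_BP = 2976224
LISTED_GENE_LENGTH_BP = 3153
LISTED_PROTEIN_LENGTH_AA = 1050


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the span includes
    one terminal stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp == LISTED_GENE_LENGTH_BP)
    print(protein_length(length_bp) == LISTED_PROTEIN_LENGTH_AA)
```

Under these assumptions, 2976224 - 2973072 + 1 gives 3153 bp, and 3153 / 3 - 1 gives 1050 aa, matching the Gene Length and Protein Length rows.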
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.677484 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.179493 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAACTGA GTCTTCCCCA GATTTTGTCC CGCCGAGATT GGGAAAACCC GCAGATCACA CAGTATCATC GCCTGGAGGC CCACCCGCCT TTTCACAGTT GGCGTGATGT AGAATCTGCC CAGAAGGATC GTCCTTCACC ACAGCAACAA ACACTCAATG GGCTATGGTC ATTCAGCTAT TTCACACAAC CTGAAGCGGT ACCCGAGCAC TGGGTGAGGT GCGATTTAGC CGAGGCAAAG CCGCTCCCTG TACCGGCTAA CTGGCAACTT CATGGTTATG ACGCACCAAT TTACACCAAT ATACAATACC CTATTCCCGT CAACCCACCA CGGGTCCCGG ATCTAAATCC AACGGGTTGC TATTCCCGTG ATTTCACGTT AGAGCCAAGC TGGTTGGCAT CGGGTAAGAC TCGCATTATT TTTGACGGTG TCAGTTCTGC ATTTTATCTG TGGTGTAATG GGCAATGGGT AGGTTATTCA CAAGACAGCC GCCTACCTGC TGAGTTCGAT CTCACCCCCT ATTTGCAGGC TGGCAGTAAC CGTATCGCAG TTTTAGTTCT GCGCTGGAGT GATGGGAGTT ATCTTGAAGA TCAAGATATG TGGCGTATGA GCGGAATTTT TCGTGATGTG AAATTGTTGC ATAAACCCGA GATTCACTTA CGGGATATCC ACATCATGAC GCATCTATCC CCTGAATTCA CCTCTGCAAA TTTAGAGGTA ATGGCGGCCG TCAATATCCC CTCTCTACAG CTCAATGATC CGCAGGTGAC CGGATCCTAT CAGCTCCGTG TACAACTTTG GTTAGCCGAT AAATTGGTCG CCAGTTTACA ACAGCCTTTA GGCACCCAAG CCATTGATGA ACGAGGTCCT TATACTGATC GTACCCAGCT AGTATTGCGA ATAGATCAGC CTCTGCTCTG GAGTGCCGAG CAGCCGACGC TATACCGAGC CGTGGTTTCC TTGCTCAATC ATCAGCAAGA ATTGATTGAG GCCGAAGCCT ATGACGTGGG TTTCAGGCAA GTGGCAATCC ATCAAGGCTT GCTTAAAATC AATGGCAAAG CGGTGCTGAT CAGAGGGGTG AATCGACATG AACATCACCC GCAAACAGGT CAGGCCATTG ATGAAGAGAG TCTGTTGCAA GACATTTTAT TAATGAAACA GCATAATTTT AATGCTGTGC GCTGCTCCCA CTATCCCAAT CATCCTTTAT GGTACCGCCT TTGTGACCGC TATGGTTTGT ATGTGGTTGA TGAAGCGAAT ATTGAGACAC ACGGTATGCA GCCCATGAGC AGGCTGTCCG ATGACCCAAG CTGGTTTTCA GCTTTCAGTG AACGGGTGAC GCGGATGGTT CAGCGAGATC GCAACCATCC ATGCATTATT ATCTGGTCAC TGGGCAATGA ATCAGGCCAT GGCGCAACCC ATGATGCCCT CTATCGTTGG ATAAAAACCA ATGACCCCAC CCGCCCTGTG CAATATGAAG GGGGCGGTGC CAACACCTTA GCGACCGACA TTCTGTGTCC GATGTATGCC CGTGTTGATG AAGACCAGCC CTTTCCTGCC GTCCCCAAGT GGTCAATCAA AAAATGGATT GGCTTACCGA ATGAATCTCG CCCCTTGATC CTATGTGAAT ACGCCCATGC GATGGGCAAT AGCTTCGGTG GATTTGCCCG CTATTGGCAG GCATTTCGTC AGTACCCGCG CTTACAGGGC GGGTTTATTT GGGACTGGGT AGACCAAAGT CTGACTCATC ATAATGACCA TGGTCAGCCT TATTGGGCGT ATGGGGGTGA TTTTGGTGAT 
ACTCCCAATG ACCGCCAGTT CTGCATGAAC GGATTAGTCT TCCCTGACCG CAGCCCGCAC CCGAGCCTTT ATGAAGCGCA GTGCGCACAG CAATTCTTCC AATTTTCGTT GCTGAGTACG ACCCCGTTGG TGATCAACAT TACCAGTGAA TATTTGTTCC GAGAGAGTGA TAACGAACAA TTATATTGGC GGATAATGTT AGAGGGAGAA TCCGTGTTGG AGGGTAGCCA ACCCCTGAAT TTGTCGCCTG AAAGCTCACA GTGCTACAGG TTGGCAGAGA AATTACCCAC GCTTAATAAA CCTGGGCAGC TATGGCTGAA TGTTGAGATA AGGCAACCAA AAGAAACCCC GTGGTCCCCT GCTCAACATC GCAGTGCCTG GCATCAATGG CGCTTACCAC AACCACTCTT TTCGCCGTCC AGTGATCTGA CCAATGCTAC AGCGCATTAT GCCCCTCAAC TGCAACATAA CCTTCAACTA CAACATGACC TTCAACTGCA GCAAGATGAA CAGCATATTA AGGTGACTTA TCAGCAACAA TGCTGGCAAT TCAGTCGTCA AACGGGGCGG TTGGCGCAAT GGTGGGTGGC GGATAAACCG ATGCTACTGC GCCCACTACA AGATCAATTT GTGCGTGCGC CGCTGGATAA CGATATCGGT ATCAGCGAAG CTACGCATAT TGACCCCAAT GCTTGGGTTG AGCGCTGGAA GAAAGCCGGA ATGTATCAAC TCCAGCAACG CTGCCTCTCT CTACACGTAG ATCATTTATC CCATTCAGTA CAAATCAGTG CCGAATACGG TTATGAATTC GAGCAAGAGC CCTTGCTACA CAGCCATTGG GTATACCGTT TTGACCGACA TGGCCGTATG ACCATTGATG TTAACGTCCG TATCGCTACC TCACTCCCTG CGCCAGCCAG AATTGGCATG TGTTGCCAAC TGGCTGATAT CTCACCTACG GTTGAATGGC TAGGGTTGGG GCCACATGAA AACTACCCTG ATCGGCAGCT TGCAGCACAA TATGGGCACT GGTCCCTGCC ATTAGAGCAG ATGCACACCG CGTATATTTT CCCCAGTGAG AATGGCTTGC GCTGCAATAC CCATACGCTG AATTATGGCC GCTGGACGTT AACGGGCGAT TTCCACTTTG GTATAAGTCG CTACAGCACC CAGCAACTGA TGGTGACCTC CCATCAACAT CTATTGGAAC CCGAAGAGGG CACCTGGCTC AATATTGATG GTTTCCATAT GGGGGTGGGC GGTGATGATT CATGGAGCCC GAGTGTTCAC ATTGATGACA TACTCACCCG TGAAACCTAT CAGTACCAAA TCTGTTGGCA ATACAAGGTG TAA
|
Protein sequence | MQLSLPQILS RRDWENPQIT QYHRLEAHPP FHSWRDVESA QKDRPSPQQQ TLNGLWSFSY FTQPEAVPEH WVRCDLAEAK PLPVPANWQL HGYDAPIYTN IQYPIPVNPP RVPDLNPTGC YSRDFTLEPS WLASGKTRII FDGVSSAFYL WCNGQWVGYS QDSRLPAEFD LTPYLQAGSN RIAVLVLRWS DGSYLEDQDM WRMSGIFRDV KLLHKPEIHL RDIHIMTHLS PEFTSANLEV MAAVNIPSLQ LNDPQVTGSY QLRVQLWLAD KLVASLQQPL GTQAIDERGP YTDRTQLVLR IDQPLLWSAE QPTLYRAVVS LLNHQQELIE AEAYDVGFRQ VAIHQGLLKI NGKAVLIRGV NRHEHHPQTG QAIDEESLLQ DILLMKQHNF NAVRCSHYPN HPLWYRLCDR YGLYVVDEAN IETHGMQPMS RLSDDPSWFS AFSERVTRMV QRDRNHPCII IWSLGNESGH GATHDALYRW IKTNDPTRPV QYEGGGANTL ATDILCPMYA RVDEDQPFPA VPKWSIKKWI GLPNESRPLI LCEYAHAMGN SFGGFARYWQ AFRQYPRLQG GFIWDWVDQS LTHHNDHGQP YWAYGGDFGD TPNDRQFCMN GLVFPDRSPH PSLYEAQCAQ QFFQFSLLST TPLVINITSE YLFRESDNEQ LYWRIMLEGE SVLEGSQPLN LSPESSQCYR LAEKLPTLNK PGQLWLNVEI RQPKETPWSP AQHRSAWHQW RLPQPLFSPS SDLTNATAHY APQLQHNLQL QHDLQLQQDE QHIKVTYQQQ CWQFSRQTGR LAQWWVADKP MLLRPLQDQF VRAPLDNDIG ISEATHIDPN AWVERWKKAG MYQLQQRCLS LHVDHLSHSV QISAEYGYEF EQEPLLHSHW VYRFDRHGRM TIDVNVRIAT SLPAPARIGM CCQLADISPT VEWLGLGPHE NYPDRQLAAQ YGHWSLPLEQ MHTAYIFPSE NGLRCNTHTL NYGRWTLTGD FHFGISRYST QQLMVTSHQH LLEPEEGTWL NIDGFHMGVG GDDSWSPSVH IDDILTRETY QYQICWQYKV
|
| |
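The gene sequence begins with GTG while the protein sequence begins with M. This is expected under translation table 11 (bacterial, archaeal and plant plastid code), where GTG is one of the permitted initiation codons and is read as methionine at the start position. The sketch below is a minimal, dependency-free illustration of that rule; it is not the pipeline that produced this record, and the helper name `translate_cds` is hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same amino acid assignments as the standard code
# and differs in its set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS fragment, reading the first codon as Met.

    Whitespace (such as the 10-base grouping used in this record) is
    ignored. Translation stops at the first stop codon, if any.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if not codons or codons[0] not in START_CODONS_TABLE_11:
        raise ValueError("first codon is not a table 11 start codon")
    residues = ["M"]  # an initiator codon is read as methionine
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    first_30_bases = "GTGCAACTGA GTCTTCCCCA GATTTTGTCC"
    print(translate_cds(first_30_bases))
```

For the first 30 bases of the gene sequence this yields MQLSLPQILS, which matches the first block of the protein sequence. An internal GTG codon would still be read as valine; only the initiator position is treated specially.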