Gene Information |
Locus tag | Spro_3173 |
Symbol | lacZ |
ID | 5605502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | + |
Start bp | 3491438 |
End bp | 3494527 |
Gene Length | 3090 bp |
Protein Length | 1029 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640938716 |
Product | beta-D-galactosidase |
Protein accession | YP_001479401 |
Protein GI | 157371412 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
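The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information table above.
start_bp = 3491438
end_bp = 3494527
protein_length_aa = 1029


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(length_bp):
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                           # 3090
print(expected_protein_length(length_bp))  # 1029
```

Under these assumptions the computed values agree with the listed Gene Length (3090 bp) and Protein Length (1029 aa).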
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00462554 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAGCC TGCCCTATCC TTCCCTGAAA GACCTCCTCG CCCGCCGGGA CTGGCAAAAT CCGGCCTGTA CCCATTATCA GCGGCTGGCT GCACATCCAC CTTTCTCGAG CTGGCGCAAC CTGAATGCCG CCCGTGATGA CAAGAGCAGC GAGAGCCGGC AAATATTGAA CGGCGACTGG CAGTTCAGCT ATTTCGATAA GCCCCAGGCG GTACCCGACG GTTGGTTACA ACAGGACCTG ACCGACGCAG ATACTCTTGC GGTGCCGTCC AACTGGCAAC TTGCCGCTTA TGACGCGCCG ATCTACACCA ACGTCCGCTA CCCGATCCCG GTCAATCCGC CACAGGTACC AGAGGAAAAC CCGACCGGCT GCTATTCGCG GCAGTTTACC GTCGACCCTG CCTGGCTGGC GGAAGGCCAG ACGCGCATCA TTTTTGACGG CGTCAACTCG GCGTTTTATC TGTGGTGCAA CGGCCACTGG GTCGGTTATT CACAGGACAG CCGCCTGCCC GCCGAGTTTG ATCTCAGCCC CTGGTTGCAG GCCGGTGAGA ACCGACTGGC GGTGATGGTG CTGCGCTGGT GCGATGGCAG CTATCTGGAA GATCAGGATA TGTGGCGCAT GAGCGGTATT TTCCGCGACG TCAGCCTGCT GCATAAACCC GCCACGCATC TGAGCGATAT CCGCATCACC ACCCCGCTGT ATGACAGTTT CCGCCGTGGC GAACTGGTGG CGGAAGTTCA CATCAACCAG CCCGCGCAGC ACCGGGTACA GTTGCAGCTA TGGCGTGATG GCCAGCTGGT TGGGGAAAAA ACTCAGGCAT TCGGCAGTGA AATTATCGAC GAACGCGGTG CCTATGAGGA TCGCACTACC CTGTGCCTCC CGGTGGAACA ACCGGCGTTG TGGAGCGCGG AAACACCCAC GCTGTACCGC GCAACCGTCA CCCTGCTGTC GCCGGAAGGA AAAATTATTG AGGTGGAGGC CTATGACGTC GGCTTCCGCC AGGTGGAAAT CAGCAATGGA CTGCTGAAGC TTAACGGCCA GCCCTTGTTG ATCCGCGGTA CCAACCGCCA CGAACATCAT CCGCAGCACG GCCAGGTGAT GGACGAGGCC ACCATGCGCC ATGACATCCT GCTGATGAAG CAACACAACT TCAACGCGGT GCGCTGCTCA CACTACCCGA ACCATCCATT GTGGTACCGG TTATGCGATC GCTACGGGCT GTATGTGGTG GATGAAGCCA ATATTGAAAC CCACGGCATG CAGCCGATGA ACCGGTTGTC TGACGATCCG CTATGGTTGC CGGCAATGAG CGAACGCGTA ACCCGCATGG TGCAGCGTGA CCGCAACCAC CCTTGTATTA TTATCTGGTC GCTGGGTAAC GAATCCGGTC ACGGCTGCAA CCACGACGCG CTGTATCGCT GGGTGAAAAC TCAGGATCCT ACCCGCCCGG TTCAGTACGA AGGCGGGGGG GCCAACAGCG CCGCCACCGA TATTATCTGC CCGATGTATG CGCGGGTGGA TCAGGATCAG CCGTTCCCCG CCGTGCCCAA GTGGTCAATC AAAAAGTGGA TCGGCCTGCC GGATGAGCAT CGTCCGCTGA TCCTGTGCGA ATACGCTCAT GCGATGGGCA ACAGCTTCGG GGGTTTTGAC CGCTACTGGC AGGCCTTCCG TCAGTATCCG CGCCTGCAGG GTGGCTTCGT CTGGGACTGG GTCGATCAGG CACTGACCCG CAGTGATGAA AACGGCAACC CTTACTGGGC TTACGGTGGC GACTTTGGCG ACACGCCGAA CGATCGACAA 
TTCTGCCTTA ACGGTCTGGT ATTCCCCGAC CGCACACCCC ACCCTGCGCT GTTTGAAGCG CAACGCGCAC AGCAATTTTT CCAGTTTACC TTCGACGCCG AAACGCTGAC GCTGACCGTC AACAGCGAGT ATCTGTTCCG CCAAACCGAT AATGAACGGC TGAACTGGCG GCTGGAACTC GATGGCACGG AGCGCGCCAG CGGCAGCTTC GATCTCAACC TGCTACCGCA GAGTAGCGCC AGCTTCCCAC TGCTCGAACG CTTGCCGATG CTCCATCAAC CCGGCGAACT GTGGCTGAAT GTCGAAGTGG TGCAACCGCT GGCCACCGAC TGGTCCGAAG CCAACCATCG CTGCGCCTGG GATCAATGGC TGGTGCCACG CACGCTGCAT TTTGCACCAC CAGCAGTGGC CGGTTCAGCG CCACAGCTGA GCCAAAATGA CCAAACTATC GACATAACCC ATGGCCATCA ACGCTGGCAG TTTACGCGCC ACGACGGCTG CCTGAGCCAA TGGTGGCAAC ATGACCACTC TCAACTGCTG ACGCCACTGC GCGATAACTT TATCCGCGCG CCGCTGGATA ACGACATCGG CATCAGCGAA GTCGAGCGCA TCGATCCCAA CGCCTGGGTA GAACGCTGGA AGCTGGCGGG CATGTATCGG CTGGAGGAGC GCTGCACGCT GTTGCAGGCC GATCAATTGA GCGACGGCGT GCGGGTGGTG AGTGAACACC TGTTCGAAGC CGATGGGCAA ACGCTGCTGC GCAGTCGCAA ACAGTGGCTG TTCGACAGCG AGGGCGCCGT CAGCATCAGC GTCGACGTCG ATATTGCCGC CAGTCTGCCG CCACCGGCAC GTATTGGCCT GAGCTGCCAA TTGAAAGAAA TTCATCCACA GGCGCAATGG TTGGGGCTGG GCCCACATGA GAATTACCCG GACCGCCGCC TCGCCGCGCA ATTTGGACGT TGGCAGCAGC CGCTGGAAGC GTTGCACACG CCGTATATCT TCCCCGGCGA GAACGGGCTG CGCTGCGAGA CCCGCAGCCT GCTGTACGGT GGCTGGCACA TCGACGGACG GTTCCACTTC TCGCTCAGCC GCTACGGCCT GCGCCAGTTG ATGGAGTGCA GCCACCAGCA CCTGCTGCAA CCGGAAGCCG GCACCTGGCT CAGCCTGGAC GGTTTCCACA TGGGGGTGGG CGGTGACGAC TCCTGGAGCC CGAGCGTTAA TCAGGACTAC CTGCTCAGCG GCAGCCATTA CCATTATCAA CTGCGTCTAA AACGCGCAGA ACGGAGCTAA
|
Protein sequence | MSSLPYPSLK DLLARRDWQN PACTHYQRLA AHPPFSSWRN LNAARDDKSS ESRQILNGDW QFSYFDKPQA VPDGWLQQDL TDADTLAVPS NWQLAAYDAP IYTNVRYPIP VNPPQVPEEN PTGCYSRQFT VDPAWLAEGQ TRIIFDGVNS AFYLWCNGHW VGYSQDSRLP AEFDLSPWLQ AGENRLAVMV LRWCDGSYLE DQDMWRMSGI FRDVSLLHKP ATHLSDIRIT TPLYDSFRRG ELVAEVHINQ PAQHRVQLQL WRDGQLVGEK TQAFGSEIID ERGAYEDRTT LCLPVEQPAL WSAETPTLYR ATVTLLSPEG KIIEVEAYDV GFRQVEISNG LLKLNGQPLL IRGTNRHEHH PQHGQVMDEA TMRHDILLMK QHNFNAVRCS HYPNHPLWYR LCDRYGLYVV DEANIETHGM QPMNRLSDDP LWLPAMSERV TRMVQRDRNH PCIIIWSLGN ESGHGCNHDA LYRWVKTQDP TRPVQYEGGG ANSAATDIIC PMYARVDQDQ PFPAVPKWSI KKWIGLPDEH RPLILCEYAH AMGNSFGGFD RYWQAFRQYP RLQGGFVWDW VDQALTRSDE NGNPYWAYGG DFGDTPNDRQ FCLNGLVFPD RTPHPALFEA QRAQQFFQFT FDAETLTLTV NSEYLFRQTD NERLNWRLEL DGTERASGSF DLNLLPQSSA SFPLLERLPM LHQPGELWLN VEVVQPLATD WSEANHRCAW DQWLVPRTLH FAPPAVAGSA PQLSQNDQTI DITHGHQRWQ FTRHDGCLSQ WWQHDHSQLL TPLRDNFIRA PLDNDIGISE VERIDPNAWV ERWKLAGMYR LEERCTLLQA DQLSDGVRVV SEHLFEADGQ TLLRSRKQWL FDSEGAVSIS VDVDIAASLP PPARIGLSCQ LKEIHPQAQW LGLGPHENYP DRRLAAQFGR WQQPLEALHT PYIFPGENGL RCETRSLLYG GWHIDGRFHF SLSRYGLRQL MECSHQHLLQ PEAGTWLSLD GFHMGVGGDD SWSPSVNQDY LLSGSHYHYQ LRLKRAERS
|
| |