Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PC1_1362 |
Symbol | lacZ |
ID | 8132301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pectobacterium carotovorum subsp. carotovorum PC1 |
Kingdom | Bacteria |
Replicon accession | NC_012917 |
Strand | - |
Start bp | 1585233 |
End bp | 1588364 |
Gene Length | 3132 bp |
Protein Length | 1043 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644864652 |
Product | beta-D-galactosidase |
Protein accession | YP_003016944 |
Protein GI | 253687754 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00215799 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATA CCGTTTTGCC GCATCCCGTA CGCCATCGCG CCACGCTGCA AGAAATCCTT TCCCGACGCG ATTGGGAAAA CCCAACCTGT ACTCACTACC AGAGGCTTCC TGCACATCCG CCGTTTAACA GCTGGCGCAG CGTAGCTACC GCACAGCAGG ATGAACCTTC TCAGCGGCTG CGTCGCCTCA ATGGCGAATG GACGTTTAGC TATTTCACTC GCCCGGAAGC GGTGCCGGAA AGCTGGTTAC AGCAGGATCT GCCCGATGCC GATACCATCC CAGTGCCTTC CAATTGGCAA TTACAGGGCT ATGACGCTCC GATTTATACC AACGTAAAAT ACCCGATCCC CGTCACCCCA CCTTATGTGC AGAAAGACAA TCCCACCGGT TGTTACTCGC TCACTTTTAA GATCAATCAT GACTGGATCA GCAACGGGCA AACCCGGATT ATCTTTGACG GTGTTAACTC CGCGTTTTAC CTCTGGTGTA ACGGTCACTG GGTAGGATAT TCGCAGGATA GTCGCCTGCC TGCGGAGTTT GATATCGGGC GCTATCTGAC AACCGGAGAG AATCGGCTGG CCGTCATGGT GTTACGCTGG TCTGACGGCA GCTATCTGGA AGATCAGGAT ATGTGGCGTA TGAGCGGTAT TTTCCGCGAT GTCACGCTTC TGCATAAACC CACGGTTCAC CTCAGAGATA TCCAACTGAC TACGCCGCTC AGCGCCGATT TCCGCCACGG CACACTGGAG ATTCAAGTGA GGGCAACGCT CACCGAAGCG GAATCCAAAA GCCATCGTGT CCGTGCACAG CTTTGGCGTG GTAATAAACT CATCGGTGAC ACACGACAGG CATTCGGCAG CGATATTGTC GATGAACGCG GTGCTTACCA CGATAAGGCT TTTCTCCGTA TTGACGTGCC ACAGCCCGAT CTGTGGAGCG CCGAACTGCC GCACCTGTAT CGCACAGTTA TTTCATTGGA AACAGCAGAA GGCGAGCTCG TAGAAGCAGA AGCCTATGAT GTCGGTTTCA GAAAAGTTGA AATCCGCAAC GGTCTGCTGC TGCTGAACGG TCAACCGCTA CTGATTCGTG GCGTTAACCG CCATGAGCAT CATCCTCAGC ATGGTCAGGT CATGGATGAA GACACGATGC GGCGCGATAT TATGCTAATG AAACAGCATA ATTTTAACGC CGTGCGTTGC TCACATTACC CCAACCATCC GCTGTGGTAT CGGCTGTGCG ATCGTTACGG CCTGTATGTT GTGGATGAGG CGAACATTGA GACGCACGGC ATGCAGCCAA TGAATCGTCT GTCAGACGAT CCCGTCTGGC TACCGGCCTA CAGCGAACGC GTGTCGAGGA TGGTACAACG CGACCGCAAT CATCCCTGCA TTATTATCTG GTCATTGGGG AACGAGTCCG GTTACGGCGC AAATCATGAC GCACTCTATC AGTGGATCAA GCGCCACGAT CCGACGCGTC CCGTGCATTA CGAAGGCGGC GGCGCGAACA GTCGTGCGAC CGACATTGTG TGCCCCATGT ACGCCCGCGT CGATGAAGAT CAGCCCTTCC CCAGCGTGCC CAAATGGTCG ATCACCAAAT GGATCAGCAT GCCGGACGAA CATCGGCCAC TTATCCTCTG CGAGTACGCA CACGCGATGG GTAACAGTCT GGGGGGATTT GCTCGCTACT GGCAGGCATT CCGTCAATAC CCTCGGTTAC AGGGCGGATT CATCTGGGAC TGGGTCGATC AGGCACTGAC CCGCCACGAC GAGCAAGGCA ATGCCTACTG GGCATACGGT GGAGACTTTG GCGACACACC TAACGATCGT CAGTTCTGTC TGGACGGCTT GCTGTTTCCC GATCGCACCC CGCATCCCAG CCTGTATGAA GCCCAGCGGG CACAGCAGCA TATTCAGTTC GACTGGCAGG CGGAGTCACC GTGTGAGCTT CACGTCACCA GCGAATACCT GTTTCGCCAT ACGGACAATG AGCAGTTGAA CTGGCGCATC ACGCTGGACG ATAAAATGCT CGCAGAAGGT TCGCTGCCGC TGTCTCTCGC TCCGCAGTCC ACCCAGACCC TCACGCTACT GGAGGCGCTT CCCGCCGTCG AACACACAGG GGAGCTTTGG CTCAATGTTG AAGTCGTACA GCCTAAAGCC ACCGCCTGGT CAGAGGCAAA CCACCGCTGC GCCTGGGATC AATGGCAACT CCCTGCGCCA CTTCATCTTC CCGATACGGC CAGTTCCGGG CAGAAGCAAC GCCCCGTACT GCGATCGTCT GATGAGCACT TCGATATCGT TCAGGGCGAA CAACGCTGGC ACTTTAACCG CCAGAGCGGC TGGCTGGAAC AGTGGTGGAC AGCCGACACG CCAGCGCTGT TAACCCCTTT GCAAGATCAG TTTGTTCGGG CACCGCTGGA TAACGACATC GGTATCAGCG AAGTTGATCG CATCGATCCG CACGCTTGGG CCGAACGTTG GAAATCAGCC GGTCTTTATC AATTGCAGAC GCAGTGTATA GCGATTCAAG CTGACCAACT CGACGATGCG GTGCAGGTTA CTACCGAGCA CGTATTCCGC CATGCCGGAC AAATCTTGCT ACGGAGTAAA AAGCGCTGGC AGATTGATGT ACACGGCGTA ATGACTGTTG ATGTCGACGT TGATGTCGCG ACGGTACTCC CATCATTAGC CAGAGTGGGC TTGAGCTGCC AGCTAGCCGA CGTGGCACCT CAGGTCAGTT GGGCCGGGCT TGGGCCACAT GAGAATTACC CAGACCGACA GTTGGCTGCA CAGCATGGAC ACTGGAGTCT GCCGCTGGAT GACCTTCATA CGCCCTACAT TTTCCCGTCA GAAAACGGAC TGCGCTGCAA CACTCGCACG CTAACGTACG GCAAATGGAC AATCACCGGA AATTTCCACT TCGGGTTAAG TCGTTATGGG TTAACACAGT TAATGACCTG TACTCACCAC CATTTATTAG AAAAGGAAGA AGGTGTTTGG CTCAATCTGG ATGGCTTCCA TATGGGCATT GGCGGCGATG ATTCCTGGAG TCCCAGCGTT CATCGCGATG ATTTACTCAC GGCGACACAT TATCACTACC GCGTCGCGCT TCAACATCAC CAACCGTATT GA
|
Protein sequence | MSDTVLPHPV RHRATLQEIL SRRDWENPTC THYQRLPAHP PFNSWRSVAT AQQDEPSQRL RRLNGEWTFS YFTRPEAVPE SWLQQDLPDA DTIPVPSNWQ LQGYDAPIYT NVKYPIPVTP PYVQKDNPTG CYSLTFKINH DWISNGQTRI IFDGVNSAFY LWCNGHWVGY SQDSRLPAEF DIGRYLTTGE NRLAVMVLRW SDGSYLEDQD MWRMSGIFRD VTLLHKPTVH LRDIQLTTPL SADFRHGTLE IQVRATLTEA ESKSHRVRAQ LWRGNKLIGD TRQAFGSDIV DERGAYHDKA FLRIDVPQPD LWSAELPHLY RTVISLETAE GELVEAEAYD VGFRKVEIRN GLLLLNGQPL LIRGVNRHEH HPQHGQVMDE DTMRRDIMLM KQHNFNAVRC SHYPNHPLWY RLCDRYGLYV VDEANIETHG MQPMNRLSDD PVWLPAYSER VSRMVQRDRN HPCIIIWSLG NESGYGANHD ALYQWIKRHD PTRPVHYEGG GANSRATDIV CPMYARVDED QPFPSVPKWS ITKWISMPDE HRPLILCEYA HAMGNSLGGF ARYWQAFRQY PRLQGGFIWD WVDQALTRHD EQGNAYWAYG GDFGDTPNDR QFCLDGLLFP DRTPHPSLYE AQRAQQHIQF DWQAESPCEL HVTSEYLFRH TDNEQLNWRI TLDDKMLAEG SLPLSLAPQS TQTLTLLEAL PAVEHTGELW LNVEVVQPKA TAWSEANHRC AWDQWQLPAP LHLPDTASSG QKQRPVLRSS DEHFDIVQGE QRWHFNRQSG WLEQWWTADT PALLTPLQDQ FVRAPLDNDI GISEVDRIDP HAWAERWKSA GLYQLQTQCI AIQADQLDDA VQVTTEHVFR HAGQILLRSK KRWQIDVHGV MTVDVDVDVA TVLPSLARVG LSCQLADVAP QVSWAGLGPH ENYPDRQLAA QHGHWSLPLD DLHTPYIFPS ENGLRCNTRT LTYGKWTITG NFHFGLSRYG LTQLMTCTHH HLLEKEEGVW LNLDGFHMGI GGDDSWSPSV HRDDLLTATH YHYRVALQHH QPY
|
| |