Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dd1591_2733 |
Symbol | lacZ |
ID | 8119175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dickeya zeae Ech1591 |
Kingdom | Bacteria |
Replicon accession | NC_012912 |
Strand | + |
Start bp | 3099483 |
End bp | 3102593 |
Gene Length | 3111 bp |
Protein Length | 1036 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644853104 |
Product | beta-D-galactosidase |
Protein accession | YP_003005037 |
Protein GI | 251790316 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000508239 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTACTG CTTCCGCTTA TGCTCCGCTA TCCGGCCTCT CGCTGGCGGA TATTCTGGCC AGACGCGATT GGGAAAACCC GGCCTGCCCC CACATCCGCC GGCTGGATGC GCATCCGCCG TTTTCCAGTT GGCGCAACCT CAACGCCGCA CGAGACGATC AACCTTCCGA CCGACGCCAG ATGCTGAATG GTGAATGGAC CTTCAGCTAT TTCTCACGTC CGGAGGCCGT ACCGGAACAG TGGCTGACAC AGGATCTGAC CGATGCGAAC CCGCTCGCGG TGCCGTCCAA CTGGCAGCTG GCGGGATACG ATGCGCCGAT TTACACCAAT ATCAAATACC CGATACCGGT CAATCCACCG TTCGTACCGC AAGACAATCC CACCGGTTGT TACTCGCTCA CATTCTCAGT CAACGGTGAC TGGCTGACTC AGGGCCAGAC CCGTATCGTG TTTGACGGCG TTAACTCCGC CTTCCATCTG TGGTGCAACG GGCAATGGGT CGGCTATTCC CAGGACAGCC GGCTACCGGC GGAATTCGAT CTGACATCCT GCCTGCAACC GGGAGAAAAC CGGCTGGCGG TGATGGTGCT GCGCTGGTCT GACGGCACCT ATCTGGAAGA TCAGGACATG TGGCGCATGA GCGGCATTTA TCGTGATGTC TACCTGTTGC ATAAACCGGC GGTACACCTG CGTGATGTGC AACTCACCAC CCCGCTACGG CATAGCTATA CCCAAGGTAC GCTGTGCGTC ACCGCGCTGG CCAACCTGCC GGAAGATCAG GCTCAGGCGT GGCAACTGGC GGTGCAATTG TGGCGCGGCG GGCAACTGGT GGGAGAACAG CGCGCCCCCT TCGGCACGCC GGCTATTGAT GAACGCGGCG CCTATCACGA CAGGGTGAGC CTGCAACTGG AGGTGGCGCA GCCCGCTTTA TGGAGTGCGG AAGAGCCCAA CCTGTATCGC GCAGTGGTGG CGCTGGAGTA CGACGGCACG CTGGTGGAAG CGGAAGCCTA TGACGTCGGT TTCCGCGAAG TCGCCATCCG TAACGGGCTA CTGTTGCTCA ACGGTCAACC GCTGTTGATT CGCGGCACTA ACCGCCATGA ACACCATCCG CAGTATGGTC AGGCGATTGA TGAAGCGACC ATGCGGCAGG ACATTCTGCT GATGAAGCAG CACAACTTCA ACGCGGTGCG CTGCTCCCAC TATCCCAATC ATCCATTATG GTATCGGCTG TGTGACCGCT ACGGTCTATA CGTAGTGGAC GAGGCCAACA TCGAAACCCA CGGTATGCAG CCGATGAGCC GCCTGTCGGA CGACCCGCGC TGGCTACCGG CTTATGCCGA ACGCGTCACC CGCATGGTAC AGCGGGATCG CAACCACCCC TGCATTATTA TCTGGTCGCT GGGCAATGAG TCTGGCTACG GCCCTACCCA CAGCGCGCTC TACCAGTGGG TAAAGCAGCA GGATCCAACC CGACCGGTGC AGTACGAAGG CGGCGGCGCC GACACGCCCG CCACCGATAT TTTGTGCCCG ATGTATGCCC GTGTTGATCA GGACCAACCG TTCCCGGCGG TGCCGAAGTG GTCAATCAAA AAGTGGATTG GGCTGCCGGG AGAACATCGC CCGCTCATTT TGTGCGAATA CGCACACGCT ATGGGTAACA GTTTCGGCGG GTTTGACCGT TACTGGCAGG CATTCCGTCA ATATCCGCGC TTGCAGGGCG GTTTCGTCTG GGACTGGGTG GATCAGGCGC TGACACGTGA GCAGGATGGC AAAACGCACT GGGCCTACGG CGGCGACTTC GGTGATAAAC CCAACGACCG ACAGTTCTGC CTCAACGGTC TGGTGTTCCC CGATCGCACG CCGCACCCGG CATTGTACGA AGCTCAGCGC GCCCAGCAGT TTTTCCAGTT CAACCACCAT GAGAATGCCC CACTGACGCT GACCATCACC AGCGAATACC TATTCCGGCG GAGCGACAAT GAGGAACTGC ACTGGCGCAT TATGCAGGAT GATGTACAAC TGGCATCAGG CCGTGTACCG CTCGATATTA CCCCACAGGG TTGCCAGACC CTCACTCTGC TTGAGCAACT GCCTGCGCCG CAACACCACG CCGACATGTG GCTGACGGTA GAGGTGATTC AGCCGAACGC GACCGACTGG TCGCCTGCCG GTCACCGCTG CGCCTGGGAT CAATGGCAGT TACCGATGCC GCTGGCACGC CCAACGCCAC GCCGTGACGG TAGCGAACGT CCGACGCTGA CCCAGAATGA CGACCAGTTC GAGGTTATCC ACGGCCAACA GCGCTGGGCG TTCAATCGCC ATAACGGGTT GCTGACCCAG TGGTGGCGCG ATGGGCAAGC TCAGTTACTC AGCCCGCTAC AGGACAACCT GGCGCGCGCG CCACTGGATA ACGACATCGG CATCAGTGAA GTCGACCGTA TTGACCCGAA TGCCTGGGTA GAACGCTGGA AACTGGCCGG GCTGTATCGG TACGACACCG ACTGTCGGCA TATTCATGCC GATACCCTCA GCGACGGCGT GCTGATCACC ACCGAACACA TCGGCCATTA TCAGCAACAG GTGCTGTTCA TCAGCCGTAA ACAGTGGCGC ATTGACGCGC AAGGCGTGCT GACGGTAAGC GTGGAAGTGG ACGTCGCGCG TCACCTGCCT TCACTGGCGC GCGTCGGCCT CAGTATGCAA CTGGCGGCGG TCACCCCACA GGTGAGCTGG CTGGGGCTGG GCCCGCACGA AAACTATCCT GACCGACGCC TTGCCGCGCT GCACGGTCGC TGGCAACAAC CGCTGGAAGC GATGCACACG CCATACATTT TTCCATCGGA AAACGGCTTG CGTTGCCACA CTCGGGAACT GCGCTACGGC GACTGGCTTA TCGAGGGCGA TTTCCATTTC GGTATCGGCC GCTACAGCCG GCAGCAACTG ATGGATTGCA CTCATCACCA TCTATTGCAG CCGGAACCGG GCACCTGGCT CAATCTGGAC GGCTTCCACA TGGGCATCGG CGGTGACGAT TCCTGGAGCC CCAGCGTTGC GCCGGACTTC CTGCTAACCG CCCCTCGCTA CCGTTACCAA TTGCAACTGC AGTTACGATA A
|
Protein sequence | MSTASAYAPL SGLSLADILA RRDWENPACP HIRRLDAHPP FSSWRNLNAA RDDQPSDRRQ MLNGEWTFSY FSRPEAVPEQ WLTQDLTDAN PLAVPSNWQL AGYDAPIYTN IKYPIPVNPP FVPQDNPTGC YSLTFSVNGD WLTQGQTRIV FDGVNSAFHL WCNGQWVGYS QDSRLPAEFD LTSCLQPGEN RLAVMVLRWS DGTYLEDQDM WRMSGIYRDV YLLHKPAVHL RDVQLTTPLR HSYTQGTLCV TALANLPEDQ AQAWQLAVQL WRGGQLVGEQ RAPFGTPAID ERGAYHDRVS LQLEVAQPAL WSAEEPNLYR AVVALEYDGT LVEAEAYDVG FREVAIRNGL LLLNGQPLLI RGTNRHEHHP QYGQAIDEAT MRQDILLMKQ HNFNAVRCSH YPNHPLWYRL CDRYGLYVVD EANIETHGMQ PMSRLSDDPR WLPAYAERVT RMVQRDRNHP CIIIWSLGNE SGYGPTHSAL YQWVKQQDPT RPVQYEGGGA DTPATDILCP MYARVDQDQP FPAVPKWSIK KWIGLPGEHR PLILCEYAHA MGNSFGGFDR YWQAFRQYPR LQGGFVWDWV DQALTREQDG KTHWAYGGDF GDKPNDRQFC LNGLVFPDRT PHPALYEAQR AQQFFQFNHH ENAPLTLTIT SEYLFRRSDN EELHWRIMQD DVQLASGRVP LDITPQGCQT LTLLEQLPAP QHHADMWLTV EVIQPNATDW SPAGHRCAWD QWQLPMPLAR PTPRRDGSER PTLTQNDDQF EVIHGQQRWA FNRHNGLLTQ WWRDGQAQLL SPLQDNLARA PLDNDIGISE VDRIDPNAWV ERWKLAGLYR YDTDCRHIHA DTLSDGVLIT TEHIGHYQQQ VLFISRKQWR IDAQGVLTVS VEVDVARHLP SLARVGLSMQ LAAVTPQVSW LGLGPHENYP DRRLAALHGR WQQPLEAMHT PYIFPSENGL RCHTRELRYG DWLIEGDFHF GIGRYSRQQL MDCTHHHLLQ PEPGTWLNLD GFHMGIGGDD SWSPSVAPDF LLTAPRYRYQ LQLQLR
|
| |