Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_3125 |
Symbol | |
ID | 7294605 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 3472060 |
End bp | 3474075 |
Gene Length | 2016 bp |
Protein Length | 671 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643591535 |
Product | Beta-galactosidase |
Protein accession | YP_002489175 |
Protein GI | 220913866 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1874] Beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.0401597 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACGC AGGAAATTAA CCGTCCGGCA AGTGTTTGGA GCAACGTCGA AGGCCTCGGT TTCGGCGGCG ACTACAACCC CGAACAGTGG CCCGTGAGCG TGCGGCTGGA GGACCTCGAG CTGATGCAGG AAGCCGGCGT GAACTTCCTG AGCGTGGGCA TCTTTTCCTG GGCCCTGCTG GAACCGGTTG AGGGCCAGTA CGATTTCGGC TGGCTTGATG AGGTGCTGGA CAACCTCGCA GGCATCGGCG TCAGGGTTGC CCTCGCCACC GCCACCGCCG CTCCCCCGGC GTGGCTGGTC CGCAAGCATC CGGAAATCCT GCCGGTCACC GCTGACGGAA CCGTGCTGGG ACCGGGCTCA CGGCGGCACT ACACGCCGTC GTCGGCCGCC TACCGGCGCT ACGCCACGGG CATCACGCGG GTCCTCGCCG AACGATACAA GGACCATCCG GCGCTGGCGC TCTGGCACGT GGACAACGAG CTGGGTTGCC ACATTTCCGA GTTCTACGGC AAAGAGGACG CAGCCGCTTT CCGCAGTTGG CTGGAACGGC GTTACGGGAG CATTGACGCC CTCAACGCGT CCTGGGGCAC CGCGTTCTGG AGCCAGAACT ACGCATCGTT CGAAGAAATC CTGCCGCCCT CAGTTGCCCC CAGCACGCTG AACCCGGGGC AGCAGCTCGA TTTCCAGCGC TTCAATTCGT GGGCGCTGAT GGATTACTAC CGCGAACTCG TGGCAGTGCT CCGCGAGGTG ACCCCGGAGG TTCCCTGCAC CACCAACCTC ATGGCCTCCA GCGCCACGAA GTCCATGGAC TACTTCAGCT GGGCCAAGGA TCTTGACGTC ATCGCCAACG ACCACTACCT CGTGGCTGCC GACCCCGAAC GGCACATCGA ACTCGCGTTC AGCGCGGACC TGACCCGGGG CATTGCGGGC GGTGACCCGT GGATCCTAAT GGAACATTCG ACGTCGGCCG TCAACTGGCA ACCCCGCAAC CAGCCGAAGA TGCCCGGCGA AATGCTCCGG AACTCGCTGG CGCACGTGGC CCGCGGCGCG GATGCCGTGA TGTTCTTCCA GTGGCGGCAG AGCTTCGCGG GGTCCGAAAA GTTCCACTCC GCCATGGTGC CGCACGGCGG CCGGGACACG CGCGTGTGGC GCGAGGTGGT GGACCTCGGG GCGGCGCTGA AGCTGCTCGA ACCGGTCCGC GGTTCCCGGG TGGAGTCCCG CGCGGCCATC GTCTTCGATT ACGAGGCGTG GTGGGCCAGC GAGATCGATT CCAAGCCGAG CATCGACGTG AAGTACCTGG ACCTGCTGCG GGCCTTCCAC CGTTCGCTGT TCCTGAAGGG CGTTTCCGTG GACATGGTCC ACCCGTCTGC GCCGCTCGAG GGCTACGACC TGGTGCTCGT CTGCACGCTC TACGCCGTAT CCGACGCCGA CGCCGGCAAC ATCGCCGCGG CCGCCTCCGC TGGCGCCACC GTCCTGGTCA GCTACTTCAG CGGGATCGCG GACCCGCAGG ACCACATTCG GCTCGGCGGG TATCCGGGCG CATTCCGGGA CCTCCTTGGC GTGCGGGTGG AAGAGTTCCA CCCGCTGCTG GCCGGATCGC AGCTGAAGCT CGACGACGGC ACCGTTGCCT CGGTCTGGAG CGAGCACGTG CACCTCGCCG GCGCCGAAGC GGTCCAGGCG TTCACGGAGT ATCCGCTGGA AGGCGTCCCG GCCCTGACCC GCCGCTCCGT GGGCGCCGGC GCTGCCTGGT ACCTGGCCAC CTTCCCGGAC AGCGACGGCA TTGATGCGTT GGTGGAACGG CTGCTCGCCG AATCGGGCGT CTCCCCCGCG GCTGCCGCCG ACACCGGCGT CGAACTGGTC CGTCGGCGCT CGGCCGATGG GCAGCGCTTC CTGTTCGCCA TCAACCACAC CCGCTCCTCC GCCGCCGTCT CGGCCACAGG AACCGATTTG CTGACCGGGG AGCCCTTTGG CGGGTCCGTT CCGGCGGGCA GCATTGCGGT GATCAGGGAG GGCTAG
|
Protein sequence | MATQEINRPA SVWSNVEGLG FGGDYNPEQW PVSVRLEDLE LMQEAGVNFL SVGIFSWALL EPVEGQYDFG WLDEVLDNLA GIGVRVALAT ATAAPPAWLV RKHPEILPVT ADGTVLGPGS RRHYTPSSAA YRRYATGITR VLAERYKDHP ALALWHVDNE LGCHISEFYG KEDAAAFRSW LERRYGSIDA LNASWGTAFW SQNYASFEEI LPPSVAPSTL NPGQQLDFQR FNSWALMDYY RELVAVLREV TPEVPCTTNL MASSATKSMD YFSWAKDLDV IANDHYLVAA DPERHIELAF SADLTRGIAG GDPWILMEHS TSAVNWQPRN QPKMPGEMLR NSLAHVARGA DAVMFFQWRQ SFAGSEKFHS AMVPHGGRDT RVWREVVDLG AALKLLEPVR GSRVESRAAI VFDYEAWWAS EIDSKPSIDV KYLDLLRAFH RSLFLKGVSV DMVHPSAPLE GYDLVLVCTL YAVSDADAGN IAAAASAGAT VLVSYFSGIA DPQDHIRLGG YPGAFRDLLG VRVEEFHPLL AGSQLKLDDG TVASVWSEHV HLAGAEAVQA FTEYPLEGVP ALTRRSVGAG AAWYLATFPD SDGIDALVER LLAESGVSPA AAADTGVELV RRRSADGQRF LFAINHTRSS AAVSATGTDL LTGEPFGGSV PAGSIAVIRE G
|
| |