Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_3719 |
Symbol | |
ID | 7295204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | - |
Start bp | 4145822 |
End bp | 4148836 |
Gene Length | 3015 bp |
Protein Length | 1004 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643592128 |
Product | glycoside hydrolase family 2 TIM barrel |
Protein accession | YP_002489763 |
Protein GI | 220914454 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 129 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGTCC AGTCCCCCAC AACGGTGTCG AATGCTGCTG CAAGTGATGC CTCCGACTCT GTGCCGCGCA ATCTTGAACT CGGCGCAGGA GGCGTTCTCG AGCTCTCCTC AGTGGCTCCG GGGCGGGGTG CGCTTGCTCC CCGGGCCTAT CTCGCCAGCG ATGCGCCGCG GCTGTCCCTG AACGGCGACT GGCAGTTCCG CCTCAGCCCG GGCATCCGCA GCGCCCCCGC CGAGGGGTGG CAGCTGGGCC AGGATCTGGA CGGCTTCGAA ACCCTGCCCG TTCCGTCCAG CTGGCCCATG CACGGGCACG GCGCACCGGC GTACACCAAC ATCCAGTTTC CGTTTGCCGT GGAACCCCCG CATGTGCCGG AAGCCAACCC TATTGGTGAC CACCTCCTGA CGTTCCACGC CGGCCCGGAG TTCTTCCCCA ACGCACTGCT GCGTTTCGAC GGCATCGACT CCGCCGGCAC GGTGTGGCTG AACGGCGTCG AACTCGGCAC CACCCGCGGC AGCCGCCTTG CCCACGAATT CGACGTCTCA GGAATCCTTA CGGAAGGCAG CAACACCCTC GCCGTGCGGG TGGCGCAGTT CTCGGCCGCC AGCTACGTGG AGGACCAGGA CATGTGGTGG CTCCCGGGGA TCTTCCGGGA CGTCACCCTT CAAGCCCGCC CTGCGGACGG CATAGACGAC GTCTTCGTCC ACGCTGACTA CGACGCCGGC ACAGGCGAAG GTGTGCTGCG CGTGGAGGCG AGCCGGGGCG GAAAAGCGAT TGACGCCGTC GTACGCGTTC CCGAACTGGA CCTGGAGCTT AAGGCCGGCC AGGAGCACCG CCTGCCATCC GTAGCGCCGT GGTCGGCAGA GGAGCCCCGC CTGTACGAAG CCACGGTCAG CACCCCGGCC GAAACGGTGT CCTTGCAGCT GGGATTCCGC AGCATCACCA TCGAAGACTC CCAGTTCAAG GTGAACGGGC GGCGGATCCT GCTGCGCGGG GTGAACCGCC ACGAGCACCA TCCCCGGCTG GGGCGCGTGG TTCCGCAGGA GGTGATGGAG GCGGAGCTCA AGCTCATGAA GCAGCACAAC ATCAACGCAA TCCGCACATC GCACTACCCG CCCCACCCAA AATTCCTTGC CCTGGCTGAC CAGCTTGGTT TCTACGTCGT GCTCGAGTGC GACCTGGAAA CGCACGGGTT CGAGAGCGCC GGCTGGGCCC AGAACCCGAG CGATGATCCC CAATGGGAGG ACGCCCTGCT GGACCGGATG CGCCGGACGG TGGAGCGGGA CAAGAACCAT GCAGCCGTCG TGATGTGGTC GCTGGGCAAC GAATCCGGGA CCGGCCGGAA CCTGGCCGCG ATGTCGCGCT GGACCAAGGA CCGCGACCCG TCCCGCCCCA TCCACTACGA GGGTGACTGG TCCTCACCCT ACGTTGACGT GTACTCCCGC ATGTACGCCA ACCAGGCCGA GACTGCACTG ATTGGACAGG GAACTGAGCC TCCACTCGAT GACCCCGAGC TGGATGCCCG GCGCCGTGCC ATGCCCTTCG TGCTGTGCGA ATATGTTCAC GCCATGGGCA ATGGCCCGGG AGGAATGACC GAGTACCAGG AACTCTTTGA ACGCCACCCC AGGCTGATGG GCGGCTTCGT CTGGGAGTGG CTGGAACACG GAATCGCCGT CCCCACGCCG GGTGGCGGTG AGCACTTCGC CTATGGCGGT GATTTCGGTG AGGAAATTCA CGACGGCAAC TTCGTCACCG ACGGACTGGT GGACGCCAAT CGCCAACCCC GGCCCGGCCT CCTCGACTTC AAGCGCGTCG TGGCGCCGCT GCGGATCCAG GTTGCCGGGG ACTGGTCCGG CTTCACCGTC CACAACGGAC ACGATTTCAC CGACACGTCC CCGTTCAGCT TCCAGTACAC GGTGGAGGCC GACGGCGACA CTATCGACGG CGGCACGGTG GAGGTTCCGC GCGTCGCGCC GCAGTCCGCG GCCACAGTTG CGTTGCCCGC CGGCCTTCCG CACACGGACG GGCCGGCAGT CCTCACCGTC AGCGCCGTTC TCACCGCTGC GACGTCCTGG GCCGGCACCG GCCACGAGAT CGCCTGGGGC CAGTCCGTCC GTGGCGTCGC GCGTGTTGAA GAGCCCCGCG CTGCTGAATC CGTGGCCGCC ACTGACGATG AGCTGCGGCT GGGCCCGGCG GTGTTCAGCA GGCTGACGGG GATGCCCACA CGCATAGGCG GCATCAGTGT GGCAAAGCTG GGGCTCTCGC TGTGGTGGCC CCCCACCGAC AATGACCTGG GCAGGGAATG GGGCGGCGCC GACCTCCGGC CCCTGGCCAC CCAATGGAAG GATGCAGGGC TTGACCGGCT CCATACCCGG CTGCTTGGCA TTAGCGCTGA ACGCACCGAC GATGGCGGGG AGCGGCTCGT CACGAGGACC AGGGTGGGTG CCGCGGACAA ACAGTTCGGG GTCCTTGTGG ACTACACGTG GACCAGTGAC GGGGAATCGC TGGCGCTCCG AACCCAGGTC CGGCCGCAGG GATCCTGGGT CAACGCCGGA TTTGAGGTGG AGTGGGCCCG GATCGGCCTG GAACTGGTGC TGGACGGCGG GACGTCAGCG GTGAGCTGGT TCGGGCAGGG ACCGCACCAG GCTTATCCGG ATACGGGGCA AGGCACGCGG ACGGGATGGT TTTCCATGCC CTTGGGTGAC CTCGATGTTG AGTATGCGCG GCCCCAGGAG TCCGGTGCCC GCGCCGGTGT CCGCTCGGCT GCCCTTGAGG TGGATGGCGC CACGTTGGAC ATCGCCGGCG AGCCCTTTGC CCTGACGGTA CGCCCCTACA GCCAGGCCGC ACTCAACGAA GCAACCCACC GGCCTGACCT GGCAGCCGAC GGGCGGACCT ACATCTACCT CGACAACGCC ATGCGTGGTG TGGGCACCGG AGCCTGCGGT CCGGGTGTCC TCGATCCGTA CCGCCTGCAG CCACGGGATG CCGACTTCAC GGTAGTGCTG CGGGTCCGTC CGTAG
|
Protein sequence | MPVQSPTTVS NAAASDASDS VPRNLELGAG GVLELSSVAP GRGALAPRAY LASDAPRLSL NGDWQFRLSP GIRSAPAEGW QLGQDLDGFE TLPVPSSWPM HGHGAPAYTN IQFPFAVEPP HVPEANPIGD HLLTFHAGPE FFPNALLRFD GIDSAGTVWL NGVELGTTRG SRLAHEFDVS GILTEGSNTL AVRVAQFSAA SYVEDQDMWW LPGIFRDVTL QARPADGIDD VFVHADYDAG TGEGVLRVEA SRGGKAIDAV VRVPELDLEL KAGQEHRLPS VAPWSAEEPR LYEATVSTPA ETVSLQLGFR SITIEDSQFK VNGRRILLRG VNRHEHHPRL GRVVPQEVME AELKLMKQHN INAIRTSHYP PHPKFLALAD QLGFYVVLEC DLETHGFESA GWAQNPSDDP QWEDALLDRM RRTVERDKNH AAVVMWSLGN ESGTGRNLAA MSRWTKDRDP SRPIHYEGDW SSPYVDVYSR MYANQAETAL IGQGTEPPLD DPELDARRRA MPFVLCEYVH AMGNGPGGMT EYQELFERHP RLMGGFVWEW LEHGIAVPTP GGGEHFAYGG DFGEEIHDGN FVTDGLVDAN RQPRPGLLDF KRVVAPLRIQ VAGDWSGFTV HNGHDFTDTS PFSFQYTVEA DGDTIDGGTV EVPRVAPQSA ATVALPAGLP HTDGPAVLTV SAVLTAATSW AGTGHEIAWG QSVRGVARVE EPRAAESVAA TDDELRLGPA VFSRLTGMPT RIGGISVAKL GLSLWWPPTD NDLGREWGGA DLRPLATQWK DAGLDRLHTR LLGISAERTD DGGERLVTRT RVGAADKQFG VLVDYTWTSD GESLALRTQV RPQGSWVNAG FEVEWARIGL ELVLDGGTSA VSWFGQGPHQ AYPDTGQGTR TGWFSMPLGD LDVEYARPQE SGARAGVRSA ALEVDGATLD IAGEPFALTV RPYSQAALNE ATHRPDLAAD GRTYIYLDNA MRGVGTGACG PGVLDPYRLQ PRDADFTVVL RVRP
|
| |