Gene Information |
Locus tag | Arth_3149 |
Symbol | |
ID | 4444262 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3533515 |
End bp | 3535506 |
Gene Length | 1992 bp |
Protein Length | 663 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639690975 |
Product | Beta-glucosidase |
Protein accession | YP_832627 |
Protein GI | 116671694 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
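The coordinate and length fields above are mutually consistent, and the check is simple enough to sketch. The snippet below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. Both are common conventions but are not stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 3533515
END_BP = 3535506
GENE_LENGTH_BP = 1992
PROTEIN_LENGTH_AA = 663


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Both checks hold for this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

With these assumptions, 3535506 - 3533515 + 1 gives 1992 bp, and 1992 / 3 - 1 gives 663 aa, matching the table.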
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.167157 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGACTTCAC AGACCCGCAC CAGCCCGCGC CCCGGAAACC CGCGCCCCGG AAACCCCCGT CCCGGAAACT CCGGCAACAG CACCCAAACC GCCGGCACCG GCCCCGCCAC CTCCGTGGCG GCCGACGGCA CACGGTACCG GGACCTCAAC GGCAACGGGA TCATGGATCC CTTCGAGAAC CCTGGACTCA GCCCGCACGA ACGCGCAGCC GACCTCGTGG CCCGGCTCAG CCTCGAAGAA AAGGCCGGGC TGATGTTCCA CACCGTGATT GAGACCGGAC CGGGCGGATC CCTGCTCGAG ACTCCCGGAA ACATCAGCAA GTCGCCCACC AGCACGGTGA TCCTGGGCAA GTTCATGAAC CACTTCAACA TTCACGCCTT GGGCACTGCC CGGGAGGCCG CGATGTGGAG CAACGCCCTG CAACAGCTTG CGGCACAGAC CCCGCACGGC ATTCCGGTCA CGATCTCCAC GGACCCCCGC CACGCCTTCA TCGAGAACTC CGGCGTATCG TTCACCGCCG CCCACTTTTC CCAGTGGCCC GAGCCGATCG GCCTGGCCGC CGTCGGCAGT GCCGAGCTAA TCCGCCGCTT CGCCGAGATC GCCCGCACGG AGTACACCGC CGTCGGCATC CGTGCCGCCC TGCACCCGAC CGTCGACCTC GCCACAGAGC CACGCTGGTG CCGGCAGGCC GGAACCTTCG GGCAGGACTC CGAGCTCAGC TCGAAGTACG TCGTCGAGTA CCTTCAAGGA TTCCAGGGCG ACGAGCTGGG TCCGGACAGC GTTGCCTGCA CCACCAAGCA CTTCCCGGGC GGCGGTCCGC AGCGCGACGG CGAAGACGCA CACTTCCCCT ACGGCCGGGA ACAGGTCTAC CCCGGCGGCC GCTTCGACGA GCACCTGGCT CCGTTCCGGG CGGCCATCGC GGAAAAGACC AGCGCCATCA TGCCCTATTA CGGCATGCCG ATCGGCGTCG AGCTGGACGG CGAGCCGGTG GAGGAAGTCG GCTTCGGCTA CAACAGGCAG ATCATCACCA ACCTGCTCCG GGACAAGCTC GGCTACGACG GCGTGGTCCT CAGCGACTGG GAGCTGGTCA ACGACAACAT CGTTGGCGAG CAGGTGCTGC CGGCGAGGGC CTGGGGCGTT GAAGAGCTCA CCGCGCCCGA GCGGATGCTG AAGATCCTCA ACGCGGGCGT GGACCAGTTC GGCGGCGAGG AATGCACCGA GCTGCTCCTT GGCCTGGTCC GGGACGGCCT GGTCAGCGAG GAACGAATCG ATGAGTCCGC GCGCCGCCTG CTGCTGGTCA AGTTCCAGCT CGGCCTGTTC GACAACCCCT TCGTGGACGA GGACGCAGCG GCCGAAATCG TTGGCAACCC CGAGTTCCGG CGGGAGGGCC ACCGCGCGCA GGCAACGTCC GTCACGGTGC TGGCCAACGG AACGCACGAC GGCGGGACGG CCCTGCCGCT GACCGGCTCA CCCGCCGTCT ACGTCGAAGG AATGGACCCG CTGAGTTTCG ACGGTTTCGG GACTGTGGTG CAGGACCCGG ATCAGGCCGA TGTTGCCGTG ATCCGGCTGC ACTCCCCCTG GGACCACCGC GACGACCTGT TCCTGGAACA GCACTTCCAC GCCGGAAGCC TGGACTTCCC GCCCGGGCTG GTCTCCCGGC TCCGGACCCT GGCCGCGAAG GTTCCGCTGA TCATCGACGT GCGGTTGGAC CGTCCAGCCA TCCTCACCCC GTTGGCGGGG TTCGCTGCGG CCCTGGTGGG CACCTTCGGC GTCTCGGACA CCGCCCTGCT CGATGCCCTG 
TTCGGACGCA TTGAGCCGCA GGGCAGCCTT CCCTTCGATA TTCCCAGGTC CATGGACGCC GTACGTGGTT CACGCTCGGA TGTCCCCGGA GACACCGCCG ATCCCCTCTT CCGGTTCGGG CACGGCCTGC GGCTACCCGC GGCGCCCAAC GCCGGAACCG CCGCTGCTAT CCAGTCAGGA CTGAGCTCAT GA
|
Protein sequence | MTSQTRTSPR PGNPRPGNPR PGNSGNSTQT AGTGPATSVA ADGTRYRDLN GNGIMDPFEN PGLSPHERAA DLVARLSLEE KAGLMFHTVI ETGPGGSLLE TPGNISKSPT STVILGKFMN HFNIHALGTA REAAMWSNAL QQLAAQTPHG IPVTISTDPR HAFIENSGVS FTAAHFSQWP EPIGLAAVGS AELIRRFAEI ARTEYTAVGI RAALHPTVDL ATEPRWCRQA GTFGQDSELS SKYVVEYLQG FQGDELGPDS VACTTKHFPG GGPQRDGEDA HFPYGREQVY PGGRFDEHLA PFRAAIAEKT SAIMPYYGMP IGVELDGEPV EEVGFGYNRQ IITNLLRDKL GYDGVVLSDW ELVNDNIVGE QVLPARAWGV EELTAPERML KILNAGVDQF GGEECTELLL GLVRDGLVSE ERIDESARRL LLVKFQLGLF DNPFVDEDAA AEIVGNPEFR REGHRAQATS VTVLANGTHD GGTALPLTGS PAVYVEGMDP LSFDGFGTVV QDPDQADVAV IRLHSPWDHR DDLFLEQHFH AGSLDFPPGL VSRLRTLAAK VPLIIDVRLD RPAILTPLAG FAAALVGTFG VSDTALLDAL FGRIEPQGSL PFDIPRSMDA VRGSRSDVPG DTADPLFRFG HGLRLPAAPN AGTAAAIQSG LSS
|
| |
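The gene sequence begins with TTG, yet the protein begins with M. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), in which TTG can act as an initiation codon and is then read as methionine. The sketch below shows that relationship using only the standard library. The amino acid string and the set of start codons are written from memory of the NCBI table 11 definition and should be treated as assumptions. `translate_cds` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons assumed for table 11; any of them is read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still initiate with Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGACTTCAC AGACCCGCAC CAGCCCGCGC"))  # MTSQTRTSPR
```

The printed prefix matches the first ten residues of the protein sequence in the record. Applying the same function to the full gene sequence should stop at the terminal TGA codon.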