Gene Information |
Locus tag | Arth_1891 |
Symbol | |
ID | 4445580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 2126563 |
End bp | 2128014 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639689703 |
Product | glycoside hydrolase family protein |
Protein accession | YP_831375 |
Protein GI | 116670442 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.001415 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACTGC CTAAAGGATT CCGATGGGGC GGTGCCATCG CCGCAAATCA GGTCGAGGGC GCCTGGCGCG AGGGAGGCCG CGGTGCCGCC GTCTCCGACG TCGCAACCTA CAAGCCCGAC GCCGACCCCA AGGACTACGC GATCCACCAC CAGATCACCG TGGAGAGCAT CAACGCAGCC TTAGCGGATG ACGATGAACG GCTGTTTCCG AAGCGACGCG GCATCGACTT TTATCACCGG TACCCGGGGG ACCTCGCATT ATTCGCCGAG ATGGGCTTTA CAACCTTGCG AGTCTCGATC TCCTGGACCC GGCTCTATCC CACCGGAGAG GAGCTCGAAC CCCAGGCCGA CGGGGTCGCC TTCTACAAGG CGTTGTTCAC TGAGATGCGG CGCCTGAACA TCGAACCCTT GGTGACCCTC TCGCACTACG ACCCACCGAT AGCCCTCGCG CTCAAGCACA ACGGCTGGGT CGCGCGCCGC ACAATCGCGT TATTCGAGCG CTTCGCCCGC ACCTGCTTCA GTGAGTTCGG CGACCTGGTG AACATGTGGC TTACCTTCAA CGAGATCGAC GGCATCATCC GTCACCCATT CACCTCCGGT GGCATCATCG ACGAAACCGT TGAGGGCAGC CTCGAGCAAG CCTGCTACAG CGCACTGCAC CACCAGTTCG TGGCTGCAGC ATCGGTCACC AAAATGCTTC GCGAGATCTC ACCAGGGGCG CAGATGGGTT GCATGCTCAC CATGCTCATG ACATACCCGA ATACCTGTCG TCCCGAAGAC GTCGCCGCCA CGCAAGCGAA AGAGAGGCTG CTCTATCTAT GCACTGACGT GCAGGCCGGA GGCGGCTACC CGCGGCTAGC CCTGCGGGCG CTCGAGCTTC GCGGCGTAAC CATCCCCTTC CTTGACGGCG ACACCAAACT GCTCGCCGAA AATCCAGTCG ACTTCATCTC GTTCAGCTAC TACAACTCGA TGACCGAATC GGTGCGCCCG GATGCCGAGC GCACACCAGG GAACACCGTG CTCGGGGTGA AGAACCCGTT CCTCGATTCG AGCGAATGGG GATGGCAGAT CGACCCGGTC GGCCTCCGGA TCGCGCTGAT CGACCTCTAC GACCGTTACG GCAAACCGTT GTTCATCGTG GAGAACGGCC TGGGTATGCG CGACGAGCTG ACCGCCGAAG GCAAAATCCA CGACCCCTAC CGTATCGGCT ACTTCCGCGC GCATTTCCAG CAGATGATCC AGGCCGTCGA TGAGGGCGTG GAACTCATGG GCTACGTCAG CTGGGCGCCC ATCGACCTCA TCAGTTCGTC AAGCTCACAA ATCTCGAAGA GATACGGCTT CATCTACGTC GATCAAGACG ACCTCGGCCA AGGAAGCGGA GACCGTTACC GGAAGGACTC CTTCTTCTGG TACCAGAAGG TCATCGCGTC GAACGGCGCC GACCTGGAAT GA
Protein sequence | MSLPKGFRWG GAIAANQVEG AWREGGRGAA VSDVATYKPD ADPKDYAIHH QITVESINAA LADDDERLFP KRRGIDFYHR YPGDLALFAE MGFTTLRVSI SWTRLYPTGE ELEPQADGVA FYKALFTEMR RLNIEPLVTL SHYDPPIALA LKHNGWVARR TIALFERFAR TCFSEFGDLV NMWLTFNEID GIIRHPFTSG GIIDETVEGS LEQACYSALH HQFVAAASVT KMLREISPGA QMGCMLTMLM TYPNTCRPED VAATQAKERL LYLCTDVQAG GGYPRLALRA LELRGVTIPF LDGDTKLLAE NPVDFISFSY YNSMTESVRP DAERTPGNTV LGVKNPFLDS SEWGWQIDPV GLRIALIDLY DRYGKPLFIV ENGLGMRDEL TAEGKIHDPY RIGYFRAHFQ QMIQAVDEGV ELMGYVSWAP IDLISSSSSQ ISKRYGFIYV DQDDLGQGSG DRYRKDSFFW YQKVIASNGA DLE
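
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is a minimal illustration and is not part of the source database. The helper names (gene_length, expected_protein_length, gc_percent) are hypothetical, and it assumes inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming inclusive 1-based coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percent G+C, ignoring the spaces that group the sequence into blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Values copied from the record above (Start bp, End bp, Gene Length, Protein Length).
assert gene_length(2126563, 2128014) == 1452
assert expected_protein_length(1452) == 483
```

The value returned by gc_percent for the Gene sequence field can be compared with the reported GC content of 61%, allowing for rounding in the record.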