Gene Information |
Locus tag | Arth_3445 |
Symbol | |
ID | 4444175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 3879296 |
End bp | 3880795 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639691269 |
Product | beta-galactosidase |
Protein accession | YP_832920 |
Protein GI | 116671987 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | [TIGR03356] beta-galactosidase |
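
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the function names `gene_length` and `expected_protein_length` are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS that ends with a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length_bp = gene_length(3879296, 3880795)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)
```

Under these assumptions the result agrees with the listed Gene Length (1500 bp) and Protein Length (499 aa).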
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCACT CCGCCAAAGA CCTGGCTGCC AGGATCCCGC CGTCCTTCAC GATGGGCGTG GCCACAGCCG CCTTCCAGAT TGAGGGAGCC CTCGACGAGG ACGGCCGCGG GCCCTCCGGG TGGGACGTGT TTGCCCGGAA GCCCGGCGCC ATAGTGGACG ACCACAGCCC TGTCACAGCC TGCGACCACT ACCACCGCAT GCCTGAAGAC GTTGCGCTGA TGAAGGAACT GGGGGTGGAT TCCTACCGGT TCTCGCTGTC CTGGTCCCGC ATCCAGCCCG GCGGCAGCGG GCCGGTCAAT CCGAAGGGCA TCGACTTCTA CGACCGGCTG CTGGACCAGC TGCTGGCCAG CGGCATCTCA CCCATGGTGA CGCTCTACCA CTGGGACACC CCGCTGCCGC TCGACGAGGC CGGCGGCTGG TTGAACCGGG ACACGGCGTA CCGGCTCGGC GAGTTTGCCT CTATTGCGGC TGCGGCCTTT GGCGACCGCG TGGCCCGCTG GGTCACCGTC AATGAGCCTG CCACCGTGAC CACCAACGGC TACGCCCTGG GACTCCACGC GCCGGGCGAA TCGCAGTTGC TCAAAGCGCT CCCCACAGTC CACCACCAGC TCCTCGGCCA CGGCCTGGCC ATGCAGGCAT TGCGGGCTGC CAACGTACCC GGCGAAATAG GCATCACCAA TGTCTATTCG CCCATGGTTC CGGCCTCGAT CAACCCGCTG GACAAGCTGA GCACGGCGCT GATGGACGTT TTCCAGAACA GGCTCTTCGC GGATCCGGTG CTCCTCGGAA AGTACCCTGA CGTCGTCCGG GCCGCCACGT TCTTCAGCTC GTTCAGCCCC TCCGATGAGG ACATGGACCT GATCTCGCAG CCGTTGGACT TCTACGGGCT CAACTACTAC ATGCCAACGC GGGTTGCGGC AGGTCCCGGC GACGGCGCCG TTCCCCCGGG GATGGCCGAG GCCATGGGGG ATGACCTCAG CGGCAGCGCT CCCGGCGCAC CGTTCCACAT CACTGAGTTT CCGGATGCCG AGACGACTGC GTACGGCTGG CCGATCAGGC CCGACTACAT GCCCGTGGCC CTAGCCGAGA TGGCCGAACG CTACCCGGAG CTGCCCCCCG TCTTCATCAC GGAAGGCGGA GCGAGCTTTG AGGATGTGGT GGTCCGGGAC AAGGCCGGGG ACAGGATCAT CATTCCGGAC GAGCGCCGCC TGCGCTATCT TGCCGAGCAC ATCTCCAGCG CGGTGGAGGC CACCAGCCCG GGGGGACCTG CTGAATCCAT TGACCTGCGT GGCTACTACG TATGGTCCCT TCTCGACAAC TTCGAGTGGT CCGCGGGCTA TAAACAGCCC TTCGGACTCC TGCATGTTGA CTTCGAGACG ATGGCCAGAA CGCCCAAGGC GTCCTACTAC TGGCTGCAGG AGCTGCAGGA AGCCCGCAAG GAAGCCGCGG CTGAGGCTGT TGGCGAGAGT GCAGCCGGGG AACCGGAGCC CGTAGCCTGA
|
Protein sequence | MKHSAKDLAA RIPPSFTMGV ATAAFQIEGA LDEDGRGPSG WDVFARKPGA IVDDHSPVTA CDHYHRMPED VALMKELGVD SYRFSLSWSR IQPGGSGPVN PKGIDFYDRL LDQLLASGIS PMVTLYHWDT PLPLDEAGGW LNRDTAYRLG EFASIAAAAF GDRVARWVTV NEPATVTTNG YALGLHAPGE SQLLKALPTV HHQLLGHGLA MQALRAANVP GEIGITNVYS PMVPASINPL DKLSTALMDV FQNRLFADPV LLGKYPDVVR AATFFSSFSP SDEDMDLISQ PLDFYGLNYY MPTRVAAGPG DGAVPPGMAE AMGDDLSGSA PGAPFHITEF PDAETTAYGW PIRPDYMPVA LAEMAERYPE LPPVFITEGG ASFEDVVVRD KAGDRIIIPD ERRLRYLAEH ISSAVEATSP GGPAESIDLR GYYVWSLLDN FEWSAGYKQP FGLLHVDFET MARTPKASYY WLQELQEARK EAAAEAVGES AAGEPEPVA
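
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is an illustrative example rather than the tool used to produce the record: the names `CODON_TABLE`, `clean`, and `translate` are hypothetical, and the sketch relies on translation table 11 assigning the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because the gene begins with ATG). Only the first and last few codons of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base order. Translation table 11
# uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the blank-separated 10-character grouping used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 21 nt of the gene sequence in the record.
head = translate("ATGAAGCACT CCGCCAAAGA CCTGGCTGCC")
tail = translate("GAACCGGAGC CCGTAGCCTG A")
print(head, tail)
```

The two fragments translate to `MKHSAKDLAA` and `EPEPVA`, which match the start and end of the protein sequence listed above. The gene sequence is given in coding orientation even though the gene lies on the minus strand, so no reverse complement is needed before translating.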