Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2898 |
Symbol | |
ID | 4444420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3263830 |
End bp | 3265230 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639690721 |
Product | glycoside hydrolase family protein |
Protein accession | YP_832377 |
Protein GI | 116671444 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGCTCA TGATTGCCGG TGGCGGCGGA TTCCGTGTCC CGCTCGTGTA CCGGGCGCTC ACCTCCGGCC CCTTCGCGGG GCTGGTGCGC GAGCTTGTGC TCTACGACGT CGATCCCCTG CGGCTGGCCG CCATCAAGGC CGTCCTGGCG TCGATGGCCC CTTCCAACGG TGCACGCGGC CCTGCAGTCA GCACCACCAC CTCGCTGGCC GAGGGGCTCG AAGGCACCGC CATGGTGTTC GCGGCCATCC GGCCCGGAGG AACAGCTGGC CGCACCGCTG ACGAACGCGT GGCAATCGGC CTGGGCCTGC TGGGCCAGGA GACCACGGGC GCCGGCGGAA TCTCCTACGC GCTGCGATCC ATCCCCGGGA TGCTGGCCTT GGCCCGAGAA ATGAAGCAGC GATGCCCCGG GGCATGGTTG GTTAACTTCA CCAACCCGGC CGGGATGGTC ACCGAGGCGC TGGTGCCCGT GCTGGGAAAC CGGGTGATCG GCATCTGCGA CTCGGCCGGC GGCCTGGTCC ACCGGGCGGC CCGGGCGGCC GGCGCCGCCT TGCCGGAGGG AAGGCTCGAC GGCGTGGGCT ACTACGGGCT CAACCACCTG GGCTGGCTGT ACCGGCTGGA ATCCGGCGGA AAGGACCTGC TGCCCGGACT GCTGGCGGAC GCGCAGGCCC TGTCCGGCTT CGAGGAGGGC CGGCTCTTTC CGCAGCCGTT CCTCCAGGCA CTCGGCTGCC TGCCCAACGA GTACCTCTAT TACTACTACG ACACCGCCCG GGCCGTCGGC GCCATCCGCG CCATGCGGCA GACACGCGGC GAGTCGATCC ACGAGCAGCA GTCGGGGCTC TACCCGGCGC TGGCCGCGGC CGGATCCCGC GCCTACGAAC TGTGGGACGC GGCCCGCCGT TCGCGCGAGG AAGGCTACCT CGCCGAGGCA CGCGCCGGCG GCGAACAACG GGACGAGGAG GACCTCGCGG GCGGCGGGTA TGAGCGCGTT GCCCTCGCGG TTATGCGCGC CCTGGCCGGC GGTGCGCCGG ACGATGCAGC GCCCCTCAGC GGTGGCATTG CTGCCGGGAT CACGGAGCTC ATCCTGAACA CCCCCAACGA AGGTGCGGTT CCCGGGCTGC CGGCGGACGC TGTCGTCGAG GTACCCTGCC AGGTGACGCC CGACGGCGCC ATGCCGCTGC CGCAGGACCG GCCCGGAGAC GCGCAGCTTG CCCTCATGCA GCGGGTCAAG GAGGTGGAGC GGCTCACGGT GTCGTCCGTG GTGCAGGGAC GACGGTCCGA CGCTCTGCGG GCTTTCGGGC TCCATCCGCT GATCGACTCC GAGGAACTGG CTGTGAAACT CCTGGCCGGC TACGAGGCGG CGTTTCCGGC GCTCGGGCAG CTGTGGCGCG GCGGGGCCTG A
|
Protein sequence | MRLMIAGGGG FRVPLVYRAL TSGPFAGLVR ELVLYDVDPL RLAAIKAVLA SMAPSNGARG PAVSTTTSLA EGLEGTAMVF AAIRPGGTAG RTADERVAIG LGLLGQETTG AGGISYALRS IPGMLALARE MKQRCPGAWL VNFTNPAGMV TEALVPVLGN RVIGICDSAG GLVHRAARAA GAALPEGRLD GVGYYGLNHL GWLYRLESGG KDLLPGLLAD AQALSGFEEG RLFPQPFLQA LGCLPNEYLY YYYDTARAVG AIRAMRQTRG ESIHEQQSGL YPALAAAGSR AYELWDAARR SREEGYLAEA RAGGEQRDEE DLAGGGYERV ALAVMRALAG GAPDDAAPLS GGIAAGITEL ILNTPNEGAV PGLPADAVVE VPCQVTPDGA MPLPQDRPGD AQLALMQRVK EVERLTVSSV VQGRRSDALR AFGLHPLIDS EELAVKLLAG YEAAFPALGQ LWRGGA
|
| |