Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3388 |
Symbol | |
ID | 4444117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3810231 |
End bp | 3812438 |
Gene Length | 2208 bp |
Protein Length | 735 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639691211 |
Product | glycoside hydrolase, clan GH-D |
Protein accession | YP_832863 |
Protein GI | 116671930 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCCCC TGCACCTCCG CTCCGCCGGC ACCAGCCTGG TGATCAGCTT CGACAGCGGG GAGGCCGAGG TCATTCACTG GGGCGCCGAT CTGGGCGCTT CACTCCCCGA TCTGGCCATC CTCGGCGAAC CGATCCCGCC CTCCGCCATC GACGCCTCCG TCCCCGCCGG GCTGCTGCCG CAGGCATCCT CCAGCTGGCG TGGACGCCCG GCCCTCCGGG GGCACCGGAT CGCCGACGGC GTGCCCGGCT ACGACTTTTC CGTCCGCCTG CGCGTCACGG ACGTCAAGAC CGCAGGAAGT TCAGCTGTGA TCGTCCAGTC TGATCCCGAT GCCGGAATCT CTGTTGAATC CACGCTGGAA CTGCACGCCG GCGGGCTGCT GGAAATGCGC CACACCGTCA CCAACACCGG CACTTCGCCT TTTCAGCTCG ACGAACTGGC CACGGTGCTG CCGGTGGCTC CCGACGCCGT CGAACTCCTT GACCTGACCG GACGCTGGTG CCGTGAACGC CACCCTCAGC GCCGCGCCAT CCAGCAGGGC ACCTGGGTGC GGACCGGCCG GCACGGCCGG ACCGGCCACG ACTCCTCCCT GCTGCTGGCC GCCGGCACGG CAGGCTTCGG CAACCGCCAC GGCAAGGTCT GGGCCACCCA CCTTGCCTGG AGCGGAAACC ATGAGCAGTT CGCGGACAGC ATCGGGGACG GACGGACCGT CATCGGTGGT TCCGAGCTGC TGGGTCCGGC CGAAGTGGTC CTCCAGCCGA ACGGCAGCTA CACCACCCCC GCTCTCTTCG CGGCCTACTC GGACCGCGGC CTGGACGGTA TCAGCGAAGC GTTCTACAGC TGGTTCAGGA ACCGGCCGCA CCATGTGCTG CCTTCGGCGT CAGCCGTCTC AGGAGCAGCT CATGCGGGCA CCGGCAAGGC CCGGCCTGTA GTGCTAAACG TCTGGGAAGC TGTCTACTTC AACCACGATC TGGGTGTATT GGTCGAACTT GCCGATTCCG CGGCGGACCT GGGCGTGGAG CGCTTTGTCC TCGACGACGG GTGGTTCCGC GGCCGCCGGC ACGACCAGGC AGGCCTGGGC GACTGGTACG TGGACGAGGG CCTCTGGCCG GACGGGCTCA CACCCCTGAT CGACGCCGTC ACATCGCGCG GCATGGAATT CGGCCTCTGG GTGGAGCCCG AAATGATCAA CCTGGACTCC GACACCGCGC GCGCCCACCC GGACTGGATC GTCGGGCCGG CCGCACGGTC CCACAAGGAC GGCGGCCGGC TGCCGTTGAC CTGGCGCCAC CAGCACGTCA TCGACCTGGT CAATCCCGAG GCCTGGCAGT ACGTTTTCGA CCGCATTGAC GCCCTGTTGC GCGAAAACAA CATCAGCTAC CTGAAGTGGG ACCAGAACCG GGACCTCACC GAGCACGGCC ACGCCGGGCG CGCCTCCGTC CACGAACAGA CCCTGGCCGC CTACCGCCTC TTCGATGAGC TCAGGAAAGC CCATCCGGGC CTCGAAATCG AGAGCTGCTC TTCCGGCGGG GCACGCGTGG ACCTGGGCAT CCTGGAACGC ACGGACCGGA TCTGGGCTTC GGACTGCAAC GATGCCCTGG AACGCCAGAC CATCCAGCGC TGGACCGGGC TGGTGGTGCC GCCGGAACTG GTCGGAGGAC ACATCGGCCC CACTACGTCA CACACCACGG CCCGCACGCA CGACGTTTCG TTCCGCGCCA TCACGGCCCT GTTCGGACAC TTCGGCCTCG AATGGGACGT CCGCCAGGTT CACGGCGCGG AGCGCGAAGA ACTCAAGCGG TTCATCGGGC TCTACAAGGA GCACCGCGGC CTGATCCACT CGGGCCGGAT GGTCCGGGCG GATGTTGCCG ACGATTCGCT GATGCTGCAC GGCGTCGTTT CCCACGGCAG CCCAGCAACC GGGGACACGG CGGCACTGTT CGCGCTGGTC AGCACCAGGA CGTCGCCCGC GGAGCGTCCG GGCCGCATCG CCATTCCGGG ACTGGACCAG GACCGCAGCT ACCGCGTGGA GGCCATCTTC CCGACGCCCG GCGATGCCGA CTACGCGCAC AACTACACCC AGGCGCAGCC CCCCGCATGG CTGACCGCGG GTGCAGAAGC CAGCGGCCGG TTCCTGTCCG AGGTGGGCCT GCCCATGCCC GTCCTCAACC CGGAGCACGC ACTGCTGCTC AGCTTCACTG CCGTGTAG
|
Protein sequence | MDPLHLRSAG TSLVISFDSG EAEVIHWGAD LGASLPDLAI LGEPIPPSAI DASVPAGLLP QASSSWRGRP ALRGHRIADG VPGYDFSVRL RVTDVKTAGS SAVIVQSDPD AGISVESTLE LHAGGLLEMR HTVTNTGTSP FQLDELATVL PVAPDAVELL DLTGRWCRER HPQRRAIQQG TWVRTGRHGR TGHDSSLLLA AGTAGFGNRH GKVWATHLAW SGNHEQFADS IGDGRTVIGG SELLGPAEVV LQPNGSYTTP ALFAAYSDRG LDGISEAFYS WFRNRPHHVL PSASAVSGAA HAGTGKARPV VLNVWEAVYF NHDLGVLVEL ADSAADLGVE RFVLDDGWFR GRRHDQAGLG DWYVDEGLWP DGLTPLIDAV TSRGMEFGLW VEPEMINLDS DTARAHPDWI VGPAARSHKD GGRLPLTWRH QHVIDLVNPE AWQYVFDRID ALLRENNISY LKWDQNRDLT EHGHAGRASV HEQTLAAYRL FDELRKAHPG LEIESCSSGG ARVDLGILER TDRIWASDCN DALERQTIQR WTGLVVPPEL VGGHIGPTTS HTTARTHDVS FRAITALFGH FGLEWDVRQV HGAEREELKR FIGLYKEHRG LIHSGRMVRA DVADDSLMLH GVVSHGSPAT GDTAALFALV STRTSPAERP GRIAIPGLDQ DRSYRVEAIF PTPGDADYAH NYTQAQPPAW LTAGAEASGR FLSEVGLPMP VLNPEHALLL SFTAV
|
| |