Gene Information |
Locus tag | Arth_0226 |
Symbol | |
ID | 4447317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 236713 |
End bp | 238260 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639688022 |
Product | Alpha-N-arabinofuranosidase |
Protein accession | YP_829727 |
Protein GI | 116668794 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3534] Alpha-L-arabinofuranosidase |
TIGRFAM ID | |
|
|
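The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon (the record states neither convention explicitly). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(236713, 238260)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1548 515
```

Under these assumptions the computed values agree with the Gene Length (1548 bp) and Protein Length (515 aa) rows.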
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCACTA CCGAATCCGC CGCAGGCCGG ATAACCGCCA AGATCACCCT GGACGCGGCC TTCACCGTAG GGCCGGTCCG GCGCCGCACC TTCGGCGCTT TTGTGGAGCA CCTGGGACGC TGCGTCTACA CCGGCATCTT CGAGCCGGGC CACCCGAAGG CCGACGGGGA CGGCTTCCGG ACGGACGTCC TGGAACTTAC CCGCGAGCTG GGCGTGTCCA CGGTGCGCTA TCCGGGCGGG AACTTTGTCT CCGGCTACCG CTGGGAGGAC GGCGTGGGGC CGGTGGAGGA GCGGCCGGCG CGGCTTGATC TGGCATGGCA CTCCACCGAT CCCAACCACG TTGGCGTGGA CGAGTTCGCC AAGTGGTCCG CCAAGGCCGG CGTCGAGCCG ATGATGGCCG TGAACCTGGG AACCCGGGGG ACCCAGGAGG CCCTGGACCT GCTGGAGTAC TGCAATATCG ACGGCGGCAC GGCCTTTTCC GACCAGCGCC GGGCCAACGG GGCCGAAAAC GGCTACGGCA TCAAAATGTG GTGCCTGGGC AACGAGATGG ACGGCTTCTG GCAGATCGGC CACAAGAACG CAATGGAATA CGGCCGGCTC GCTGCCGACA CCGCCCGCGC CATGCGGATG GTGGAGCCGG ACCTGGAACT GGTGGCCTGC GGCAGCTCCG CGCCCACCAT GGCCACTTTC GGTGAGTGGG AGCGCGTGGT CCTGACCGAA ACCTACGAAC TCGTGGACCT GATCTCCGCC CACCAGTATT TCGAGGACTT CGGCGACCTG CAGGAACACC TCTCCTCGGG CCACCGGATG GAGGCGTTCA TCAAGGACAT CGTGGCGCAC ATCGACCACG TGAAGTCGGT CAAGAAGTCC GCTAAACAGG TGAACATCTC CTTCGACGAG TGGAACGTCT GGCACATGAG CCGCGACGAA TCCAAGACGC CCGCCGGCAA GGACTGGCCC GTGGCCCCCG TGCTGCTCGA GGACCGCTAC ACCGTGGCGG ACGCCGTGGT TGTGGGCGAC CTCCTCATCA CGCTGCTCCG CAACACCGAC CGCGTCCACT CAGCCAGCCT GGCGCAGCTG GTCAACGTCA TCGCCCCGAT CATGACAGAG CCCGGCGGCC GCGCCTGGAA GCAGACCACC TTCCACCCGT TCGCCCTGAC GTCGGAGCAC GCCAAGGGCA CGGTGCTGCA GCTCGCCGTC GAGTCCCCGC GGCTCAGCGG CGGCAAGACC GCGGACTTTG CCGCGCTGTC CGCCGTGGCC ACCTTCGACG CCGCCGCCGG GGAAGCCGTG GTGTTTGCCG TGAACCGCTC GGCCACTGAC TCAATCACGT TGAGCGCCGC CGTCGCCGGG CTGGGCAACG CGAAGGTCAT CGAGTCGGTC ACGTACGCCA ACAAGGACCC GTACTGGCAG GCCACGGCGG ACGACTCCAC CTCGGTGCTG CCCGGCCAAA ACGTCAGCGT AAAGCTCGAC GGCGGCCGTC TCACCGCGGA GCTCCCCGCC GTCTCCTGGA GCATGATCCG CCTGGCCGTC GACGGGGCGG GCAGCTAG
|
Protein sequence | MSTTESAAGR ITAKITLDAA FTVGPVRRRT FGAFVEHLGR CVYTGIFEPG HPKADGDGFR TDVLELTREL GVSTVRYPGG NFVSGYRWED GVGPVEERPA RLDLAWHSTD PNHVGVDEFA KWSAKAGVEP MMAVNLGTRG TQEALDLLEY CNIDGGTAFS DQRRANGAEN GYGIKMWCLG NEMDGFWQIG HKNAMEYGRL AADTARAMRM VEPDLELVAC GSSAPTMATF GEWERVVLTE TYELVDLISA HQYFEDFGDL QEHLSSGHRM EAFIKDIVAH IDHVKSVKKS AKQVNISFDE WNVWHMSRDE SKTPAGKDWP VAPVLLEDRY TVADAVVVGD LLITLLRNTD RVHSASLAQL VNVIAPIMTE PGGRAWKQTT FHPFALTSEH AKGTVLQLAV ESPRLSGGKT ADFAALSAVA TFDAAAGEAV VFAVNRSATD SITLSAAVAG LGNAKVIESV TYANKDPYWQ ATADDSTSVL PGQNVSVKLD GGRLTAELPA VSWSMIRLAV DGAGS
|
| |
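The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG is an accepted start codon read as methionine. The following is a minimal sketch of how the Gene sequence and Protein sequence rows relate, and of how a GC fraction like the one reported above could be computed. It assumes the space-separated 10-base blocks are purely a display convention; the helper names (`clean`, `translate_cds`, `gc_content`) are hypothetical and do not come from the source database.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 shares these assignments with the standard code; it differs in
# which codons may act as translation starts.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS_T11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove whitespace (e.g. the 10-base display blocks) and upper-case."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a complete CDS, reading an alternative start codon as M."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS_T11:
            residue = "M"  # initiator codon is translated as methionine
        else:
            residue = CODON_TABLE.get(codon, "X")  # X for ambiguous codons
        if residue == "*":
            break  # stop codon terminates translation
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed in this record.
prefix = "GTGAGCACTA CCGAATCCGC CGCAGGCCGG"
print(translate_cds(prefix))  # MSTTESAAGR
```

Passing the full Gene sequence row to `translate_cds` and `gc_content` would allow a comparison against the Protein sequence and GC content rows of this record.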