Gene Arth_0226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0226 
Symbol 
ID4447317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp236713 
End bp238260 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content68% 
IMG OID639688022 
ProductAlpha-N-arabinofuranosidase 
Protein accessionYP_829727 
Protein GI116668794 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3534] Alpha-L-arabinofuranosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCACTA CCGAATCCGC CGCAGGCCGG ATAACCGCCA AGATCACCCT GGACGCGGCC 
TTCACCGTAG GGCCGGTCCG GCGCCGCACC TTCGGCGCTT TTGTGGAGCA CCTGGGACGC
TGCGTCTACA CCGGCATCTT CGAGCCGGGC CACCCGAAGG CCGACGGGGA CGGCTTCCGG
ACGGACGTCC TGGAACTTAC CCGCGAGCTG GGCGTGTCCA CGGTGCGCTA TCCGGGCGGG
AACTTTGTCT CCGGCTACCG CTGGGAGGAC GGCGTGGGGC CGGTGGAGGA GCGGCCGGCG
CGGCTTGATC TGGCATGGCA CTCCACCGAT CCCAACCACG TTGGCGTGGA CGAGTTCGCC
AAGTGGTCCG CCAAGGCCGG CGTCGAGCCG ATGATGGCCG TGAACCTGGG AACCCGGGGG
ACCCAGGAGG CCCTGGACCT GCTGGAGTAC TGCAATATCG ACGGCGGCAC GGCCTTTTCC
GACCAGCGCC GGGCCAACGG GGCCGAAAAC GGCTACGGCA TCAAAATGTG GTGCCTGGGC
AACGAGATGG ACGGCTTCTG GCAGATCGGC CACAAGAACG CAATGGAATA CGGCCGGCTC
GCTGCCGACA CCGCCCGCGC CATGCGGATG GTGGAGCCGG ACCTGGAACT GGTGGCCTGC
GGCAGCTCCG CGCCCACCAT GGCCACTTTC GGTGAGTGGG AGCGCGTGGT CCTGACCGAA
ACCTACGAAC TCGTGGACCT GATCTCCGCC CACCAGTATT TCGAGGACTT CGGCGACCTG
CAGGAACACC TCTCCTCGGG CCACCGGATG GAGGCGTTCA TCAAGGACAT CGTGGCGCAC
ATCGACCACG TGAAGTCGGT CAAGAAGTCC GCTAAACAGG TGAACATCTC CTTCGACGAG
TGGAACGTCT GGCACATGAG CCGCGACGAA TCCAAGACGC CCGCCGGCAA GGACTGGCCC
GTGGCCCCCG TGCTGCTCGA GGACCGCTAC ACCGTGGCGG ACGCCGTGGT TGTGGGCGAC
CTCCTCATCA CGCTGCTCCG CAACACCGAC CGCGTCCACT CAGCCAGCCT GGCGCAGCTG
GTCAACGTCA TCGCCCCGAT CATGACAGAG CCCGGCGGCC GCGCCTGGAA GCAGACCACC
TTCCACCCGT TCGCCCTGAC GTCGGAGCAC GCCAAGGGCA CGGTGCTGCA GCTCGCCGTC
GAGTCCCCGC GGCTCAGCGG CGGCAAGACC GCGGACTTTG CCGCGCTGTC CGCCGTGGCC
ACCTTCGACG CCGCCGCCGG GGAAGCCGTG GTGTTTGCCG TGAACCGCTC GGCCACTGAC
TCAATCACGT TGAGCGCCGC CGTCGCCGGG CTGGGCAACG CGAAGGTCAT CGAGTCGGTC
ACGTACGCCA ACAAGGACCC GTACTGGCAG GCCACGGCGG ACGACTCCAC CTCGGTGCTG
CCCGGCCAAA ACGTCAGCGT AAAGCTCGAC GGCGGCCGTC TCACCGCGGA GCTCCCCGCC
GTCTCCTGGA GCATGATCCG CCTGGCCGTC GACGGGGCGG GCAGCTAG
 
Protein sequence
MSTTESAAGR ITAKITLDAA FTVGPVRRRT FGAFVEHLGR CVYTGIFEPG HPKADGDGFR 
TDVLELTREL GVSTVRYPGG NFVSGYRWED GVGPVEERPA RLDLAWHSTD PNHVGVDEFA
KWSAKAGVEP MMAVNLGTRG TQEALDLLEY CNIDGGTAFS DQRRANGAEN GYGIKMWCLG
NEMDGFWQIG HKNAMEYGRL AADTARAMRM VEPDLELVAC GSSAPTMATF GEWERVVLTE
TYELVDLISA HQYFEDFGDL QEHLSSGHRM EAFIKDIVAH IDHVKSVKKS AKQVNISFDE
WNVWHMSRDE SKTPAGKDWP VAPVLLEDRY TVADAVVVGD LLITLLRNTD RVHSASLAQL
VNVIAPIMTE PGGRAWKQTT FHPFALTSEH AKGTVLQLAV ESPRLSGGKT ADFAALSAVA
TFDAAAGEAV VFAVNRSATD SITLSAAVAG LGNAKVIESV TYANKDPYWQ ATADDSTSVL
PGQNVSVKLD GGRLTAELPA VSWSMIRLAV DGAGS