Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4317 |
Symbol | |
ID | 8335671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4900365 |
End bp | 4901879 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644957420 |
Product | Alpha-N-arabinofuranosidase |
Protein accession | YP_003115022 |
Protein GI | 256393458 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3534] Alpha-L-arabinofuranosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.341945 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.00470977 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGCTGACAG CACATGCCAC CCTCGACCCG GCCTTCCGCG TCGCGCCGGT CGACCGCCGC CTGTTCGGTT CCTTCGTGGA GCACATGGGC CGCTGCGTCT ACGGCGGGAT CCACGAGCCC GGACACCCCG AGTCCGACGC CGACGGAAAC CGGCTGGACG TCCTGGAACT CACCCGCGAC CTCGGGATCA GCGTCATCCG CTACCCCGGC GGCAACTTCG TGTCCGGATA CCGCTGGGAG GACGGCGTCG GGCCGGTCTC CGAGCGGCCG CGCCGGCAGG ACCTGGCCTG GCGCTCCATC GAGACCAACG AGTTCGGCCT GAACGAGTTC ATGGCCTGGG CGAAGCTGGC GAACGTGGAG CCGATGATGG CGGTGAACCT GGGGACCCGG GGCATCCAAG AGGCCTGCGA CCTGCTGGAG TACGCCAACC ACCCGGGCGG CACGTACCTG TCCGACCTGC GCCGCAAGCA CGGCGTCGAG GAGCCGCACG CGATCAAGCT GTGGTGCCTG GGCAACGAGA TGGACGGCCC GTGGCAGACC GGCCACAAGA CCGCCGAGGA GTACGGCAGA CTCGCCGCCG AGACCGCCAA GGCGATGCGC CAGGTCGACT CCGGCATCGA GCTGGTCGCC TGCGGCAGCT CCAACGCCCT CATGCCGACC TTCGGCTCCT GGGAGTCCAC GGTCCTGGAG CACACCTTCG ACTACGTGGA CTACATCTCC CTGCACGCCT ACTACGAGCA ACACGGCCAG GACCGCGCCA GCTTCCTGGC CGCCGCAGCC GGCATGGACC GCTTCATCGA CGGCGTGGTG GCCACCGCCG ACCATGTCTC CGCGAAGAAG AAGTCCCGCA AGAAGCTCAA CCTCTCCTTC GACGAGTGGA ACCTGTGGCA GGAAAGCCGC TTCGCCGGCC ACACGAACCT GGACTGGGAA CAGGCGCCCC GACTGATCGA GGACACCTAC ACCGTGCAGG ACGCGGTGGT CTTCGGCAGC CTGCTGATGT CCCTGCTCCG CCACGCCGAC CGGGTCGCCA TCGCCTGCCT CGCGCAGCTG GTGAACGTGA TCGGCCCGAT CCGCGCCGAA CCGGACCGCC CAGCCTGGGC ACAGACCATC TTCCACCCCT TCGCACTCAC CGCTCGGCAC GCAGTCGGCG ACGTGCTCCG CGTGGAGACA GCCACCGACC TCTACGACAC CGCCGAGCAC GGCGACGTAC CGCTGGTCGA CGTGGTGGCT ACGCACGACC GCGAAAGCGG GCAGCTCACG GTGTTCGCGA TGAACCGGCA CACCGAGGAC CGGGCCCGCC TGGAGATCGA CCTGCGGGCT TTCGGCGACC TGGTGGTCAC CGAGCACCTG CACCTCGGCG GCGAGCAGGA CCTCGACGCC GTGAACTCGA TCGAGACGCC GCAGGCGGTC GCGCCGATCA GCGTTGAGGG CGCGAGCATC GAGGGACGTC GGCTGACGGC TTCGCTCCCG CCGGTGTCCT GGAACATGAT CCGGTTCGTT CCCGCCAATC GGTAG
|
Protein sequence | MLTAHATLDP AFRVAPVDRR LFGSFVEHMG RCVYGGIHEP GHPESDADGN RLDVLELTRD LGISVIRYPG GNFVSGYRWE DGVGPVSERP RRQDLAWRSI ETNEFGLNEF MAWAKLANVE PMMAVNLGTR GIQEACDLLE YANHPGGTYL SDLRRKHGVE EPHAIKLWCL GNEMDGPWQT GHKTAEEYGR LAAETAKAMR QVDSGIELVA CGSSNALMPT FGSWESTVLE HTFDYVDYIS LHAYYEQHGQ DRASFLAAAA GMDRFIDGVV ATADHVSAKK KSRKKLNLSF DEWNLWQESR FAGHTNLDWE QAPRLIEDTY TVQDAVVFGS LLMSLLRHAD RVAIACLAQL VNVIGPIRAE PDRPAWAQTI FHPFALTARH AVGDVLRVET ATDLYDTAEH GDVPLVDVVA THDRESGQLT VFAMNRHTED RARLEIDLRA FGDLVVTEHL HLGGEQDLDA VNSIETPQAV APISVEGASI EGRRLTASLP PVSWNMIRFV PANR
|
| |