Gene Information |
Locus tag | Caci_5007 |
Symbol | |
ID | 8336361 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 5732605 |
End bp | 5735157 |
Gene Length | 2553 bp |
Protein Length | 850 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644958106 |
Product | Alpha-galactosidase |
Protein accession | YP_003115708 |
Protein GI | 256394144 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
| |
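The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Caci_5007.
length = gene_length(5732605, 5735157)
print(length, expected_protein_length(length))  # prints: 2553 850
```

Under these assumptions the results agree with the listed Gene Length (2553 bp) and Protein Length (850 aa).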
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.285302 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGATAC CCAAACTCAT CGCGCTCGCC GCGTCGGCGG CGCTGTGGGC GACAGCGGGA CCGGCGTGGG CCGGGTCTCA GGCCCGGCAC GCGCCGGCCA CGGTCCCGAT CAACAACCTG GCCCGCACGC CGTATCAGGG CTGGAACACC TACTACGGCC TCGGGTCCAC CTTCACCGAG CAGACCATCA AGGACGAGGC CGACGCGCTG GTGAGCAAGG GTCTGGCCGC CGCGGGCTAC AACTACGTCT GGATCGACGG CGGCTGGTGG AACGGCGCGC GGGACGCCTC CGGCGCCATC ACCGTAGACT CGACTCAGTG GCCTGACGGG ATGAAGGCGG TCGCCGACTA CATCCACTCG CGCGGGCTGA AAGCAGGCAT CTACACCGAC TCCGGCCTCA ACGGCTGCGG CGGCGCCAAC CAGGGCAGCT ACGGCCGCTA CCAGCAGGAC GTCAACCAGT TCGCCGGGTG GGGCTACGAC GCGGTGAAGG TCGACTTCTG CGGCAGCGAG CAGATGGGAC TGGACCCGGC CACCGTCTAC GGCCAGTTCC GCGACGCAGT CCTGAACAAC AGCAGCCACC GCCCGATGCT GTTCAACATC TGCAACCCCT TCATCCCCGA GACCGGCGCC GCGCCCGGCC GCAGCGCGTT CGACTCCTAC ACGTTCGGCC CGAGCACCGG GAACTCCTGG CGGACCGACA CCGACATCGG CTTCCCGAAC GACGTCCGCT ACTCCGACGT GCTGCGCAAC CTGGACGCCG ACGCCGCGCA CCCGGAAGCC GCCGGTCCCG GGCACTGGAA CGACCCGGAC TATCTGGGCC CTGATCTGGG CATGACCGAC GCCGAGTCCC GCTCGCAGTT CTCAATGTGG TCGATCGTCG CGGCTCCGCT GATGATCGGC TCGGATGTAC GCAAGCTCTC CGACAGCGCC GTCGCTATGC TCACCAACGC CGAAGTGCTC GCGGTAGACC AGGACCGGCT GGGAATTCAG GGCACTGCGT TGTCCGCCCC GACCGCCTCC GGCGCCCAGG TCTGGACCAA GCCGCTCGCC AACGGCGACG TCGCTGTCGC GCTGCTCAAC CGCGGCACGA CCCCGCAGCT GATCTCCACG ACCGCCGGCA AGATCGGCCT GTCCACGTCG GGCAGCTACG CCGTGCGCGA TCTGTGGCAG CACTCGACGA CCGAGTCGGC CGGCACGATC TCCGCGACCG TCGCCCCGCA CGACGTCGTG CTGTATCGGG TCTCCCGCAA CGGAAACCCG GCGACCATGA CGCCGGCGAC CACGCTGTCC CCGGCGACCC TGACCGCGAC CGCGCAGGCC GCGCTTCCCC TGGTCGCCCC CGGCGACTCC TTCCCGGTCT CGGCCACGTT CACCGACAAC GGCCGGCTCG CCGTCCGGAA CGTGAAGCTC ACGCTCGCCG TCCCGGCGGG CTGGACCGCC ACGCCGACCA CCCCGGCCGC GAAGGACCGC CTCGACAGCG GGCAATCGAT TGCGGCGACC TGGCAGGTGA CCGCCGCTCC AGGCGCTCTG CCCGGTACGG ACCAGCTCGC GGTGACTGCG GGCTACGACT GGCAGGGTGC TTACTCCGGT GCGACATCAA GGCTCCAGAC ACTCAGCGCC ACCGAGTCGA CGCAAGTCCA GGTCCCCGCT GCGCCGCCGT CGGGCACCGG TCCGCTGAGC CACCATCCCT GGCTCGACGC GGCGAGCGGC TACCTCGTGC CCCGGGTCGA CCTCGACGAT GCCGGCGGCG GGCCGCTGAC GATGCACGGC GTCGGGTACC CGACCGGTGT CGGCACCGCG 
TCGCCCTCGA CGATCGACTA CTACGTCGGC GGCCAGTGCA GCACGCTCAC CGCCACGGTC GGCATCGACG ACTCGGCGGA CTTCGACCCG ACCGGCGGGA CGGCGGTGTT CCAGGTCTAC GGCGACGGCG TGAAGCTGTA TGACAGTGGT CTGGTGACCC GGGCCGCGCC TCAGAGCGCG TCGGTGAATC TGGGTACTGC GAAGGTGATC AGCCTGGTCG TCGGGGACGG CGGCGACGGC GGTTACAACG ACCGCACGGA CTGGGGCGGG CTGCGGATCA CCTGCGGCGC GCCGGTCGGC ACGCAGCCCA GCGGACCCTG GCCGCACTTC GCGCCCTCGT CCTCGGTGTC CGCGACGGCC ACCAGCGCCA ACGCCGGCTA CCCGGCGGGC AACGCGGTGG ACGGCCAGGT GACCACTTTG TGGCACTCGC AGTTCAGTCC GGTCCACGAC CCGCTGCCGA TCTCGCTGAC GATGGACCTC GGCTCGGTGC AGACGGTCAC CGGACTGACC TACCAACCCC GCCTCGACGG CGCGATCACC GGTACCATCA CCGGTTACAC CGTCGAGGTC AGCAGCGACG GCGTCACCTT CACCCCGGCG GCAGCGGCGG GGACGTGGAC GCAGGACGCG CTGCTGAAGT CCGTTGAATT CGCTCCGGTG TCGGCTCGCT ATGTGCGACT GACTGCGACT GCGGCAGCCG ACGGCTACGC CTCGGCGGCT GACGTCAGCG TGGCGGCGCG ACCGACCGCC TGA
|
Protein sequence | MLIPKLIALA ASAALWATAG PAWAGSQARH APATVPINNL ARTPYQGWNT YYGLGSTFTE QTIKDEADAL VSKGLAAAGY NYVWIDGGWW NGARDASGAI TVDSTQWPDG MKAVADYIHS RGLKAGIYTD SGLNGCGGAN QGSYGRYQQD VNQFAGWGYD AVKVDFCGSE QMGLDPATVY GQFRDAVLNN SSHRPMLFNI CNPFIPETGA APGRSAFDSY TFGPSTGNSW RTDTDIGFPN DVRYSDVLRN LDADAAHPEA AGPGHWNDPD YLGPDLGMTD AESRSQFSMW SIVAAPLMIG SDVRKLSDSA VAMLTNAEVL AVDQDRLGIQ GTALSAPTAS GAQVWTKPLA NGDVAVALLN RGTTPQLIST TAGKIGLSTS GSYAVRDLWQ HSTTESAGTI SATVAPHDVV LYRVSRNGNP ATMTPATTLS PATLTATAQA ALPLVAPGDS FPVSATFTDN GRLAVRNVKL TLAVPAGWTA TPTTPAAKDR LDSGQSIAAT WQVTAAPGAL PGTDQLAVTA GYDWQGAYSG ATSRLQTLSA TESTQVQVPA APPSGTGPLS HHPWLDAASG YLVPRVDLDD AGGGPLTMHG VGYPTGVGTA SPSTIDYYVG GQCSTLTATV GIDDSADFDP TGGTAVFQVY GDGVKLYDSG LVTRAAPQSA SVNLGTAKVI SLVVGDGGDG GYNDRTDWGG LRITCGAPVG TQPSGPWPHF APSSSVSATA TSANAGYPAG NAVDGQVTTL WHSQFSPVHD PLPISLTMDL GSVQTVTGLT YQPRLDGAIT GTITGYTVEV SSDGVTFTPA AAAGTWTQDA LLKSVEFAPV SARYVRLTAT AAADGYASAA DVSVAARPTA
|
| |
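The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a field could be normalised and its GC fraction computed. The helper names are hypothetical, and the sample string is only the first three blocks of the gene sequence, not the full record.

```python
def normalise(seq_field: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(seq_field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence, used only as a short sample.
sample = normalise("ATGCTGATAC CCAAACTCAT CGCGCTCGCC")
print(len(sample), round(gc_fraction(sample), 3))
```

Applied to the complete gene sequence field, the same helpers would be expected to return a length equal to the listed Gene Length and a GC fraction close to the listed GC content, provided the record is internally consistent.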