Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_7918 |
Symbol | |
ID | 8339295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 9189632 |
End bp | 9191239 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644961002 |
Product | Alpha-galactosidase |
Protein accession | YP_003118582 |
Protein GI | 256397018 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.542076 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCATC AGGCCCCGCG CAGTCGACAG TTGCGCGCCG CCTTCGCAGC GCTCGCCCTC GTGGTCGGGA CGATGTTCGG AGTAGCGGTA GCGGCACCGG CACAAGCCCT CGGCAACGGT CTGGCACTCA CGCCGCCGAT GGGATGGAAC GACTGGAACT CCTTCGGGTG CAACGTCTCG GAGAGCCTGG TCGAGCAGAC GGCTGATCTC ATCGTCTCGT CCGGGATGAA GGACGCCGGC TATCAGTATG TGAACATCGA CGACTGCTGG ATGTCGTCGA ACCGGGACGC CGGCGGCAAC CTGGTCCCGG ATCCGGCGAA GTTCCCGGAC GGCATCTCGG GGACCGCCGC GTACGTCCAC AGCAAGGGTC TCAAGCTCGG GATCTACGAG AGCGCGGGCA CCGCGACCTG CGCCGGCTAT CCGGGCAGCC TGAACCACGA GCAGGCTGAC GCGAACTCCT TCGCGTCCTG GGGCGTGGAC TACCTCAAGT ACGACAACTG CAACAACCAG GGAATCCCGG CTCAGACCCG CTACACGGCG ATGCGCGACG CGCTGGCGAA AACCGGCCGG CCGATCGTGT ACAGCCTGTG CAACTGGGGT CAAGAGTCGG TCTGGACCTG GGGCGCCGGG GTCGGCAACC TGTGGCGCAC CACCGGTGAC ATCAGCGCCA ACTTCGGCAG CATGCTGTCC AACTTCCACA ACACCGTCGG CCTGGCGTCC TCCGCCGGAC CCGGCGGCTG GAACGACCCG GACATGCTCG AAGTGGGCAA CGGGATGTCG TTCACCGAGG ATCGCGCGGA GATGTCGCTG TGGGCCGAGA TGGCCGCGCC GCTGATCTCC GGGACGGACC TGCGCAAGGC GACGACGGCC ACGCTGTCGC TGTACACGAA CAAGGACGTC ATCGCCGTCG ACCAGGACTC GCTGGGCAAG GCCGGCCGCG AGATCGCGTC CTCCGGCGGC GCGGACGTGC TGGCCAAGCC GTTGGCGAAC GGCGACGTCG CAGTCGCGCT GTTCAACGAG AACTCCTCGG CGCAGACGAT CTCGACGTCC GCGTCGGCGA TCGGGATCGG TTCAGCTTCC TCGTACAAGC TGAACAACCT GTGGTCGCAT GTGCTCACCT CGACCTCCGG CTCGATCAGT GCCTCGGTGC CGGGCCACGG CGTAGTGCTG TACCGGGTGT CCGCGGGCAG CGGGAGCAGT GTCGGCAGTG CGCACCGGTT CCTCGGCGCG TCGTCCGGCC GCTGCCTGGA CGTGCCGAAC GGCAGCACGA CGACCGGCAC GCAGCTGGAC ATCTGGGACT GCGGCACCGG GACCAACCAG TCCTTCACCC CGACCTCGGA CAAGGAACTG CGCGTCTACA ACGGCAGCCT GTGCCTGGAC GCCAGCGGAC AGGGCACGAC CGCCGGCACC AAGGTCATCA CCTGGACCTG CAACGGCCAG ACCAACCAAC AGTGGAACCT GAACGCCGAC GGGACGGTCA CCGGCGTGCA GTCCGGGCTG TGCCTGGACG TGACCGGCGG GAACGTGGCC TCCGGCAACG TCAACGGCAC GCTGATCGAA CTGTGGGGCT GCAACGGAGG CGCGAACCAG CAGTGGAGCC TGGGGTAG
|
Protein sequence | MRHQAPRSRQ LRAAFAALAL VVGTMFGVAV AAPAQALGNG LALTPPMGWN DWNSFGCNVS ESLVEQTADL IVSSGMKDAG YQYVNIDDCW MSSNRDAGGN LVPDPAKFPD GISGTAAYVH SKGLKLGIYE SAGTATCAGY PGSLNHEQAD ANSFASWGVD YLKYDNCNNQ GIPAQTRYTA MRDALAKTGR PIVYSLCNWG QESVWTWGAG VGNLWRTTGD ISANFGSMLS NFHNTVGLAS SAGPGGWNDP DMLEVGNGMS FTEDRAEMSL WAEMAAPLIS GTDLRKATTA TLSLYTNKDV IAVDQDSLGK AGREIASSGG ADVLAKPLAN GDVAVALFNE NSSAQTISTS ASAIGIGSAS SYKLNNLWSH VLTSTSGSIS ASVPGHGVVL YRVSAGSGSS VGSAHRFLGA SSGRCLDVPN GSTTTGTQLD IWDCGTGTNQ SFTPTSDKEL RVYNGSLCLD ASGQGTTAGT KVITWTCNGQ TNQQWNLNAD GTVTGVQSGL CLDVTGGNVA SGNVNGTLIE LWGCNGGANQ QWSLG
|
| |