Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_6906 |
Symbol | |
ID | 8338272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 7975935 |
End bp | 7977821 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644959993 |
Product | glycoside hydrolase family 16 |
Protein accession | YP_003117584 |
Protein GI | 256396020 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2273] Beta-glucanase/Beta-glucan synthetase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00349487 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.000227833 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTCATCTA GTTCCAACGA CGCTGCACCC TGGTCGCGCC GTCAATTCCT TACAACCGCA AGCGTTACTG CCGCCGTCGG GCTCATCGCT GCGCCCCGCG CCGCGGCTGG TCAGCCTTCG GGTGCTTCGG CTGGCGCCGC GGTCGCACCG ACCATCACCA CGGCTGCCGC CTCAGGACCG CCCAGCAGCG ACTATGTGTT GGTCTGGAAC GACGAGTTCA GCGGCACGAG CATCGACACC AGCAAGTGGG GCTTCAACTG GCCGGACACG ATGCCCAACA CCCGGTTCTT CGTCGATGTC AGCGCGTACA ACCCCAGCTA CGTCACCGCG CACGACGGCT ACGCGTGGAT CAAGACCGAC GCCACTTCCA CCGTCATTGA CGGCGTGACC CTGCCGTACC AGACCGGGTT CCTGTCCACG AAGGGCAAGT TCGAGTTCCA GTACGGCTAC GTCGAGACCC GCTTCAGACA TGCCGCCTCG GCGACTCCCT CGGCTCCCGA ACACACCGGC ATCGGCGGCT ACCACGCGTT CTGGATGCTC CCGCACTACA ACGACTCGTC TTCAACAACT TCGGGCACGC CGTCGCTGCA CGGCACGGAC GTCAACGGCG CCGACGGGCG TGGCAGCGAG GTGGACATCA CTGAATGGGG CGCCAGCGGC ACCTCTGTCA ACAACAGCGA ATTCTGGGGA GGCTACGGCA CCGGCGGCAC CGGCTTCACG GGACCGCACC TGACGACCAC CAACAGCGAT CCGCAGGGCT GGCACACCTA CGGGCTTGAG TGGACGCCGA CCAAGCTGGT CTTCTACCAG GACGGCGTCG TGACCAACAC CATCAGCAAC ACCAACGCGT TCATCTCGTT CTACCCCGAG TTCCTGATGA TCCAGACCGG GATCGTCAAA TGGCAGACGA CCCCCAACAA CCTGCCCGAC TACCTACAGA TCGACTACGT ACGCTGCTAC CAGCTGCCCA ACGTCACCTA CCAGGCCGAG AGCCAGGCGC TGTCCGGGGG AGCGGGCGTC AACACCAACC ACACGGGCTA CACCGGCACT GGGTTCGTCG ACGGCTTCGC CGCAAGCGTC GGCGCGACCC TGACCACAAC CGTCAACACC TCGGCGACGG CAACCAAGAC CCTGACGATC CGCTACTCCG CCGGACCGAT CAGCGGTGCT CCAACCAACC GAGTGCTCGG CCTCTATGTC AACGGCACGA AGGTCAGGGA CGTGACCTTC ACCGGCACCG CGGACTGGAA CACCTGGGCG AACAATACCC AGTCGATCAC GCTGCCCGGA GGAGCCGGCA ACACGATCGC GATCATCAGC GAACAAACCG GCAATGCCAG CGGCGTCAAC ATCGACTACA TAGCACTGCC CAACAGCGCC GACGTGGTCT CCGGAAACCG GGTCGTGAAC GCCGGCTTTG ACGACAACGC CGCCTCCTTC GGCCTGGGGG CGAACACGAT CGCCGACATC GCGGGTTGGT ACGTCTGGTC GCCCACCGGC TCGGACGACG ACGCCTCGTA CCTCGAGTAC GGTTCCGACT ACTCCAACGG ATGGAACGCG GTCCACTACA AGGCTTCGAA CTACGTGGTC TACACGGGCC AGGACATCAC GCTGCCGAAC GGCACCTACA ACCTCTCGGC CTACGTGCGA TCCAGCGGCG GCCAGAATGC CTGCTACCTC GAGGCCAAGG GGTTCGGCGG GTCGGCGGTG CACCAGAACG TCACAGCGGA GTCGAAGTAT AGGAAGGTGT CCATCAACGG GATCCACGTC ACCGACGGGA AGATCACGGT TGGTTTCTAC TCCGACGCTC TTTCGGCACA GTGGCTCGCG TTCGACAACG TGCAGCTACT TCCGGCGACT GGTCCGATCA CAGTGCCTGC GACCTGA
|
Protein sequence | MSSSSNDAAP WSRRQFLTTA SVTAAVGLIA APRAAAGQPS GASAGAAVAP TITTAAASGP PSSDYVLVWN DEFSGTSIDT SKWGFNWPDT MPNTRFFVDV SAYNPSYVTA HDGYAWIKTD ATSTVIDGVT LPYQTGFLST KGKFEFQYGY VETRFRHAAS ATPSAPEHTG IGGYHAFWML PHYNDSSSTT SGTPSLHGTD VNGADGRGSE VDITEWGASG TSVNNSEFWG GYGTGGTGFT GPHLTTTNSD PQGWHTYGLE WTPTKLVFYQ DGVVTNTISN TNAFISFYPE FLMIQTGIVK WQTTPNNLPD YLQIDYVRCY QLPNVTYQAE SQALSGGAGV NTNHTGYTGT GFVDGFAASV GATLTTTVNT SATATKTLTI RYSAGPISGA PTNRVLGLYV NGTKVRDVTF TGTADWNTWA NNTQSITLPG GAGNTIAIIS EQTGNASGVN IDYIALPNSA DVVSGNRVVN AGFDDNAASF GLGANTIADI AGWYVWSPTG SDDDASYLEY GSDYSNGWNA VHYKASNYVV YTGQDITLPN GTYNLSAYVR SSGGQNACYL EAKGFGGSAV HQNVTAESKY RKVSINGIHV TDGKITVGFY SDALSAQWLA FDNVQLLPAT GPITVPAT
|
| |