Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amir_1831 |
Symbol | |
ID | 8326016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | + |
Start bp | 2014885 |
End bp | 2016009 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644942380 |
Product | cellulose-binding family II |
Protein accession | YP_003099625 |
Protein GI | 256375965 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.783622 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGGAC CCGATGCGGT CAAGCCCGGA GTGGCGCGCA GGGCCGCCCT GGCCGCCGTG CTCTCCGCGC TGACCCTGCT GTGCGGCTTC CTGATCACGG TTCCCACCTC GCAGGCCGCC GCCGCCGCGC CCGTGCGCAT CATGCCGCTC GGCGACTCCA TCACCGGCTC GCCCGGCTGC TGGCGCGCGC TGCTCGACGT CGACCTCAAG GCCGCCGGCT ACACGAACAT CGACTTCGTG GGCACGCTCC CGTCGCAGGG GTGCGGCATC CCCCACGACG GCGACAACGA GGGCCACGGC GGGCTCCTGG TCACCAACGT GGCCGGACAG AACCAGCTCG TGCCGTGGTT GCAGGCCACC AGCCCCGACG TCGTGGTGAT GCACTTCGGC ACCAACGACG TGTGGAGCAA CATCGCGCCC GCGACGATCC TGGCGGCGTA CGGCAAGCTC GTCGACCAGA TGCGGGCGCA GAACCCGGCG ATGCGCATCC TGATCGCCAA GCTCATCCCG ATGGGCACCC CGCAGTGCGC CGACTGCGGG CAGCGCGTGG TGGCGTTCAA CGCGGCGATC CAGCCGTGGG CGACGTCGAA GACCACGGCC GCCTCGCCGA TCGTCGTGGT GGACCAGTGG ACCGGGTTCG ACACCGCCAC CGACACCTAC GACGGCGTGC ACCCGAACGC CTCGGGCGAC CGGAAGATCG CGAACCGGTG GTTCCCGGTG CTGCGGGACG CGCTCGGCGG CGTCCTGCCG ACGACGACGA CGACGGGTGG CACGACGACC ACGACCGGTG GCACCACCAC GACCACCATC CCGCCGCCCA CCGGCAGCTG CGCCGCGTCG GTGACCGTGG TGAACCGGTG GCAGGGCGGG TTCCAGGCCG ACGTGGTCTT CCGCAACACC AGCACCGTCG CGTCCACGGC GTGGAGCGTG CGGTTCTCGC TGGGCGGCGG GATGAGCGTC GCGCAGGCGT GGAACGGGAC GGCGACCACC AGCGGTTCGG TGGCGACCGT CGCGAACGCG GCGTGGAACG GCTCGGTGGC GCCCGCGGGT TCGGCGACGG CCGGGATCAT CGTCAACGGC GACCCGACGT CCTGGACGCC GTCGGCGAGC TGCACGAAGA GCTGA
|
Protein sequence | MTGPDAVKPG VARRAALAAV LSALTLLCGF LITVPTSQAA AAAPVRIMPL GDSITGSPGC WRALLDVDLK AAGYTNIDFV GTLPSQGCGI PHDGDNEGHG GLLVTNVAGQ NQLVPWLQAT SPDVVVMHFG TNDVWSNIAP ATILAAYGKL VDQMRAQNPA MRILIAKLIP MGTPQCADCG QRVVAFNAAI QPWATSKTTA ASPIVVVDQW TGFDTATDTY DGVHPNASGD RKIANRWFPV LRDALGGVLP TTTTTGGTTT TTGGTTTTTI PPPTGSCAAS VTVVNRWQGG FQADVVFRNT STVASTAWSV RFSLGGGMSV AQAWNGTATT SGSVATVANA AWNGSVAPAG SATAGIIVNG DPTSWTPSAS CTKS
|
| |