Gene Information |
Locus tag | Amir_1868 |
Symbol | |
ID | 8326053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | + |
Start bp | 2062832 |
End bp | 2065066 |
Gene Length | 2235 bp |
Protein Length | 744 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644942417 |
Product | cellulose-binding family II |
Protein accession | YP_003099662 |
Protein GI | 256376002 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
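The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based, inclusive positions, and that the CDS ends in a single stop codon that is not counted in the protein length. The record does not state either convention, but its own numbers are consistent with both. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 2062832
end_bp = 2065066
gene_length_bp = 2235
protein_length_aa = 744

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: a non-spliced CDS is a whole number of codons,
# one per residue plus one terminal stop codon.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # prints: 2235 0 744
```

Under these assumptions the computed span matches the reported Gene Length, and the codon count minus the stop codon matches the reported Protein Length.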
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGCTCACCA ACTCGTTCCG CGCGCGGTTG GGCGCGGCCG TCGCCGCCGC CACCGCGCTG AGCGGCGGAC TGGTCGTCGT GGGCACGGCT TCGGCCGCCC CAGCCTGCAA GGTCGACTAC ACCGTGCAGA ACCAGTGGCA GGGCGGCTTC TCCGCCAACG TCACCGTCAC CAACCTCGGC GACCCCGTCG ACGGGTGGTC CCTCGCCTGG ACCGCCGCCT CCGGCGAGAA GGTCGACCAG GGGTGGAACG CGGCGTTCTC CTCCTCGGGC ACGTCGATCA CCGCGTCCAA CGTGGACTGG AACAGGGCCA TCCCCACCGG TGGCTCGGTC TCGTTCGGGT TCAACGGTTC CCGGACCGGC GAGCCGGTCG TCCCGTCGAG CTTCGCGCTC AACGGCACCA CGTGCACCGG CGGCGTCGCG CCCACCACGA CCACCACCCC CGGCGGCCCC ACCACCACGA CCGGGCCGAC CACCACGACC ACCGGCGGCG GCACCGCGCC GCTGCCCGAC CCGACCGGGC GCAAGCAGGT CGAGCGGCTC GACCGCGGCC TGATCAGCGT CCGCTCCGGC AGCGGCAACC TGGTCAGCTG GCGGCTCCTG GGCGGCGACC CGCAGAACGT GGCGTTCACC GTCTACCGGG GCGGCGCCAA GCTCACCGCC TCGCCGATCA CCGGGTCCAC GAACTACCTG GACAGCGGCG CGCCGGCCGA CGCCTCCTAC ACGGTCCGCG CGGTGGTCAA CGGCGTCGAG CAGCCCGCGT CCCCGGCCTC GCTGGCCTTC GGCAACGGCT ACCGGGACGT GCCGATCCAG CCCCCGTCCG GCTCCTACGC CGCGAACGAC GCGAGCGTCG GCGACCTGGA CGGCGACGGG GTCTACGAGT TCGTCCTCAA GTGGGAGCCC AACAACGCCA AGGACAACTC CCAGTCCGGC GTCACCGACG TGGTCTACGT CGACGCCTAC AAGCTCAACG GCGCCAGGCT GTGGCGCGTC AACCTGGGCC GCAACATCCG GGCGGGCGCG CACTACACCC AGTTCCAGGT CTACGACTAC GACGGCGACG GCAAGGCCGA GGTGGCCATG AAGACCGCCG ACGCCACCGT GGACGGGCGC GGCACGGTGA TCGGCAACGC CTCCGCCGAC CACCGCAACT CCTCCGGCTA CGTGCTGGCG GGCCCCGAGT ACCTGACCAT GTTCAACGGC CAGACCGGCG CCGCGATGTC CACGGTCGAC TACGACCCGC CGCGCGGCAC CGTCGCCGAC TGGGGCGATA ACTACGGCAA CCGGGTCGAC CGGTTCCTCG CCGGGACCGC CTACCTGGAC GGCGCACGCC CCTCGCTGAT CATGGCTCGT GGGTACTACA CCCGCGCGGT GATCGCGGCC TGGGACTTCC GCAACGGGGC GCTCACCAAG CGCTGGGTCT TCGACTCCAA GGCCTCCGGC AACAGCGGCT ACGCGGGGCA GGGCAACCAC TCGCTCACCA TCGGCGACGT CGACGCGGAC GGCCGGGACG AGATCGTCTA CGGCGCGGCG GCCGTCGACG ACAGCGGCAA GGGCCTGTGG AACACCGGGC TCGGGCACGG GGACGCCGCG CACCTGGGCG ACCTCGACCC GTCGCGGGCG GGCCTGGAGT ACTTCAAGGT CGACGAGGAC ACCTCCAAGC CCGGCTCGTT CTTCGCCGAC GCCCGCACCG GTTCCCGCGT GTGGACCACG GCGTCCGGCG GCGACAACGG GCGCGGCGTC GCGGGCGACA TCTGGGCGGG CAGCCCCGGA GCCGAGGCGT GGTCGGCGCG GGACGCGAAC 
CTGCGCAGCG CCAAGGGCGC GGAGATCGGG CGCAAGCCCG GCTCCATCAA CTTCCTGGCC TGGTGGGACG GCGACCCGGT GCGGGAGCTG GTGGACCAGA CCCGGATCGA CAAGTACGGC ACCGGCGGCG ACACCCGCCT GCTGACCGCC TCCGACGTGC ACAGCAACAA CGGCACCAAG GCCACCCCGT CGCTGTCCGG CGACCTGTTC GGCGACTGGC GCGAGGAGGT GGTGTGGCCC ACCACGAACA ACACCGCGCT GCGGATCCAC TCCACCCCGC ACCAGACCGA CCGGCGCATC CCGACCCTGA TGCACGACAC CATGTACCGG GTCGCGATCG CCTGGCAGAA CACGGCCTAC AACCAGCCGC CGCACCCGAG CTTCTTCATC GGCGACGGCA TGGCCACCCC GCCCTGGCCG GACGTCCACT ACTGA
Protein sequence | MLTNSFRARL GAAVAAATAL SGGLVVVGTA SAAPACKVDY TVQNQWQGGF SANVTVTNLG DPVDGWSLAW TAASGEKVDQ GWNAAFSSSG TSITASNVDW NRAIPTGGSV SFGFNGSRTG EPVVPSSFAL NGTTCTGGVA PTTTTTPGGP TTTTGPTTTT TGGGTAPLPD PTGRKQVERL DRGLISVRSG SGNLVSWRLL GGDPQNVAFT VYRGGAKLTA SPITGSTNYL DSGAPADASY TVRAVVNGVE QPASPASLAF GNGYRDVPIQ PPSGSYAAND ASVGDLDGDG VYEFVLKWEP NNAKDNSQSG VTDVVYVDAY KLNGARLWRV NLGRNIRAGA HYTQFQVYDY DGDGKAEVAM KTADATVDGR GTVIGNASAD HRNSSGYVLA GPEYLTMFNG QTGAAMSTVD YDPPRGTVAD WGDNYGNRVD RFLAGTAYLD GARPSLIMAR GYYTRAVIAA WDFRNGALTK RWVFDSKASG NSGYAGQGNH SLTIGDVDAD GRDEIVYGAA AVDDSGKGLW NTGLGHGDAA HLGDLDPSRA GLEYFKVDED TSKPGSFFAD ARTGSRVWTT ASGGDNGRGV AGDIWAGSPG AEAWSARDAN LRSAKGAEIG RKPGSINFLA WWDGDPVREL VDQTRIDKYG TGGDTRLLTA SDVHSNNGTK ATPSLSGDLF GDWREEVVWP TTNNTALRIH STPHQTDRRI PTLMHDTMYR VAIAWQNTAY NQPPHPSFFI GDGMATPPWP DVHY
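The GC content field can in principle be recomputed from the gene sequence. The sketch below shows one way to do this for a sequence written in space-separated 10-base blocks, as in this record. The function name `gc_fraction` is hypothetical. The example runs on the first three blocks only, so its result is not the whole-gene figure of 72% reported above. The sketch also extracts the first codon, GTG, which translation table 11 accepts as an alternative start codon read as methionine. This agrees with the protein sequence beginning with M.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First three 10-base blocks of the gene sequence above (illustration only).
prefix = "GTGCTCACCA ACTCGTTCCG CGCGCGGTTG"

# First codon of the CDS, taken after removing the block separators.
start_codon = "".join(prefix.split())[:3]

print(round(gc_fraction(prefix), 2), start_codon)
```

Passing the full gene sequence to the same function would give a whole-gene value to compare with the reported GC content.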