Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amir_3536 |
Symbol | |
ID | 8327726 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | + |
Start bp | 4109560 |
End bp | 4110729 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 644944034 |
Product | cellulose-binding family II |
Protein accession | YP_003101274 |
Protein GI | 256377614 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0870492 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCACCC CGTTCCGGCT CGCGGCGCTC GCCGCGGCCC TGGTCGCGGT CACCGGGTTG GTCGTGCTCG GCGGCGGTGG AGGCGGCGGC ACCGCCGTCG CCGCGCCCGC CTGCGCCGTC GACTACACGC CCAACCAGTG GGCCACCGGC TTCACCACCG AGGTCGCGGT CACCAACCGC GCCGCGCCCG TCACCACCTG GACGCTGACC TGGACCTGGG CGGGCAACCA GACCGTCACC TCGGCCTGGA ACGCCCAGGT CACCCAGACC GGCGCCAAGG TCACCGCCCG CGACGTCGGC TGGAACGGCC AGCTCGGCAC CGGCGCGACC GCCCGCTTCG GGTTCCAGGG CGCCTACTCC GGGACCAGCG CCGCCCCCAC CGACTTCGCC CTCAACGGCA CCCCCTGCAA CGGCGACACC CCGCCCACCA GCACCACCAC CCCGCCGACC ACCACGCCGC CGACCACGAC CACCACCCCG CAGCAGCCCA CCGGCTGCGG CACCGCGACC CTGTGCGACG ACTTCGAGTC CCAGACCGGG ACCACCCCCT CCGGCAAGTG GAGCGTCGGC GCGGCCAACT GCACCGGCAC CGGCACGGTC ACCGTCGACA GCTCCGTCGC GCGCAGCGGC TCCAAGTCGG TCCGGGTCAA CGGCGGCGTC GGCTACTGCA ACCACATCTT CTTCGGCGCC AGCGTGAGCG GACCGGTCGT GCACGGCCGG TTCCACGTCC GGCACACCAC CGCGCTGCCC GCCGCGCACG TGACGTTCAT GGCGCTCAAG GACTCCGCCG ACGGCGGCAA GGACCTGCGC ATGGGCGGCC AGAACGGCGC GCTGCAGTGG AACCGCGAGT CCGACGACGC CACCCTGCCC GCGCAGAGCC CCCAGGGCGT CGCGCAGAGC ATCCCGCTGC CCACCGGCCG CTGGACCTGC GTGCAGTTCA CCCTCGACGG CACGAGCGGC AGGCTCAGCA CCAGCGTCGA CGGGGTCGCC GTCCCCGGCC TGCAGGTCGA CGGCGCCCCC ACGCCCGACA TCGACCAGCA GTGGCTCGCC CGCGCCAACT GGCGGCCCAA CACGGTGGAC GTGCGGCTGG GCTGGGAGAG CTACGGCGAC GGCGGCGACA CCCTTTGGTA TGACGATGTC GCCTTCGGGA GTGCACCTCT GGCCTGCTAG
|
Protein sequence | MRTPFRLAAL AAALVAVTGL VVLGGGGGGG TAVAAPACAV DYTPNQWATG FTTEVAVTNR AAPVTTWTLT WTWAGNQTVT SAWNAQVTQT GAKVTARDVG WNGQLGTGAT ARFGFQGAYS GTSAAPTDFA LNGTPCNGDT PPTSTTTPPT TTPPTTTTTP QQPTGCGTAT LCDDFESQTG TTPSGKWSVG AANCTGTGTV TVDSSVARSG SKSVRVNGGV GYCNHIFFGA SVSGPVVHGR FHVRHTTALP AAHVTFMALK DSADGGKDLR MGGQNGALQW NRESDDATLP AQSPQGVAQS IPLPTGRWTC VQFTLDGTSG RLSTSVDGVA VPGLQVDGAP TPDIDQQWLA RANWRPNTVD VRLGWESYGD GGDTLWYDDV AFGSAPLAC
|
| |