Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amir_4152 |
Symbol | |
ID | 8328345 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Actinosynnema mirum DSM 43827 |
Kingdom | Bacteria |
Replicon accession | NC_013093 |
Strand | - |
Start bp | 4885000 |
End bp | 4886349 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644944616 |
Product | cellulose-binding family II |
Protein accession | YP_003101853 |
Protein GI | 256378193 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCAGGG TATTCGCCTT GCTCGGCGCC ACGCTGCTGG CCATCGCCGG CCTCGTGGTC GTCGCCCAAC CGCCGCAGGC GCAAGCCGCC GTCGGCCTGC ACGTCAGCGG CACGAAGATC GTGGAGGCCA ACGGGCAGCC GTTCGTCATG CGCGGGGTCA ACCACCCGCA CGTCTGGTAC ACCGGCCGCA CCAGCTCGTT CGCCGACGTC AAGGCGCTCG GCTCGAACAC GGTGCGCGTG GTGCTGGGCA GCGGCAAGCG GTGGGGGCCG TCCAGCGACA CCGCCGCGGT CATCGCGCTG TGCAAGCAGA ACAAGCTGAT CTGCGTGCTG GAGGTGCACG ACACCACCGG GTACGGAGAG GAGGCCGCGG CCGCCTCGCT CGACGAGGCG GTGGACTACT GGATCTCCCA GAAGTCCGCG CTGGTCGGCC AGGAGGACTA CGTCGTCATC AACATCGGCA ACGAGCCCAT CGGCAACACC AACGCCGCCC AGTGGACCGA CGCGACCGTC AACGCGGTCA AGAAGATGCG GACCAACGGG TTCCAGCACC TGCTGATGGT CGACGGCCCG AACTGGGGCC AGGACTGGCA GTACGTCATG CGCGACAACG CGCAGCGCGT CCTGGACGCC GACACCCAGC GCAACACCGT GCTGTCGATC CACATGTACG CGGTGTTCAG CACGCCCGCC AGCATCATCG ACTACCTGGA CCGCTTCCAG GCCAACGGGT GGCCGCTGGT GATCGGCGAG TTCGGCTGGA AGTTCGCCTC CGGCGAGGTC GACCACGAGA CCATCCTCGC CCAGGCGCAG GCCAGGGGGA TGGGCTACCT GGGCTGGTCG TGGAGCGGCA ACACCGATCC GATCCTGGAC ATGGCCACCG ACTTCGACCC GGCCAGGCTG ACCACGTGGG GGCAGCGCAT CTTCAACGGC GCCAACGGGA TCAAGGCCAC CTCGCGCGAG GCGTCGATCT ACGGCGGCGG CAACCAGCCC ACCAGCACGA CGACGACGAC CACCACCACG ACGACCACCA CGACCACGGT CCCGCCCACC GGCTCGTGCA CCGCGAGCTA CACCACCACC GGCCAGTGGC AGGGCGGGTT CCAGGCGGAG GTGCGGGTCA CGGCGGGGTC CAAGGCGATC AAGTCGTGGA CGGTGACCTG GACGTTCGCG GACGGCCAGA CCGTGAGCAA CGCGTGGAAC GCCGACGTCA CCTCGTCCGG CGCGACCGTG ACCGCGCGCA ACGCGTCGTA CAACGGGTCG CTGGGGGCGG GCGCGAGCAC CGCGTTCGGC TTCCTCGGCA CGACGCGCGG CGCGAACACC GCGCCGACGC TGAGCTGCGC GGCGTCCTAG
|
Protein sequence | MTRVFALLGA TLLAIAGLVV VAQPPQAQAA VGLHVSGTKI VEANGQPFVM RGVNHPHVWY TGRTSSFADV KALGSNTVRV VLGSGKRWGP SSDTAAVIAL CKQNKLICVL EVHDTTGYGE EAAAASLDEA VDYWISQKSA LVGQEDYVVI NIGNEPIGNT NAAQWTDATV NAVKKMRTNG FQHLLMVDGP NWGQDWQYVM RDNAQRVLDA DTQRNTVLSI HMYAVFSTPA SIIDYLDRFQ ANGWPLVIGE FGWKFASGEV DHETILAQAQ ARGMGYLGWS WSGNTDPILD MATDFDPARL TTWGQRIFNG ANGIKATSRE ASIYGGGNQP TSTTTTTTTT TTTTTTVPPT GSCTASYTTT GQWQGGFQAE VRVTAGSKAI KSWTVTWTFA DGQTVSNAWN ADVTSSGATV TARNASYNGS LGAGASTAFG FLGTTRGANT APTLSCAAS
|
| |