Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tfu_2990 |
Symbol | |
ID | 3581914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobifida fusca YX |
Kingdom | Bacteria |
Replicon accession | NC_007333 |
Strand | + |
Start bp | 3501674 |
End bp | 3502774 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637686721 |
Product | cellulose-binding family II protein |
Protein accession | YP_291046 |
Protein GI | 72163389 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.826139 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCGTA CCTGGGCACG TGCGGTCACC GGCGCGTGCA GCGCACTGGT GGTAGCAGCG GCCACGCTCG TCATCGGCGG CGCCCCGGCA GCCTCGGCAG CCGACGGCTG CTCTGCGGAA TACACGGTTG CCAGCGACTG GGGAAGCGGT TTCGTCGGCA ACGTCACCGT CACCAACACC AGCGGAACGC CCGCCACCGG GTGGACGGTC CAGTGGACCC TGCCGTCCGG TCACACCATC ACCAACACGT GGAACGCCGA ACTGTCCGTG AACGGCTCCA CAGTGACCGC CACCAACGCC TCGTGGAACG GGTCGCTCCC CGTGGGCGGC AGCGCGTCCT TCGGTTTCCA GGGCACGGGG TCGGGCGCCT CCTCCCTCCC CACCGACATC GCGTGCTTCC TCGACGCCCC GGGCGGCGGC ACCCCGGGCG GCCCTAACGA GCCCGGTGGC CCCGATGAGC CCGGCGGTCC TGGAACCCCG GGCGAACCGG TGCGGATCAT GCCGCTGGGC GACTCGATCA CCGGCTCCCC CGGCTGCTGG CGGGCCCTCC TGTGGCGTGA CCTGACCGAC GCCGGATACA CCAACATCGA CTTCGTCGGC TCCCGCGCCG GCGACGGCTG CGGCTTCCCC TACGACCACG AAAACGAAGG CCACGGGGGC ATGCTGGTCA CCAACCTGGC TCGCAGCGGA CAACTGTCCA CCTGGCTGTC CGCCACCAAC CCCGACATCG TGCTCATGCA CTTCGGCACC AACGACGTCT GGAGCTCTCT GCCCACCCAG ACGATCCTCG ACGCCTACAG CACGCTGGTC AGTCAGATGC GGGCCAACAA CCCGAACATG ACGATCCTCG TGGCCCAGAT CATCCCTATG GACTCGGCGC GAAGCTGCGC CACCTGCGCC CAGGGCGTGC AAGCCCTCAA CGCTGCGATC CCCGCGTGGG CGGCCAGCGA AAGCACCGCC CAGTCCCCCG TCATCGTCGT GGACCAGTGG ACCGGATTCG ACACTGACGC CGACACCTAC GACGGGGTGC ACCCCAACGC TTCCGGGGAC GCCAAGATCG CGCAGAACTG GCTGGAGGCG CTGATCCCGC TGCTTGACTA A
|
Protein sequence | MSRTWARAVT GACSALVVAA ATLVIGGAPA ASAADGCSAE YTVASDWGSG FVGNVTVTNT SGTPATGWTV QWTLPSGHTI TNTWNAELSV NGSTVTATNA SWNGSLPVGG SASFGFQGTG SGASSLPTDI ACFLDAPGGG TPGGPNEPGG PDEPGGPGTP GEPVRIMPLG DSITGSPGCW RALLWRDLTD AGYTNIDFVG SRAGDGCGFP YDHENEGHGG MLVTNLARSG QLSTWLSATN PDIVLMHFGT NDVWSSLPTQ TILDAYSTLV SQMRANNPNM TILVAQIIPM DSARSCATCA QGVQALNAAI PAWAASESTA QSPVIVVDQW TGFDTDADTY DGVHPNASGD AKIAQNWLEA LIPLLD
|
| |