Gene Information |
Locus tag | Tfu_2009 |
Symbol | |
ID | 3580882 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Thermobifida fusca YX |
Kingdom | Bacteria |
Replicon accession | NC_007333 |
Strand | - |
Start bp | 2347443 |
End bp | 2349452 |
Gene Length | 2010 bp |
Protein Length | 669 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637685702 |
Product | cellulose-binding family II protein |
Protein accession | YP_290065 |
Protein GI | 72162408 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
| |
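The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Tfu_2009.
length_bp = gene_length(2347443, 2349452)
print(length_bp)                  # 2010
print(protein_length(length_bp))  # 669
```

Under these assumptions the computed values match the listed Gene Length (2010 bp) and Protein Length (669 aa).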
Plasmid Coverage Information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.872108 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGAGAGT ATCCCCCCAC GCGGCGCCGA CCGGTGCGCT TCGGCGCGGC CCTTGCCGCA TTCGTGCTCG GTGCCACCGG TGCTGCGGCT CTCCCCAGCC CAGCCCACGC CGCCGCCGGA TGCAGCGTCG ACTACAGCGT CAACCAGTGG AACGACGGTT TCACTGGAAC CGTCACCGTC ACCAACCTGG GCGACCCCGT CAACGGCTGG ACCCTCAGTT GGCGCTTCCC TTCCGGGGAG CGGATCACCC ACGGGTGGAA CGCTGAGTTC CAGACCAACG GCGCCGAGGT CACCGCCGCC AACACTCCCT GGAACGCCAG CGTCCCCACG GGCGGCACCG TCAGTTTTGG CTTCAACGCC ACCCACAACG GCACGGTCGG CATCCCCGAG TCCTTCACCT TCAACGGCAC CGTATGCACT GACCAGCCCA CGCCGGGCGG GCCAGAAGAA CCCGGCGGCC CCGAGGAGCC CGGCGGGCCT GAAGAACCAG GCGAACCCGA AGAACCCGCC CTCCCCGAAC CCACCGGAGC CCGCCAAGCG GAACGGCTCG ACCGCGGACT GATCAGCGTG CGCAGCGGCA ACGGGAACCT GGTGAGCTGG CGGCTGCTCG GCTCGGACCC CCGCGACATC GCGTTCAACG TCTACCGCGG ATCCACCCGC GTCAACTCCA CCCCCCTCAC CTCCGCCACC TCCTACCTTG ACGCCGGCGC CCCAGCCGAC GCCTCCTACA CAGTGCGGCC GGTCGTCGAC GGCGTGGAAC TGGGCCCCTC CGCAGCCTCC CTCAACTTCA CCAACGGCTA CCTGGACGTG CCGTTGCAGC GTCCCGCGGG CGGCACCGTC CACGGCTCCT CCTACACCTA TGAGGCCAAC GACGCCAGCG TCGGCGACCT CGACGGCGAC GGCCGCTACG AGATCGTCCT GAAGTGGGAG CCCACCAACG CCAAGGACAA CTCCCAGTCC GGCTACACCG GGCCGGTCCT CATCGACGCC TACGAACTCG ACGGCACCCT CCTGTGGCGG ATCAACCTGG GCATCAACAT CCGCGCCGGG GCCCACTACA CCCAGTTCCA GGTCTATGAC TACGACGGCG ACGGGCGCGC CGAAGTCGCC ATGAAAACCG CTGACGGGAC CCGCGACGGC ACCGGGGCGG TGATCGGCTC CGCCAACGCC GACTACCGCA ACTCCTCCGG ATACGTCCTG TCCGGACCGG AATACCTCAC CGTCTTCGAC GGGCGCACCG GCCGGGCCCT GGACACAGTG GACTACGTGC CGCCCCGCGG CAACGTGTCC TCGTGGGGCG ACTCCTACGG CAACCGGGTG GACCGCTTCC TCGCCGGCAC CGCCTACCTG GACGGGAAAC GGCCCAGCAT GATCTTCTCC CGCGGCTACT ACACCCGAAC CGTCATCACA GCGTGGGACT TCCGCGACGG ACGACTCACC CGCCGGTGGA CCTTCGACAC CAACAGCTCC ACCAACACCG GACGCGGCTA CGAAGGCCAA GGGTTCCACT CCCTGTCCAT CGCGGACGCC GACGGCGACG GCCGTGACGA GATCATGTTC GGCGCCATGG CCGTCGACGA TGACGGGCGC GGCATGTGGA CCACCGGATA CGGCCACGGC GACGCCCTGC ACGTCGGCGA CTTCGTCCCC GCCCGGCCCG GACTCGAGGT GTACGGGGTT TCCGAAAGCT CCTCGCAGCC CAACGCTTGG CTCGCCGACG CGCGCACAGG AAGCACCCTG TGGCGCACCG CCTCCGGCGA CGACAACGGG CGCGGCGTCG CCGGAGATAT CTGGGCAGGC 
AGCCCCGGCG CCGAATTCTG GTCCTCACGG GTGGACGGCC TACTCAACAC CTCCGGAACC GCCATCGGCC GCAAACCCAG CTCGATCAAC TTCCTCGTCT GGTGGGACGG AGACCCCAGC CGGGAACTGC TGGACCAGAC CCGGATCGAC AAGTACGGTC CGAACGGCGA CACGCGAGGA GCGCGTCCGC GCTCAACTCC ACTCCTTTAG
|
Protein sequence | MREYPPTRRR PVRFGAALAA FVLGATGAAA LPSPAHAAAG CSVDYSVNQW NDGFTGTVTV TNLGDPVNGW TLSWRFPSGE RITHGWNAEF QTNGAEVTAA NTPWNASVPT GGTVSFGFNA THNGTVGIPE SFTFNGTVCT DQPTPGGPEE PGGPEEPGGP EEPGEPEEPA LPEPTGARQA ERLDRGLISV RSGNGNLVSW RLLGSDPRDI AFNVYRGSTR VNSTPLTSAT SYLDAGAPAD ASYTVRPVVD GVELGPSAAS LNFTNGYLDV PLQRPAGGTV HGSSYTYEAN DASVGDLDGD GRYEIVLKWE PTNAKDNSQS GYTGPVLIDA YELDGTLLWR INLGINIRAG AHYTQFQVYD YDGDGRAEVA MKTADGTRDG TGAVIGSANA DYRNSSGYVL SGPEYLTVFD GRTGRALDTV DYVPPRGNVS SWGDSYGNRV DRFLAGTAYL DGKRPSMIFS RGYYTRTVIT AWDFRDGRLT RRWTFDTNSS TNTGRGYEGQ GFHSLSIADA DGDGRDEIMF GAMAVDDDGR GMWTTGYGHG DALHVGDFVP ARPGLEVYGV SESSSQPNAW LADARTGSTL WRTASGDDNG RGVAGDIWAG SPGAEFWSSR VDGLLNTSGT AIGRKPSSIN FLVWWDGDPS RELLDQTRID KYGPNGDTRG ARPRSTPLL
|
| |
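The listed gene sequence begins with GTG while the protein begins with M, which is what translation table 11 (bacterial, archaeal and plant plastid code) implies: GTG is an alternative initiation codon and is read as methionine at the first position. The sketch below shows one way to strip the 10-base grouping, compute GC content, and translate a CDS under table 11. It is a minimal illustration, not the database's own pipeline. The names `clean`, `gc_percent`, and `translate_cds` are hypothetical, and the code assumes the sequence is given on the coding strand and contains only A, C, G, and T.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
# Table 11 shares these codon assignments with the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons of table 11; each is read as M at the first position.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base grouping and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a coding-strand CDS, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for index in range(0, len(s) - len(s) % 3, 3):
        codon = s[index:index + 3]
        residue = CODON_TABLE[codon]  # KeyError on ambiguous bases such as N
        if index == 0 and codon in START_CODONS:
            residue = "M"  # alternative start codons still initiate with Met
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate_cds("GTGCGAGAGT ATCCCCCCAC GCGGCGCCGA"))  # MREYPPTRRR
```

Passing the full gene sequence to `gc_percent` and `translate_cds` gives values that can be compared against the GC content and protein sequence fields of the record.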