Gene Tfu_2009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTfu_2009 
Symbol 
ID3580882 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermobifida fusca YX 
KingdomBacteria 
Replicon accessionNC_007333 
Strand
Start bp2347443 
End bp2349452 
Gene Length2010 bp 
Protein Length669 aa 
Translation table11 
GC content69% 
IMG OID637685702 
Productcellulose-binding family II protein 
Protein accessionYP_290065 
Protein GI72162408 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.872108 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGAGAGT ATCCCCCCAC GCGGCGCCGA CCGGTGCGCT TCGGCGCGGC CCTTGCCGCA 
TTCGTGCTCG GTGCCACCGG TGCTGCGGCT CTCCCCAGCC CAGCCCACGC CGCCGCCGGA
TGCAGCGTCG ACTACAGCGT CAACCAGTGG AACGACGGTT TCACTGGAAC CGTCACCGTC
ACCAACCTGG GCGACCCCGT CAACGGCTGG ACCCTCAGTT GGCGCTTCCC TTCCGGGGAG
CGGATCACCC ACGGGTGGAA CGCTGAGTTC CAGACCAACG GCGCCGAGGT CACCGCCGCC
AACACTCCCT GGAACGCCAG CGTCCCCACG GGCGGCACCG TCAGTTTTGG CTTCAACGCC
ACCCACAACG GCACGGTCGG CATCCCCGAG TCCTTCACCT TCAACGGCAC CGTATGCACT
GACCAGCCCA CGCCGGGCGG GCCAGAAGAA CCCGGCGGCC CCGAGGAGCC CGGCGGGCCT
GAAGAACCAG GCGAACCCGA AGAACCCGCC CTCCCCGAAC CCACCGGAGC CCGCCAAGCG
GAACGGCTCG ACCGCGGACT GATCAGCGTG CGCAGCGGCA ACGGGAACCT GGTGAGCTGG
CGGCTGCTCG GCTCGGACCC CCGCGACATC GCGTTCAACG TCTACCGCGG ATCCACCCGC
GTCAACTCCA CCCCCCTCAC CTCCGCCACC TCCTACCTTG ACGCCGGCGC CCCAGCCGAC
GCCTCCTACA CAGTGCGGCC GGTCGTCGAC GGCGTGGAAC TGGGCCCCTC CGCAGCCTCC
CTCAACTTCA CCAACGGCTA CCTGGACGTG CCGTTGCAGC GTCCCGCGGG CGGCACCGTC
CACGGCTCCT CCTACACCTA TGAGGCCAAC GACGCCAGCG TCGGCGACCT CGACGGCGAC
GGCCGCTACG AGATCGTCCT GAAGTGGGAG CCCACCAACG CCAAGGACAA CTCCCAGTCC
GGCTACACCG GGCCGGTCCT CATCGACGCC TACGAACTCG ACGGCACCCT CCTGTGGCGG
ATCAACCTGG GCATCAACAT CCGCGCCGGG GCCCACTACA CCCAGTTCCA GGTCTATGAC
TACGACGGCG ACGGGCGCGC CGAAGTCGCC ATGAAAACCG CTGACGGGAC CCGCGACGGC
ACCGGGGCGG TGATCGGCTC CGCCAACGCC GACTACCGCA ACTCCTCCGG ATACGTCCTG
TCCGGACCGG AATACCTCAC CGTCTTCGAC GGGCGCACCG GCCGGGCCCT GGACACAGTG
GACTACGTGC CGCCCCGCGG CAACGTGTCC TCGTGGGGCG ACTCCTACGG CAACCGGGTG
GACCGCTTCC TCGCCGGCAC CGCCTACCTG GACGGGAAAC GGCCCAGCAT GATCTTCTCC
CGCGGCTACT ACACCCGAAC CGTCATCACA GCGTGGGACT TCCGCGACGG ACGACTCACC
CGCCGGTGGA CCTTCGACAC CAACAGCTCC ACCAACACCG GACGCGGCTA CGAAGGCCAA
GGGTTCCACT CCCTGTCCAT CGCGGACGCC GACGGCGACG GCCGTGACGA GATCATGTTC
GGCGCCATGG CCGTCGACGA TGACGGGCGC GGCATGTGGA CCACCGGATA CGGCCACGGC
GACGCCCTGC ACGTCGGCGA CTTCGTCCCC GCCCGGCCCG GACTCGAGGT GTACGGGGTT
TCCGAAAGCT CCTCGCAGCC CAACGCTTGG CTCGCCGACG CGCGCACAGG AAGCACCCTG
TGGCGCACCG CCTCCGGCGA CGACAACGGG CGCGGCGTCG CCGGAGATAT CTGGGCAGGC
AGCCCCGGCG CCGAATTCTG GTCCTCACGG GTGGACGGCC TACTCAACAC CTCCGGAACC
GCCATCGGCC GCAAACCCAG CTCGATCAAC TTCCTCGTCT GGTGGGACGG AGACCCCAGC
CGGGAACTGC TGGACCAGAC CCGGATCGAC AAGTACGGTC CGAACGGCGA CACGCGAGGA
GCGCGTCCGC GCTCAACTCC ACTCCTTTAG
 
Protein sequence
MREYPPTRRR PVRFGAALAA FVLGATGAAA LPSPAHAAAG CSVDYSVNQW NDGFTGTVTV 
TNLGDPVNGW TLSWRFPSGE RITHGWNAEF QTNGAEVTAA NTPWNASVPT GGTVSFGFNA
THNGTVGIPE SFTFNGTVCT DQPTPGGPEE PGGPEEPGGP EEPGEPEEPA LPEPTGARQA
ERLDRGLISV RSGNGNLVSW RLLGSDPRDI AFNVYRGSTR VNSTPLTSAT SYLDAGAPAD
ASYTVRPVVD GVELGPSAAS LNFTNGYLDV PLQRPAGGTV HGSSYTYEAN DASVGDLDGD
GRYEIVLKWE PTNAKDNSQS GYTGPVLIDA YELDGTLLWR INLGINIRAG AHYTQFQVYD
YDGDGRAEVA MKTADGTRDG TGAVIGSANA DYRNSSGYVL SGPEYLTVFD GRTGRALDTV
DYVPPRGNVS SWGDSYGNRV DRFLAGTAYL DGKRPSMIFS RGYYTRTVIT AWDFRDGRLT
RRWTFDTNSS TNTGRGYEGQ GFHSLSIADA DGDGRDEIMF GAMAVDDDGR GMWTTGYGHG
DALHVGDFVP ARPGLEVYGV SESSSQPNAW LADARTGSTL WRTASGDDNG RGVAGDIWAG
SPGAEFWSSR VDGLLNTSGT AIGRKPSSIN FLVWWDGDPS RELLDQTRID KYGPNGDTRG
ARPRSTPLL