Gene Tfu_1959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTfu_1959 
Symbol 
ID3580184 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermobifida fusca YX 
KingdomBacteria 
Replicon accessionNC_007333 
Strand
Start bp2289980 
End bp2292934 
Gene Length2955 bp 
Protein Length984 aa 
Translation table11 
GC content67% 
IMG OID637685651 
Productcellulose 1,4-beta-cellobiosidase 
Protein accessionYP_290015 
Protein GI72162358 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.54299 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGATCGT TACTGTCTCC CCGGCGCTGG CGCACGCTGG CCTCGGGGGC GCTCGCAGCG 
GCCCTGGCCG CCGCTGTACT CTCCCCCGGC GTCGCGCACG CCGCCGTCGC CTGCTCGGTG
GACTACGACG ACTCCAACGA CTGGGGTAGC GGGTTCGTCG CCGAAGTCAA GGTGACCAAC
GAAGGCAGCG ACCCCATCCA GAACTGGCAA GTAGGCTGGA CCTTCCCCGG TAACCAGCAG
ATCACCAACG GCTGGAACGG CGTGTTCAGC CAGAGCGGCG CCAACGTCAC CGTCCGCTAC
CCGGACTGGA ACCCCAATAT CGCCCCCGGA GCCACCATCT CCTTCGGCTT CCAGGGCACC
TACAGCGGCT CCAACGACGC CCCGACCAGC TTCACCGTCA ACGGCGTCAC CTGCAGCGGA
TCCCAGCCCG CCAACCTGCC GCCCGATGTC ACCCTGACAT CCCCGGCCAA CAACTCGACC
TTCCTGGTCA ACGACCCGAT CGAGCTGACC GCGGTCGCCT CCGACCCCGA CGGCTCGATC
GACCGGGTGG AATTCGCCGC CGACAACACC GTCATCGGCA TCGACACCAC CTCCCCCTAC
AGCTTCACCT GGACGGACGC TGCCGCCGGC TCCTACTCGG TGACCGCGAT CGCCTACGAC
GACCAGGGAG CCAGGACCGT CTCCGCTCCC ATCGCCATCC GAGTGCTGGA CCGGGCCGCC
GTCATCGCCT CACCGCCCAC CGTCCGCGTG CCGCAGGGCG GCACCGCCGA CTTCGAGGTG
CGGCTGTCCA ACCAGCCCTC CGGCAACGTC ACGGTCACCG TGGCGCGCAC GTCGGGCAGC
TCCGACCTGA CCGTCTCCAG CGGCTCCCAA CTCCAGTTCA CCTCCAGCAA CTGGAACCAG
CCGCAGAAGG TGACCATCGC CTCCGCTGAC AACGGCGGAA ACCTGGCCGA GGCGGTCTTC
ACCGTCAGCG CCCCCGGCCA CGACTCGGCC GAGGTGACGG TCCGGGAGAT CGACCCGAAC
ACCAGCTCCT ACGACCAGGC CTTCCTGGAG CAGTACGAGA AGATCAAGGA CCCCGCCAGC
GGCTACTTCC GCGAATTCAA CGGGCTCCTG GTCCCCTACC ACTCGGTGGA GACCATGATC
GTCGAGGCTC CGGACCACGG CCACCAGACC ACGTCCGAGG CGTTCAGCTA CTACCTGTGG
CTGGAGGCGT ACTACGGCCG GGTCACCGGT GACTGGAAGC CGCTCCACGA CGCCTGGGAG
TCGATGGAGA CCTTCATCAT CCCCGGCACC AAGGACCAGC CGACCAACTC CGCCTACAAC
CCGAACTCCC CGGCGACCTA CATCCCCGAG CAGCCCAACG CTGACGGCTA CCCGTCGCCT
CTCATGAACA ACGTCCCGGT GGGTCAAGAC CCGCTCGCCC AGGAGCTGAG CTCCACCTAC
GGGACCAACG AGATCTACGG CATGCACTGG CTGCTCGACG TGGACAACGT CTACGGCTTC
GGGTTCTGCG GCGACGGCAC CGACGACGCC CCCGCCTACA TCAACACCTA CCAGCGTGGT
GCGCGCGAGT CGGTGTGGGA GACCATTCCG CACCCGTCCT GCGACGACTT CACGCACGGC
GGCCCCAACG GCTACCTGGA CCTGTTCACC GACGACCAGA ACTACGCCAA GCAGTGGCGC
TACACCAACG CCCCCGACGC TGACGCGCGG GCCGTCCAGG TGATGTTCTG GGCGCACGAA
TGGGCCAAGG AGCAGGGCAA GGAGAACGAG ATCGCGGGCC TGATGGACAA GGCGTCCAAG
ATGGGCGACT ACCTCCGGTA CGCGATGTTC GACAAGTACT TCAAGAAGAT CGGCAACTGC
GTCGGCGCCA CCTCCTGCCC GGGTGGCCAA GGCAAGGACA GCGCGCACTA CCTGCTGTCC
TGGTACTACT CCTGGGGCGG CTCGCTCGAC ACCTCCTCTG CGTGGGCGTG GCGTATCGGC
TCCAGCTCCT CGCACCAGGG CTACCAGAAC GTGCTCGCTG CCTACGCGCT CTCGCAGGTG
CCCGAACTGC AGCCTGACTC CCCGACCGGT GTCCAGGACT GGGCCACCAG CTTCGACCGC
CAGTTGGAGT TCCTCCAGTG GCTGCAGTCC GCTGAAGGTG GTATCGCCGG TGGCGCCACC
AACAGCTGGA AGGGAAGCTA CGACACCCCG CCGACCGGCC TGTCGCAGTT CTACGGCATG
TACTACGACT GGCAGCCGGT CTGGAACGAC CCGCCGTCCA ACAACTGGTT CGGCTTCCAG
GTCTGGAACA TGGAGCGCGT CGCCCAGCTC TACTACGTGA CCGGCGACGC CCGGGCCGAG
GCCATCCTCG ACAAGTGGGT GCCGTGGGCC ATCCAGCACA CCGACGTGGA CGCCGACAAC
GGCGGCCAGA ACTTCCAGGT CCCCTCCGAC CTGGAGTGGT CGGGCCAGCC TGACACCTGG
ACCGGCACCT ACACCGGCAA CCCGAACCTG CACGTCCAGG TCGTCTCCTA CAGCCAGGAC
GTCGGTGTGA CCGCCGCTCT GGCCAAGACC CTGATGTACT ACGCGAAGCG TTCGGGCGAC
ACCACCGCCC TCGCCACCGC GGAGGGTCTG CTGGACGCCC TGCTGGCCCA CCGGGACAGC
ATCGGTATCG CCACCCCCGA GCAGCCGAGC TGGGACCGTC TGGACGACCC GTGGGACGGC
TCCGAGGGCC TGTACGTGCC GCCGGGCTGG TCGGGCACCA TGCCCAACGG TGACCGCATC
GAGCCGGGCG CGACCTTCCT GTCCATCCGC TCGTTCTACA AGAACGACCC GCTGTGGCCG
CAGGTCGAGG CACACCTGAA CGACCCGCAG AACGTCCCGG CGCCGATCGT GGAGCGCCAC
CGCTTCTGGG CTCAGGTGGA AATCGCGACC GCGTTCGCAG CCCACGACGA ACTGTTCGGG
GCCGGAGCTC CCTGA
 
Protein sequence
MRSLLSPRRW RTLASGALAA ALAAAVLSPG VAHAAVACSV DYDDSNDWGS GFVAEVKVTN 
EGSDPIQNWQ VGWTFPGNQQ ITNGWNGVFS QSGANVTVRY PDWNPNIAPG ATISFGFQGT
YSGSNDAPTS FTVNGVTCSG SQPANLPPDV TLTSPANNST FLVNDPIELT AVASDPDGSI
DRVEFAADNT VIGIDTTSPY SFTWTDAAAG SYSVTAIAYD DQGARTVSAP IAIRVLDRAA
VIASPPTVRV PQGGTADFEV RLSNQPSGNV TVTVARTSGS SDLTVSSGSQ LQFTSSNWNQ
PQKVTIASAD NGGNLAEAVF TVSAPGHDSA EVTVREIDPN TSSYDQAFLE QYEKIKDPAS
GYFREFNGLL VPYHSVETMI VEAPDHGHQT TSEAFSYYLW LEAYYGRVTG DWKPLHDAWE
SMETFIIPGT KDQPTNSAYN PNSPATYIPE QPNADGYPSP LMNNVPVGQD PLAQELSSTY
GTNEIYGMHW LLDVDNVYGF GFCGDGTDDA PAYINTYQRG ARESVWETIP HPSCDDFTHG
GPNGYLDLFT DDQNYAKQWR YTNAPDADAR AVQVMFWAHE WAKEQGKENE IAGLMDKASK
MGDYLRYAMF DKYFKKIGNC VGATSCPGGQ GKDSAHYLLS WYYSWGGSLD TSSAWAWRIG
SSSSHQGYQN VLAAYALSQV PELQPDSPTG VQDWATSFDR QLEFLQWLQS AEGGIAGGAT
NSWKGSYDTP PTGLSQFYGM YYDWQPVWND PPSNNWFGFQ VWNMERVAQL YYVTGDARAE
AILDKWVPWA IQHTDVDADN GGQNFQVPSD LEWSGQPDTW TGTYTGNPNL HVQVVSYSQD
VGVTAALAKT LMYYAKRSGD TTALATAEGL LDALLAHRDS IGIATPEQPS WDRLDDPWDG
SEGLYVPPGW SGTMPNGDRI EPGATFLSIR SFYKNDPLWP QVEAHLNDPQ NVPAPIVERH
RFWAQVEIAT AFAAHDELFG AGAP