Gene Tfu_0620 details

Gene Information

Locus tag: Tfu_0620
Symbol:
ID: 3580649
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermobifida fusca YX
Kingdom: Bacteria
Replicon accession: NC_007333
Strand:
Start bp: 719475
End bp: 721265
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 67%
IMG OID: 637684310
Product: cellobiohydrolase
Protein accession: YP_288681
Protein GI: 72161024
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A)
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
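
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(719475, 721265)
print(length)                     # 1791
print(protein_length_aa(length))  # 596
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of this record.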


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.250178
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAAAG TTCGTGCCAC GAACAGACGT TCGTGGATGC GGCGCGGCCT GGCAGCCGCC 
TCTGGACTGG CGCTTGGCGC CTCCATGGTG GCGTTCGCTG CTCCGGCCAA CGCCGCCGGC
TGCTCGGTGG ACTACACGGT CAACTCCTGG GGTACCGGGT TCACCGCCAA CGTCACCATC
ACCAACCTCG GCAGTGCGAT CAACGGCTGG ACCCTGGAGT GGGACTTCCC CGGCAACCAG
CAGGTGACCA ACCTGTGGAA CGGGACCTAC ACCCAGTCCG GGCAGCACGT GTCGGTCAGC
AACGCCCCGT ACAACGCCTC CATCCCGGCC AACGGAACGG TTGAGTTCGG GTTCAACGGC
TCCTACTCGG GCAGCAACGA CATCCCCTCC TCCTTCAAGC TGAACGGGGT TACCTGCGAC
GGCTCGGACG ACCCCGACCC CGAGCCCAGC CCCTCCCCCA GCCCTTCCCC CAGCCCCACA
GACCCGGATG AGCCGGGCGG CCCGACCAAC CCGCCCACCA ACCCCGGCGA GAAGGTCGAC
AACCCGTTCG AGGGCGCCAA GCTGTACGTG AACCCGGTCT GGTCGGCCAA GGCCGCCGCT
GAGCCGGGCG GTTCCGCGGT CGCCAACGAG TCCACCGCTG TCTGGCTGGA CCGTATCGGC
GCCATCGAGG GCAACGACAG CCCGACCACC GGCTCCATGG GTCTGCGCGA CCACCTGGAG
GAGGCCGTCC GCCAGTCCGG TGGCGACCCG CTGACCATCC AGGTCGTCAT CTACAACCTG
CCCGGCCGCG ACTGCGCCGC GCTGGCCTCC AACGGTGAGC TGGGTCCCGA TGAACTCGAC
CGCTACAAGA GCGAGTACAT CGACCCGATC GCCGACATCA TGTGGGACTT CGCAGACTAC
GAGAACCTGC GGATCGTCGC CATCATCGAG ATCGACTCCC TGCCCAACCT CGTCACCAAC
GTGGGCGGGA ACGGCGGCAC CGAGCTCTGC GCCTACATGA AGCAGAACGG CGGCTACGTC
AACGGTGTCG GCTACGCCCT CCGCAAGCTG GGCGAGATCC CGAACGTCTA CAACTACATC
GACGCCGCCC ACCACGGCTG GATCGGCTGG GACTCCAACT TCGGCCCCTC GGTGGACATC
TTCTACGAGG CCGCCAACGC CTCCGGCTCC ACCGTGGACT ACGTGCACGG CTTCATCTCC
AACACGGCCA ACTACTCGGC CACTGTGGAG CCGTACCTGG ACGTCAACGG CACCGTTAAC
GGCCAGCTCA TCCGCCAGTC CAAGTGGGTT GACTGGAACC AGTACGTCGA CGAGCTCTCC
TTCGTCCAGG ACCTGCGTCA GGCCCTGATC GCCAAGGGCT TCCGGTCCGA CATCGGTATG
CTCATCGACA CCTCCCGCAA CGGCTGGGGT GGCCCGAACC GTCCGACCGG ACCGAGCTCC
TCCACCGACC TCAACACCTA CGTTGACGAG AGCCGTATCG ACCGCCGTAT CCACCCCGGT
AACTGGTGCA ACCAGGCCGG TGCGGGCCTC GGCGAGCGGC CCACGGTCAA CCCGGCTCCC
GGTGTTGACG CCTACGTCTG GGTGAAGCCC CCGGGTGAGT CCGACGGCGC CAGCGAGGAG
ATCCCGAACG ACGAGGGCAA GGGCTTCGAC CGCATGTGCG ACCCGACCTA CCAGGGCAAC
GCCCGCAACG GCAACAACCC CTCGGGTGCG CTGCCCAACG CCCCCATCTC CGGCCACTGG
TTCTCTGCCC AGTTCCGCGA GCTGCTGGCC AACGCCTACC CGCCTCTGTA A
 
Protein sequence
MSKVRATNRR SWMRRGLAAA SGLALGASMV AFAAPANAAG CSVDYTVNSW GTGFTANVTI 
TNLGSAINGW TLEWDFPGNQ QVTNLWNGTY TQSGQHVSVS NAPYNASIPA NGTVEFGFNG
SYSGSNDIPS SFKLNGVTCD GSDDPDPEPS PSPSPSPSPT DPDEPGGPTN PPTNPGEKVD
NPFEGAKLYV NPVWSAKAAA EPGGSAVANE STAVWLDRIG AIEGNDSPTT GSMGLRDHLE
EAVRQSGGDP LTIQVVIYNL PGRDCAALAS NGELGPDELD RYKSEYIDPI ADIMWDFADY
ENLRIVAIIE IDSLPNLVTN VGGNGGTELC AYMKQNGGYV NGVGYALRKL GEIPNVYNYI
DAAHHGWIGW DSNFGPSVDI FYEAANASGS TVDYVHGFIS NTANYSATVE PYLDVNGTVN
GQLIRQSKWV DWNQYVDELS FVQDLRQALI AKGFRSDIGM LIDTSRNGWG GPNRPTGPSS
STDLNTYVDE SRIDRRIHPG NWCNQAGAGL GERPTVNPAP GVDAYVWVKP PGESDGASEE
IPNDEGKGFD RMCDPTYQGN ARNGNNPSGA LPNAPISGHW FSAQFRELLA NAYPPL
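
The sequences above are laid out in blocks of 10 characters, so the spaces and line breaks have to be removed before any length or composition check. The following is a minimal sketch of that clean-up, with a GC fraction helper; the function names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence shown above.
first_blocks = "ATGAGTAAAG TTCGTGCCAC"
seq = clean_sequence(first_blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.45
```

Applied to the full gene sequence, the same helpers give a length and GC fraction that can be compared with the Gene Length and GC content fields of the record.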