Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tfu_0620 |
Symbol | |
ID | 3580649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobifida fusca YX |
Kingdom | Bacteria |
Replicon accession | NC_007333 |
Strand | - |
Start bp | 719475 |
End bp | 721265 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637684310 |
Product | cellobiohydrolase |
Protein accession | YP_288681 |
Protein GI | 72161024 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.250178 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAG TTCGTGCCAC GAACAGACGT TCGTGGATGC GGCGCGGCCT GGCAGCCGCC TCTGGACTGG CGCTTGGCGC CTCCATGGTG GCGTTCGCTG CTCCGGCCAA CGCCGCCGGC TGCTCGGTGG ACTACACGGT CAACTCCTGG GGTACCGGGT TCACCGCCAA CGTCACCATC ACCAACCTCG GCAGTGCGAT CAACGGCTGG ACCCTGGAGT GGGACTTCCC CGGCAACCAG CAGGTGACCA ACCTGTGGAA CGGGACCTAC ACCCAGTCCG GGCAGCACGT GTCGGTCAGC AACGCCCCGT ACAACGCCTC CATCCCGGCC AACGGAACGG TTGAGTTCGG GTTCAACGGC TCCTACTCGG GCAGCAACGA CATCCCCTCC TCCTTCAAGC TGAACGGGGT TACCTGCGAC GGCTCGGACG ACCCCGACCC CGAGCCCAGC CCCTCCCCCA GCCCTTCCCC CAGCCCCACA GACCCGGATG AGCCGGGCGG CCCGACCAAC CCGCCCACCA ACCCCGGCGA GAAGGTCGAC AACCCGTTCG AGGGCGCCAA GCTGTACGTG AACCCGGTCT GGTCGGCCAA GGCCGCCGCT GAGCCGGGCG GTTCCGCGGT CGCCAACGAG TCCACCGCTG TCTGGCTGGA CCGTATCGGC GCCATCGAGG GCAACGACAG CCCGACCACC GGCTCCATGG GTCTGCGCGA CCACCTGGAG GAGGCCGTCC GCCAGTCCGG TGGCGACCCG CTGACCATCC AGGTCGTCAT CTACAACCTG CCCGGCCGCG ACTGCGCCGC GCTGGCCTCC AACGGTGAGC TGGGTCCCGA TGAACTCGAC CGCTACAAGA GCGAGTACAT CGACCCGATC GCCGACATCA TGTGGGACTT CGCAGACTAC GAGAACCTGC GGATCGTCGC CATCATCGAG ATCGACTCCC TGCCCAACCT CGTCACCAAC GTGGGCGGGA ACGGCGGCAC CGAGCTCTGC GCCTACATGA AGCAGAACGG CGGCTACGTC AACGGTGTCG GCTACGCCCT CCGCAAGCTG GGCGAGATCC CGAACGTCTA CAACTACATC GACGCCGCCC ACCACGGCTG GATCGGCTGG GACTCCAACT TCGGCCCCTC GGTGGACATC TTCTACGAGG CCGCCAACGC CTCCGGCTCC ACCGTGGACT ACGTGCACGG CTTCATCTCC AACACGGCCA ACTACTCGGC CACTGTGGAG CCGTACCTGG ACGTCAACGG CACCGTTAAC GGCCAGCTCA TCCGCCAGTC CAAGTGGGTT GACTGGAACC AGTACGTCGA CGAGCTCTCC TTCGTCCAGG ACCTGCGTCA GGCCCTGATC GCCAAGGGCT TCCGGTCCGA CATCGGTATG CTCATCGACA CCTCCCGCAA CGGCTGGGGT GGCCCGAACC GTCCGACCGG ACCGAGCTCC TCCACCGACC TCAACACCTA CGTTGACGAG AGCCGTATCG ACCGCCGTAT CCACCCCGGT AACTGGTGCA ACCAGGCCGG TGCGGGCCTC GGCGAGCGGC CCACGGTCAA CCCGGCTCCC GGTGTTGACG CCTACGTCTG GGTGAAGCCC CCGGGTGAGT CCGACGGCGC CAGCGAGGAG ATCCCGAACG ACGAGGGCAA GGGCTTCGAC CGCATGTGCG ACCCGACCTA CCAGGGCAAC GCCCGCAACG GCAACAACCC CTCGGGTGCG CTGCCCAACG CCCCCATCTC CGGCCACTGG TTCTCTGCCC AGTTCCGCGA GCTGCTGGCC AACGCCTACC CGCCTCTGTA A
|
Protein sequence | MSKVRATNRR SWMRRGLAAA SGLALGASMV AFAAPANAAG CSVDYTVNSW GTGFTANVTI TNLGSAINGW TLEWDFPGNQ QVTNLWNGTY TQSGQHVSVS NAPYNASIPA NGTVEFGFNG SYSGSNDIPS SFKLNGVTCD GSDDPDPEPS PSPSPSPSPT DPDEPGGPTN PPTNPGEKVD NPFEGAKLYV NPVWSAKAAA EPGGSAVANE STAVWLDRIG AIEGNDSPTT GSMGLRDHLE EAVRQSGGDP LTIQVVIYNL PGRDCAALAS NGELGPDELD RYKSEYIDPI ADIMWDFADY ENLRIVAIIE IDSLPNLVTN VGGNGGTELC AYMKQNGGYV NGVGYALRKL GEIPNVYNYI DAAHHGWIGW DSNFGPSVDI FYEAANASGS TVDYVHGFIS NTANYSATVE PYLDVNGTVN GQLIRQSKWV DWNQYVDELS FVQDLRQALI AKGFRSDIGM LIDTSRNGWG GPNRPTGPSS STDLNTYVDE SRIDRRIHPG NWCNQAGAGL GERPTVNPAP GVDAYVWVKP PGESDGASEE IPNDEGKGFD RMCDPTYQGN ARNGNNPSGA LPNAPISGHW FSAQFRELLA NAYPPL
|
| |