Gene Information |
Locus tag | Tfu_1959 |
Symbol | |
ID | 3580184 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobifida fusca YX |
Kingdom | Bacteria |
Replicon accession | NC_007333 |
Strand | + |
Start bp | 2289980 |
End bp | 2292934 |
Gene Length | 2955 bp |
Protein Length | 984 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637685651 |
Product | cellulose 1,4-beta-cellobiosidase |
Protein accession | YP_290015 |
Protein GI | 72162358 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
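The length fields in the table above are consistent with the listed coordinates. A minimal sketch of that arithmetic follows, assuming Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2289980, 2292934)
print(length, protein_length_aa(length))  # prints: 2955 984
```

Both values match the Gene Length and Protein Length rows for Tfu_1959.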
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.54299 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGATCGT TACTGTCTCC CCGGCGCTGG CGCACGCTGG CCTCGGGGGC GCTCGCAGCG GCCCTGGCCG CCGCTGTACT CTCCCCCGGC GTCGCGCACG CCGCCGTCGC CTGCTCGGTG GACTACGACG ACTCCAACGA CTGGGGTAGC GGGTTCGTCG CCGAAGTCAA GGTGACCAAC GAAGGCAGCG ACCCCATCCA GAACTGGCAA GTAGGCTGGA CCTTCCCCGG TAACCAGCAG ATCACCAACG GCTGGAACGG CGTGTTCAGC CAGAGCGGCG CCAACGTCAC CGTCCGCTAC CCGGACTGGA ACCCCAATAT CGCCCCCGGA GCCACCATCT CCTTCGGCTT CCAGGGCACC TACAGCGGCT CCAACGACGC CCCGACCAGC TTCACCGTCA ACGGCGTCAC CTGCAGCGGA TCCCAGCCCG CCAACCTGCC GCCCGATGTC ACCCTGACAT CCCCGGCCAA CAACTCGACC TTCCTGGTCA ACGACCCGAT CGAGCTGACC GCGGTCGCCT CCGACCCCGA CGGCTCGATC GACCGGGTGG AATTCGCCGC CGACAACACC GTCATCGGCA TCGACACCAC CTCCCCCTAC AGCTTCACCT GGACGGACGC TGCCGCCGGC TCCTACTCGG TGACCGCGAT CGCCTACGAC GACCAGGGAG CCAGGACCGT CTCCGCTCCC ATCGCCATCC GAGTGCTGGA CCGGGCCGCC GTCATCGCCT CACCGCCCAC CGTCCGCGTG CCGCAGGGCG GCACCGCCGA CTTCGAGGTG CGGCTGTCCA ACCAGCCCTC CGGCAACGTC ACGGTCACCG TGGCGCGCAC GTCGGGCAGC TCCGACCTGA CCGTCTCCAG CGGCTCCCAA CTCCAGTTCA CCTCCAGCAA CTGGAACCAG CCGCAGAAGG TGACCATCGC CTCCGCTGAC AACGGCGGAA ACCTGGCCGA GGCGGTCTTC ACCGTCAGCG CCCCCGGCCA CGACTCGGCC GAGGTGACGG TCCGGGAGAT CGACCCGAAC ACCAGCTCCT ACGACCAGGC CTTCCTGGAG CAGTACGAGA AGATCAAGGA CCCCGCCAGC GGCTACTTCC GCGAATTCAA CGGGCTCCTG GTCCCCTACC ACTCGGTGGA GACCATGATC GTCGAGGCTC CGGACCACGG CCACCAGACC ACGTCCGAGG CGTTCAGCTA CTACCTGTGG CTGGAGGCGT ACTACGGCCG GGTCACCGGT GACTGGAAGC CGCTCCACGA CGCCTGGGAG TCGATGGAGA CCTTCATCAT CCCCGGCACC AAGGACCAGC CGACCAACTC CGCCTACAAC CCGAACTCCC CGGCGACCTA CATCCCCGAG CAGCCCAACG CTGACGGCTA CCCGTCGCCT CTCATGAACA ACGTCCCGGT GGGTCAAGAC CCGCTCGCCC AGGAGCTGAG CTCCACCTAC GGGACCAACG AGATCTACGG CATGCACTGG CTGCTCGACG TGGACAACGT CTACGGCTTC GGGTTCTGCG GCGACGGCAC CGACGACGCC CCCGCCTACA TCAACACCTA CCAGCGTGGT GCGCGCGAGT CGGTGTGGGA GACCATTCCG CACCCGTCCT GCGACGACTT CACGCACGGC GGCCCCAACG GCTACCTGGA CCTGTTCACC GACGACCAGA ACTACGCCAA GCAGTGGCGC TACACCAACG CCCCCGACGC TGACGCGCGG GCCGTCCAGG TGATGTTCTG GGCGCACGAA TGGGCCAAGG AGCAGGGCAA GGAGAACGAG ATCGCGGGCC TGATGGACAA GGCGTCCAAG 
ATGGGCGACT ACCTCCGGTA CGCGATGTTC GACAAGTACT TCAAGAAGAT CGGCAACTGC GTCGGCGCCA CCTCCTGCCC GGGTGGCCAA GGCAAGGACA GCGCGCACTA CCTGCTGTCC TGGTACTACT CCTGGGGCGG CTCGCTCGAC ACCTCCTCTG CGTGGGCGTG GCGTATCGGC TCCAGCTCCT CGCACCAGGG CTACCAGAAC GTGCTCGCTG CCTACGCGCT CTCGCAGGTG CCCGAACTGC AGCCTGACTC CCCGACCGGT GTCCAGGACT GGGCCACCAG CTTCGACCGC CAGTTGGAGT TCCTCCAGTG GCTGCAGTCC GCTGAAGGTG GTATCGCCGG TGGCGCCACC AACAGCTGGA AGGGAAGCTA CGACACCCCG CCGACCGGCC TGTCGCAGTT CTACGGCATG TACTACGACT GGCAGCCGGT CTGGAACGAC CCGCCGTCCA ACAACTGGTT CGGCTTCCAG GTCTGGAACA TGGAGCGCGT CGCCCAGCTC TACTACGTGA CCGGCGACGC CCGGGCCGAG GCCATCCTCG ACAAGTGGGT GCCGTGGGCC ATCCAGCACA CCGACGTGGA CGCCGACAAC GGCGGCCAGA ACTTCCAGGT CCCCTCCGAC CTGGAGTGGT CGGGCCAGCC TGACACCTGG ACCGGCACCT ACACCGGCAA CCCGAACCTG CACGTCCAGG TCGTCTCCTA CAGCCAGGAC GTCGGTGTGA CCGCCGCTCT GGCCAAGACC CTGATGTACT ACGCGAAGCG TTCGGGCGAC ACCACCGCCC TCGCCACCGC GGAGGGTCTG CTGGACGCCC TGCTGGCCCA CCGGGACAGC ATCGGTATCG CCACCCCCGA GCAGCCGAGC TGGGACCGTC TGGACGACCC GTGGGACGGC TCCGAGGGCC TGTACGTGCC GCCGGGCTGG TCGGGCACCA TGCCCAACGG TGACCGCATC GAGCCGGGCG CGACCTTCCT GTCCATCCGC TCGTTCTACA AGAACGACCC GCTGTGGCCG CAGGTCGAGG CACACCTGAA CGACCCGCAG AACGTCCCGG CGCCGATCGT GGAGCGCCAC CGCTTCTGGG CTCAGGTGGA AATCGCGACC GCGTTCGCAG CCCACGACGA ACTGTTCGGG GCCGGAGCTC CCTGA
|
Protein sequence | MRSLLSPRRW RTLASGALAA ALAAAVLSPG VAHAAVACSV DYDDSNDWGS GFVAEVKVTN EGSDPIQNWQ VGWTFPGNQQ ITNGWNGVFS QSGANVTVRY PDWNPNIAPG ATISFGFQGT YSGSNDAPTS FTVNGVTCSG SQPANLPPDV TLTSPANNST FLVNDPIELT AVASDPDGSI DRVEFAADNT VIGIDTTSPY SFTWTDAAAG SYSVTAIAYD DQGARTVSAP IAIRVLDRAA VIASPPTVRV PQGGTADFEV RLSNQPSGNV TVTVARTSGS SDLTVSSGSQ LQFTSSNWNQ PQKVTIASAD NGGNLAEAVF TVSAPGHDSA EVTVREIDPN TSSYDQAFLE QYEKIKDPAS GYFREFNGLL VPYHSVETMI VEAPDHGHQT TSEAFSYYLW LEAYYGRVTG DWKPLHDAWE SMETFIIPGT KDQPTNSAYN PNSPATYIPE QPNADGYPSP LMNNVPVGQD PLAQELSSTY GTNEIYGMHW LLDVDNVYGF GFCGDGTDDA PAYINTYQRG ARESVWETIP HPSCDDFTHG GPNGYLDLFT DDQNYAKQWR YTNAPDADAR AVQVMFWAHE WAKEQGKENE IAGLMDKASK MGDYLRYAMF DKYFKKIGNC VGATSCPGGQ GKDSAHYLLS WYYSWGGSLD TSSAWAWRIG SSSSHQGYQN VLAAYALSQV PELQPDSPTG VQDWATSFDR QLEFLQWLQS AEGGIAGGAT NSWKGSYDTP PTGLSQFYGM YYDWQPVWND PPSNNWFGFQ VWNMERVAQL YYVTGDARAE AILDKWVPWA IQHTDVDADN GGQNFQVPSD LEWSGQPDTW TGTYTGNPNL HVQVVSYSQD VGVTAALAKT LMYYAKRSGD TTALATAEGL LDALLAHRDS IGIATPEQPS WDRLDDPWDG SEGLYVPPGW SGTMPNGDRI EPGATFLSIR SFYKNDPLWP QVEAHLNDPQ NVPAPIVERH RFWAQVEIAT AFAAHDELFG AGAP
|
| |
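The sequences above are printed in space-separated blocks of ten, so the whitespace has to be removed before any per-base calculation. As a rough sketch, GC content could be computed as below. The helper name is hypothetical, and the example input is only the first three blocks of the gene sequence, so its result is not the 67% reported for the full gene.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring the whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First 30 bases of the gene sequence, copied from the record above.
prefix = "ATGAGATCGT TACTGTCTCC CCGGCGCTGG"
print(f"{gc_fraction(prefix):.0%}")  # prints: 60%
```

Running the same function on the complete gene sequence should give a value close to the GC content listed in the Gene Information table.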