Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tfu_1627 |
Symbol | |
ID | 3580405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobifida fusca YX |
Kingdom | Bacteria |
Replicon accession | NC_007333 |
Strand | - |
Start bp | 1887518 |
End bp | 1890514 |
Gene Length | 2997 bp |
Protein Length | 998 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637685321 |
Product | cellulose 1,4-beta-cellobiosidase |
Protein accession | YP_289685 |
Protein GI | 72162028 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.234171 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAGCGC TCCCATGGTG GGCCTCCGCT GTGAGGTCAT CCTCCCAGTT CGAATCCCCC TACGGAAGGA CTTCCGTGCT TAGGAGACCC AGATCTCGAT CCCCCCTTGT CGCCCTCACC GCGGCGACTT GCGCAGTCGC GCTCGGGGGT ACGGCGGTTC CCGCCCAGGC AGACGAAGTC AACCAGATTC GCAACGGCGA CTTCAGCTCC GGCACCGCAC CCTGGTGGGG AACCGAGAAC ATCCAACTCA ACGTCACCGA CGGGATGCTG TGCGTCGACG TCCCCGGCGG CACCGTCAAC CCGTGGGACG TGATCATCGG CCAGGACGAC ATCCCCCTCA TCGAAGGTGA GTCCTACGCC TTCTCCTTCA CTGCCTCCAG CACCGTCCCC GTCTCCATCC GCGCCCTGGT GCAAGAGCCC GTGGAGCCGT GGACCACCCA GATGGACGAG CGTGCCCTGC TCGGCCCCGA GGCAGAAACC TACGAATTCG TCTTCACCTC CAACGTCGAC TGGGACGACG CCCAAGTCGC CTTCCAGATC GGCGGCTCCG ACGAACCGTG GACCTTCTGC CTCGACGACG TCGCCCTGCT CGGCGGCGCC GAACCCCCGG TCTACGAACC CGACACCGGA CCGCGGGTCC GCGTCAACCA GGTCGGCTAC CTCCCGCACG GTCCCAAGAA GGCGACCGTG GTCACCGACG CCACCAGCGC GCTCACCTGG GAGCTTGCCG ACGCCGACGG TAACGTGGTC GCCAGCGGCC AGACCAAGCC GCACGGCGCG GACTCCAGCT CCGGGCTCAA CGTCCACACC GTCGACTTCA GCTCCTACAC CACGAAGGGA AGCGACTACA CGCTCACCGT CGACGGTGAA ACCAGCTACC CCTTCGACAT CGACGAAAGC GTCTACGAGG AACTGCGCGT CGACGCGCTG TCGTTCTACT ACCCGCAGCG CAGCGGCATC GAGATCCTCG ACTCCATCGC CCCCGGCTAC GGACGCCCGG CCGGCCACAT CGGCGTGCCC CCCAACCAGG GCGATACCGA CGTGCCGTGC GCGCCCGGCA CCTGCGACTA CTCCCTGGAC GTCTCCGGCG GCTGGTACGA CGCGGGCGAC CACGGCAAAT ACGTGGTCAA CGGCGGTATC TCGGTGCACC AGATCATGAG CATCTACGAG CGCTCCCAGC TCGCCGACAC CGCCCAGCCC GACAAGCTGG CCGACTCCAC CCTGCGCCTG CCCGAAACCG GCAACGGCGT GCCCGACGTG CTCGACGAAG CACGCTGGGA GATGGAGTTC CTCCTCAAGA TGCAGGTGCC CGAAGGCGAA CCGCTCGCCG GCATGGCGCA CCACAAGATC CACGACGAAC AGTGGACCGG GCTGCCGCTG CTGCCCTCCG CTGACCCGCA GCCGCGCTAC CTGCAGCCGC CGTCCACCGC GGCCACGCTG AACCTGGCCG CCACCGCCGC CCAGTGCGCT CGCGTGTTCG AACCCTTCGA CGAGGATTTC GCCGCCGAGT GCCTGGCTGC CGCGGAAACC GCGTGGGACG CCGCCAAGGC CAACCCGAAC ATCTACGCGC CTGCCTTCGG TGAAGGCGGC GGCCCGTACA ACGACAACAA CGTCACCGAC GAGTTCTACT GGGCCGCGGC CGAACTGTTC CTCACCACCG GCAAGGAGGA GTACCGCGAC GCGGTGACCT CGTCGCCGCT GCACACCGAC GACGAAGAGG TCTTCCGCGA CGGCGCCTTC GACTGGGGAT GGACTGCTGC GCTGGCCCGC CTCCAGCTGG CCACGATCCC CAACGACCTC GCCGACCGCG ACCGGGTGCG CCAGTCCGTG GTCGATGCCG CCGACATGTA CCTCGCCAAC GTCGAGACCA GCCCGTGGGG CCTGGCCTAC AAGCCGAACA ACGGCGTGTT CGTCTGGGGC TCCAACAGCG CTGTCCTCAA CAACATGGTG ATCCTGGCGG TCGCCTTCGA CCTCACCGGT GACACCAAAT ACCGCGACGG CGTGCTGGAA GGCATGGACT ACATCTTCGG CCGCAACGCG CTGAACCAGT CCTACGTCAC CGGCTACGGC GACAAGGACT CCCGCAACCA GCACAGCCGC TGGTACGCCC ACCAGCTCGA CCCCCGGTTG CCCAACCCGC CCAAGGGCAC GCTGGCCGGT GGACCCAACT CCGACTCCAC CACCTGGGAC CCGGTGGCCC AGTCCAAGCT GACCGGGTGC GCCCCCCAGA TGTGCTACAT CGACCACATC GAGTCGTGGT CCACCAACGA GCTGACCATC AACTGGAACG CCCCCCTGTC GTGGATCGCG TCCTTCATCG CCGACCAGGA CGACGCCGGC GAGCCCGGCG GAGAAGAGCC CGGACCGGGC GACGACGAGA CCCCGCCGAG CAAGCCTGGG AACCTGAAGG CCAGCGACAT CACCGCGACC AGCGCCACCC TGACCTGGGA CGCCTCCACC GACAACGTCG GAGTGGTCGG CTACAAGGTC TCCCTGGTCC GCGACGGTGA CGCTGAAGAG GTGGGCACCA CCGCGCAGAC CAGCTACACG CTCACCGGGC TGAGCGCGGA CCAGGAGTAC ACCGTCCAGG TGGTCGCCTA CGACGCGGCA GGCAACCTCT CCACGCCAGC CACCGTCACC TTCACCACCG AGAAGGAGGA CGAGACTCCC ACGCCCAGCG CCTCCTGCGC GGTGACGTAC CAGACCAACG ACTGGCCGGG CGGCTTCACC GCCTCGGTGA CGCTGACCAA CACCGGCAGC ACCCCGTGGG ACTCCTGGGA ACTGCGCTTC ACCTTCCCGT CGGGACAGAC TGTCAGCCAC GGCTGGAGCG CCAACTGGCA GCAGAGCGGC AGTGACGTGA CCGCCACCTC CTTGCCGTGG AACGGATCAG TTCCGCCGGG CGGCTCAGTC AACATCGGCT TCAACGGAAC CTGGGGCGGT TCGAACACCA AACCTGAGAA GTTCACCGTC AACGGCGCGG TCTGCTCCAT CGGCTGA
|
Protein sequence | MGALPWWASA VRSSSQFESP YGRTSVLRRP RSRSPLVALT AATCAVALGG TAVPAQADEV NQIRNGDFSS GTAPWWGTEN IQLNVTDGML CVDVPGGTVN PWDVIIGQDD IPLIEGESYA FSFTASSTVP VSIRALVQEP VEPWTTQMDE RALLGPEAET YEFVFTSNVD WDDAQVAFQI GGSDEPWTFC LDDVALLGGA EPPVYEPDTG PRVRVNQVGY LPHGPKKATV VTDATSALTW ELADADGNVV ASGQTKPHGA DSSSGLNVHT VDFSSYTTKG SDYTLTVDGE TSYPFDIDES VYEELRVDAL SFYYPQRSGI EILDSIAPGY GRPAGHIGVP PNQGDTDVPC APGTCDYSLD VSGGWYDAGD HGKYVVNGGI SVHQIMSIYE RSQLADTAQP DKLADSTLRL PETGNGVPDV LDEARWEMEF LLKMQVPEGE PLAGMAHHKI HDEQWTGLPL LPSADPQPRY LQPPSTAATL NLAATAAQCA RVFEPFDEDF AAECLAAAET AWDAAKANPN IYAPAFGEGG GPYNDNNVTD EFYWAAAELF LTTGKEEYRD AVTSSPLHTD DEEVFRDGAF DWGWTAALAR LQLATIPNDL ADRDRVRQSV VDAADMYLAN VETSPWGLAY KPNNGVFVWG SNSAVLNNMV ILAVAFDLTG DTKYRDGVLE GMDYIFGRNA LNQSYVTGYG DKDSRNQHSR WYAHQLDPRL PNPPKGTLAG GPNSDSTTWD PVAQSKLTGC APQMCYIDHI ESWSTNELTI NWNAPLSWIA SFIADQDDAG EPGGEEPGPG DDETPPSKPG NLKASDITAT SATLTWDAST DNVGVVGYKV SLVRDGDAEE VGTTAQTSYT LTGLSADQEY TVQVVAYDAA GNLSTPATVT FTTEKEDETP TPSASCAVTY QTNDWPGGFT ASVTLTNTGS TPWDSWELRF TFPSGQTVSH GWSANWQQSG SDVTATSLPW NGSVPPGGSV NIGFNGTWGG SNTKPEKFTV NGAVCSIG
|
| |