Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tfu_1612 |
Symbol | |
ID | 3581099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobifida fusca YX |
Kingdom | Bacteria |
Replicon accession | NC_007333 |
Strand | - |
Start bp | 1863968 |
End bp | 1866745 |
Gene Length | 2778 bp |
Protein Length | 925 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637685306 |
Product | cellulose-binding family II protein |
Protein accession | YP_289670 |
Protein GI | 72162013 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGCAA CAGCACAGCG AACACCACCC CCGCCCACCC CCCGCAGACG CGGCATCATC GCCCGTGCAC TCACCTGCAT CGCGGCCGCC GCAACCGTGG CCGCAGTCGG CCTCGTCCAC TCCGCTGCCG CCCCCGCCTC CGCCACCACC GGCTACACCT GGCGCAACGT CGAAATCGTC GGCGGCGGCT TCGTCCCCGG CATCGTCTTC AACCAGAGCG AACCGGACCT GATCTACGCG CGCACCGACA TCGGCGGCGC CTACCGGTGG GACCCCGCCA CCGAACGGTG GATCCCCCTG CTCGACCACG TCGGCTGGGA CGACTGGGGA CACAGCGGCG TCGTCAGCAT CGCTACCGAC CCGGTCGACC CCGACCGCGT GTATGCGGCC GTCGGCACTT ACACCAACGA CTGGGACCCC AACAACGGCG CGATCAAACG CTCCACCGAC CGCGGAGAAA CCTGGGAAAC CACTGAACTA CCGTTCAAAC TCGGCGGCAA CATGCCCGGC CGCGGCATGG GGGAGCGGCT CGCCATCGAC CCCAACGACA ACAGCGTGCT CTACCTGGGC GCACCCAGCG GCCACGGCCT GTGGAAGAGC ACCGACTACG GCAAGACCTG GCAGAAAGTC ACCAGCTTCC CCAACCCCGG AAACTACGTC GCCGACCCCT CCGACGTCGG CGGATACCTG GGAGACAACC AGGGCGTCGT CTGGGTCGTC TTCGACCCCA CCAGCTCCTC GCCCGGCCAC GTCACCAAAG ACATCTACGT CGGGGTGGCC GACAAGCAGA ACACCGTCTA CCGCTCCACC GACGGCGGCC AGACCTGGGA GCGCATCCCC GGACAGCCCA CCGGCTTCCT CGCGCAAAAA GGCGTCTTCG ACCACGTCAA CGGACTGCTC TACATCGCCA CCAGCGACAC CGGCGGCCCT TACGACGGAT CCGACGGTGA AGTGTGGCGC TACGACACCA CGACCGGCAC CTGGACCGAC ATCACCCCCG CCGATCCCGA CGGCTTCGAA TACGGTTTCA GCGGCCTGAC CATTGACCGG CAGAACCCGG ACACGATCAT GGTGGTCAGC CAGATCCTGT GGTGGCCCGA CATCCAAATC TGGCGTTCCA CCGACCGCGG AGAAACCTGG AGCCGCATCT GGGAGTTCAG CGGATACCCC GACCGCACCC TGCGCTACAA CCACGACATC TCCGCGGCCC CGTGGCTGGA CTTCAACCGG CAGGACAACC CGCCCGAAGT CAGCCCCAAG CTCGGTTGGA TGACCCAGGC CTTCGAAATC GACCCGTTCA ACTCCGACCG CATGCTGTAC GGCACCGGAG CCACCATCTA CGGCAGCGAC AACCTCACCA ACTGGGACGA AGGCAAGAAG ATCGACATCA AGGTCCGCGC TCAAGGTATT GAAGAAACCG CGGTCCAGGA CCTCATCGCC CCGCCCGGCG ACACCGAACT GGTCTCCGCC CTCGGCGACA TCGGCGGTTT CGTCCACGAC GACATCACCG TCGTCCCCGA CGCCATGTTC GACTCGCCCT TCCACGGCAA CACCCGCAGC ATCGACTTCG CAGAACTCAA CCCGAGCGTC ATGGCCCGGG TAGGGGAAGC GGTCGACGGG GAAGTCGACT CCCACATCGG CATCTCCACC AGCGGCGGAT CACACTGGTG GGCCGGACAG GAACCCTCCG GAGTCACCGG CGCTGGCACC GTCGCGGTCA ACGCCGACGG GTCGCGCATC GTGTGGAGCC CCGACGGCAC TGGCGTGCAC TACTCCACCA CCCTCGGCTC TTCGTGGACC CCGTCCCAGG GAGTCCCCGC CGGGGCCCGC GTGGAAGCCG ACCGGGTCAA CCCGGACAAG TTCTACGCCT TCGCCAACGG CACGTTCTAC ACCAGCACCG ACGGCGGTGC CACCTTCACG AAATCCTCCG CCGCCGGCCT GCCCACGAAG GGCAACATCC GTTTTGCGGC CGTCCCCGGC CACGAAGGCG ACATCTGGCT GGCAGGCGGG GAAACCAACA GCACCTACGG CATGTGGCGC TCCACCGACT CCGGGGCCAC CTTCACCCGG ATCACCGCCG TGGACGAAGG CGACGTCGTC GGATTCGGCA AACCCGCCCC CGGCCGCAGC TACCCCGCTG TCTACACCTC GTCGAAGATC AACGGGGTGC GCGGCATCTT CCGCTCCGAC GACGCAGGCA CGACCTGGGT CCGGATCAAC GACGACCAGC ACCAGTGGGC CTGGACCGGT GCGGCCATCA CCGGCGACCC CGACGTCTAC GGCCGCGTCT ACATCGGCAC CAACGGCCGC GGCGTCATCG TCGGCGACCT GGACGGGCCG CCACCGCAGC CCACCGAGGA GCCGACAGAA GAACCCTCCA CCCCGCCCAC GGAAGAACCC ACCGAGGAGC CCACGGAGGA ACCGTCCACT CCGCCAACGG AGGAGCCGCC CGGTGACGCC GCCTGCGCTG TCTCCTACCA GGTCCTCAAC GAGTGGGGCG GCGGCTTCCA AGGCGAGGTG ACCATCACCA ACACCGGCGA CACGCCGATC AACGGCTGGG AGCTGACGTG GACCTTCCCC GACAACCAGC AGATCACCCA GGCATGGAAC ACCCAGCTCA CCCAGTCGGG AGCCAAGGTG ACCGCCCGCG ACGCGGGATG GAACAGCACC ATCGCCCCCG GCGGCACTGC GAGCTTCGGA TTCCTCGGCT CGCCTGCCCC CGGCAGCAAG CCGACCGAGT TCACCCTCAA CGGGACCCCG TGTTCGGCAG CCGGCTAG
|
Protein sequence | MTATAQRTPP PPTPRRRGII ARALTCIAAA ATVAAVGLVH SAAAPASATT GYTWRNVEIV GGGFVPGIVF NQSEPDLIYA RTDIGGAYRW DPATERWIPL LDHVGWDDWG HSGVVSIATD PVDPDRVYAA VGTYTNDWDP NNGAIKRSTD RGETWETTEL PFKLGGNMPG RGMGERLAID PNDNSVLYLG APSGHGLWKS TDYGKTWQKV TSFPNPGNYV ADPSDVGGYL GDNQGVVWVV FDPTSSSPGH VTKDIYVGVA DKQNTVYRST DGGQTWERIP GQPTGFLAQK GVFDHVNGLL YIATSDTGGP YDGSDGEVWR YDTTTGTWTD ITPADPDGFE YGFSGLTIDR QNPDTIMVVS QILWWPDIQI WRSTDRGETW SRIWEFSGYP DRTLRYNHDI SAAPWLDFNR QDNPPEVSPK LGWMTQAFEI DPFNSDRMLY GTGATIYGSD NLTNWDEGKK IDIKVRAQGI EETAVQDLIA PPGDTELVSA LGDIGGFVHD DITVVPDAMF DSPFHGNTRS IDFAELNPSV MARVGEAVDG EVDSHIGIST SGGSHWWAGQ EPSGVTGAGT VAVNADGSRI VWSPDGTGVH YSTTLGSSWT PSQGVPAGAR VEADRVNPDK FYAFANGTFY TSTDGGATFT KSSAAGLPTK GNIRFAAVPG HEGDIWLAGG ETNSTYGMWR STDSGATFTR ITAVDEGDVV GFGKPAPGRS YPAVYTSSKI NGVRGIFRSD DAGTTWVRIN DDQHQWAWTG AAITGDPDVY GRVYIGTNGR GVIVGDLDGP PPQPTEEPTE EPSTPPTEEP TEEPTEEPST PPTEEPPGDA ACAVSYQVLN EWGGGFQGEV TITNTGDTPI NGWELTWTFP DNQQITQAWN TQLTQSGAKV TARDAGWNST IAPGGTASFG FLGSPAPGSK PTEFTLNGTP CSAAG
|
| |