Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tfu_1607 |
Symbol | |
ID | 3581094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobifida fusca YX |
Kingdom | Bacteria |
Replicon accession | NC_007333 |
Strand | - |
Start bp | 1855478 |
End bp | 1858264 |
Gene Length | 2787 bp |
Protein Length | 928 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637685301 |
Product | exo-1,4-beta-glucosidase |
Protein accession | YP_289665 |
Protein GI | 72162008 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.124023 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCGAAC CCTTCCGCGA CCCCACCCTG CCCCCACACG AACGCGTCCG CGACCTCCTA GCCCGACTCA CCACCGAAGA AAAAATCGGC CTCCTCCACC AATACCAACG CCCCATCCCC CGACTCGGTA TCGCATCTTT CCGCACCGGA ACCGAAGCAC TCCACGGCCT CGCCTGGCAC GGACCAGCCA CCGTCTTCCC CCAAGCCATC GGCCTCGCCA GCACCTGGGA CCCCGACCTC GTCCAACAAG TCGGCGCAGC CACCGCCGCC GAAGTCCTCG TCTTCCACAC CAAAAACCCC GCCACTGTCG GCCGCAACGT CTGGGCCCCC GTCGTCAACC CTCTCCGCGA CCCCCGCTGG GGCCGAAACG AAGAAGGCTA CTCCGAAGAC CCCTGGCTCA CCGGAGTCAT GGCCGTCTCC TACGCCCGCG GCCTCGCCGG ACCCCACCCC CACCGCATGG ACACCGCCCC CACCCTCAAA CACTTCCTCG CCTACAACAA CGAAACCGAC CGCTGCACCA GCTCCAGCCA CCTCCCCCCA CGCGTCCTCC ACGAATACGA ACTCCCCGCC TTCCTCCCCG CCCTCCGCGA AGGCGTCGCA GTCGCCGTCA TGCCCTCCTA CAACCTCGTC AACGGCCGCC CCGCCCACCT CAGCCCACTC ATCAACGACG TCCTCCGCGC CGCCGCCCCC GACGAACTCA TGGTCGTCAG CGACGCCATG GCCCCCGGCA ACCTCGTCGA CCCCCAGCAC TACTACGACG ACCACGCCAC CGCCTACGCG CACGCCCTCC GCGCCGGAAT CGACAGCTTC ACCCAAGACG ACGACCGCGC CGAAGCCACT CTCGCCCACC TCCGCGACGC CCTCGACCGC GGCCTCATCA CCGAAGAAGA CCTCGACCGC GCAGCCACCC ACATCCTGTC GGTCCGCGTC CGCCTCGGCG AATTCGACCC CGAACCCCTC CGCCGCGTCG ACCCCGACAC CGTCAACAGC CCCGCCCACC AAGCCCTCGC CCGCACCGCA GCCCGCCGCT CCATCGTCCT CCTCAAAAAC GACGGCATCC TCCCCCTCCG CGACCCCCGC CGCATCGCCG TCATCGGCCA ACTCGCCGAC ACCCTCATGG AAGACTGGTA CAGCGGCACC CTCCCCTACG CCATCACCGC CCGCGCCGGC CTCGCCGAAC GCACCGAAAC CGTGTTCTGC GAAGGCGTCG ACCGCATCGC GCTCCGCACC AACGAGGGCT ACCTCACCGC CAGCGCCGAC GGCACCCCCA TGACCATCAC CCCCGCCCCC GGCTTCGGCC CCGTCGCCGA ATCCGCCGCC TTCGACCTCT TCGACTGGGG CGGCGCCTGG GCCCTCCGCG CCGTCGTCAA CGGCCGCTAC GTCTCCGAAG ACGAAAACGG CCACCTCACC AACGACCAGC CCGGGCCCAA CGGCTGGGAA GTCCGCCAAA CCTTCCGCTG GCAACCCGAC CCCAACGGCA CCGGCGTCCT GCAACACATC GCCACCGGCC GCTACGTCGC CGTCGGCGAC AACAACACCG TCACCCTCAC CCCGGACGCC GACAGCGCCG CCGTCTTCGC CATCGACACC CTCCGCTCCG GCGCCACCGA AGCCGCCGCC ATCGCCGCCA CCGCCGAAGT CGCCATCGTC GTCGCCGGCG ACCACCCCCT CGTCAACGGA CGCGAAACCG AAGACCGCAC CGATCTCGAC CTGCCCGCCG CCCAAGAAAA AGTGCTCCGC GCCGTCCGCG CCGCCAACCC CGCCACCGTC CTCGTCCTCA CCAGCGGCTA CCCGTTCGGC ATCGTGTGGG CCGACGAACA CATCCCCGCC ATCCTCTGGT CAGCCCACGG CGGCCAAGAA TACGGGCGCG CCCTCGCCGA CGTCCTCTTC GGCGACGCCG ACCCCACCGG ACGCCTCACC CAAACCTGGT ACCGCTCCGC AGCCGAACTC CCCGACCTGT TCGACTACGA CATCATCGCC AACGACGCCA CCTACCTGTA CTACCTCGGC TCCCCGCTCT ACCCCTTCGG GCACGGACTC AGCTACACCA CCTTCGACTA CACCGACCCT GAAGTCCACG TCACCGACGA CCACGTCACC GTCCACGTCA CCGTCACCAA CACCGGCGAC CGCTTCGGTG AAGAAGTCGT CCAGTGCTAC ACCCACCAGC GGGTCTCCCG CGTCAAACAA CCACTCCGCA AACTCCAAGG ATTCGCCCGC GTCGCCCTCC ACCCCGGCGA AACCCGCCGC GTCCGCATCG ACTTCCCCAT CCACGCGCTC GCCATCTGGG ACGTCACCCG ATCCCGGTTC GTCGTCGAAG ACGCACCCCG CACCGTCCAC CTCGGACGCT CCGCCAAAGA CCTGCGGGTC TGCGCCCCCC TCAACGTCCC CGGCGAACCC ATCGGCCCCC GCCCCGCCAC CCGGCTCCAC GCCGCCGACA ACGACGAATA CCACGGCGTC GTGCTGTGCG CCGCCGCCAA AGACCGCGGA GACGCGGTAC GCGCCACCGA ACCCGGCGCC TGGATCGCCT TCCTCGACGT GGACTTCACC CCCGCCCCCA AGGAAGCCGT CATCTGCGCC AACACCGACC AGCCAGGCAC ACTCACCCTG CGCATCGACA ACCCGATCAG CGGCCCCACC GTCGCCACCG CCGACATCCC CGCCGCCACC GACCACTTCG ACTTCACCGA AGTCCGCATC CCCGTCGACA GCGTCACCCG CCGACACGAC CTCTACATCG TGTTCGAAGC CGCAGGTACC GCCCTCTCCT GGATCGACCT GTCCTAA
|
Protein sequence | MTEPFRDPTL PPHERVRDLL ARLTTEEKIG LLHQYQRPIP RLGIASFRTG TEALHGLAWH GPATVFPQAI GLASTWDPDL VQQVGAATAA EVLVFHTKNP ATVGRNVWAP VVNPLRDPRW GRNEEGYSED PWLTGVMAVS YARGLAGPHP HRMDTAPTLK HFLAYNNETD RCTSSSHLPP RVLHEYELPA FLPALREGVA VAVMPSYNLV NGRPAHLSPL INDVLRAAAP DELMVVSDAM APGNLVDPQH YYDDHATAYA HALRAGIDSF TQDDDRAEAT LAHLRDALDR GLITEEDLDR AATHILSVRV RLGEFDPEPL RRVDPDTVNS PAHQALARTA ARRSIVLLKN DGILPLRDPR RIAVIGQLAD TLMEDWYSGT LPYAITARAG LAERTETVFC EGVDRIALRT NEGYLTASAD GTPMTITPAP GFGPVAESAA FDLFDWGGAW ALRAVVNGRY VSEDENGHLT NDQPGPNGWE VRQTFRWQPD PNGTGVLQHI ATGRYVAVGD NNTVTLTPDA DSAAVFAIDT LRSGATEAAA IAATAEVAIV VAGDHPLVNG RETEDRTDLD LPAAQEKVLR AVRAANPATV LVLTSGYPFG IVWADEHIPA ILWSAHGGQE YGRALADVLF GDADPTGRLT QTWYRSAAEL PDLFDYDIIA NDATYLYYLG SPLYPFGHGL SYTTFDYTDP EVHVTDDHVT VHVTVTNTGD RFGEEVVQCY THQRVSRVKQ PLRKLQGFAR VALHPGETRR VRIDFPIHAL AIWDVTRSRF VVEDAPRTVH LGRSAKDLRV CAPLNVPGEP IGPRPATRLH AADNDEYHGV VLCAAAKDRG DAVRATEPGA WIAFLDVDFT PAPKEAVICA NTDQPGTLTL RIDNPISGPT VATADIPAAT DHFDFTEVRI PVDSVTRRHD LYIVFEAAGT ALSWIDLS
|
| |