Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tfu_1074 |
Symbol | |
ID | 3580066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobifida fusca YX |
Kingdom | Bacteria |
Replicon accession | NC_007333 |
Strand | - |
Start bp | 1254785 |
End bp | 1256110 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637684769 |
Product | endoglucanase |
Protein accession | YP_289135 |
Protein GI | 72161478 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0857507 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCCCA GACCTCTTCG CGCTCTTCTG GGCGCCGCGG CGGCGGCCTT GGTCAGCGCG GCTGCTCTGG CCTTCCCGTC GCAAGCGGCG GCCAATGATT CTCCGTTCTA CGTCAACCCC AACATGTCCT CCGCCGAATG GGTGCGGAAC AACCCCAACG ACCCGCGTAC CCCGGTAATC CGCGACCGGA TCGCCAGCGT GCCGCAGGGC ACCTGGTTCG CCCACCACAA CCCCGGGCAG ATCACCGGCC AGGTCGACGC GCTCATGAGC GCCGCCCAGG CCGCCGGCAA GATCCCGATC CTGGTCGTGT ACAACGCCCC GGGCCGCGAC TGCGGCAACC ACAGCAGCGG CGGCGCCCCC AGTCACAGCG CCTACCGGTC CTGGATCGAC GAATTCGCTG CCGGACTGAA GAACCGTCCC GCCTACATCA TCGTCGAACC GGACCTGATC TCGCTGATGT CGAGCTGCAT GCAGCACGTC CAGCAGGAAG TCCTGGAGAC GATGGCGTAC GCGGGCAAGG CCCTCAAGGC CGGGTCCTCG CAGGCGCGGA TCTACTTCGA CGCCGGCCAC TCCGCGTGGC ACTCGCCCGC ACAGATGGCT TCCTGGCTCC AGCAGGCCGA CATCTCCAAC AGCGCGCACG GTATCGCCAC CAACACCTCC AACTACCGGT GGACCGCTGA CGAGGTCGCC TACGCCAAGG CGGTGCTCTC GGCCATCGGC AACCCGTCCC TGCGCGCGGT CATCGACACC AGCCGCAACG GCAACGGCCC CGCCGGTAAC GAGTGGTGCG ACCCCAGCGG ACGCGCCATC GGCACGCCCA GCACCACCAA CACCGGCGAC CCGATGATCG ACGCCTTCCT GTGGATCAAG CTGCCGGGTG AGGCCGACGG CTGCATCGCC GGCGCCGGCC AGTTCGTCCC GCAGGCGGCC TACGAGATGG CGATCGCCGC GGGCGGCACC AACCCCAACC CGAACCCCAA CCCGACGCCC ACCCCCACTC CGACCCCCAC GCCGCCTCCC GGCTCCTCGG GGGCGTGCAC GGCGACGTAC ACGATCGCCA ACGAGTGGAA CGACGGCTTC CAGGCGACCG TGACGGTCAC CGCGAACCAG AACATCACCG GCTGGACCGT GACGTGGACC TTCACCGACG GCCAGACCAT CACCAACGCC TGGAACGCCG ACGTGTCCAC CAGCGGCTCC TCGGTGACCG CGCGGAACGT CGGCCACAAC GGAACGCTCT CCCAGGGAGC CTCCACAGAG TTCGGCTTCG TCGGCTCTAA GGGCAACTCC AACTCTGTTC CGACCCTTAC CTGCGCCGCC AGCTGA
|
Protein sequence | MSPRPLRALL GAAAAALVSA AALAFPSQAA ANDSPFYVNP NMSSAEWVRN NPNDPRTPVI RDRIASVPQG TWFAHHNPGQ ITGQVDALMS AAQAAGKIPI LVVYNAPGRD CGNHSSGGAP SHSAYRSWID EFAAGLKNRP AYIIVEPDLI SLMSSCMQHV QQEVLETMAY AGKALKAGSS QARIYFDAGH SAWHSPAQMA SWLQQADISN SAHGIATNTS NYRWTADEVA YAKAVLSAIG NPSLRAVIDT SRNGNGPAGN EWCDPSGRAI GTPSTTNTGD PMIDAFLWIK LPGEADGCIA GAGQFVPQAA YEMAIAAGGT NPNPNPNPTP TPTPTPTPPP GSSGACTATY TIANEWNDGF QATVTVTANQ NITGWTVTWT FTDGQTITNA WNADVSTSGS SVTARNVGHN GTLSQGASTE FGFVGSKGNS NSVPTLTCAA S
|
| |