Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tfu_0901 |
Symbol | |
ID | 3579455 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobifida fusca YX |
Kingdom | Bacteria |
Replicon accession | NC_007333 |
Strand | - |
Start bp | 1062716 |
End bp | 1064116 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637684596 |
Product | cellulase |
Protein accession | YP_288962 |
Protein GI | 72161305 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAAAT CCCCCGCCGC CCGGAAGGGC GGCCCTCCGG TCGCTGTCGC GGTGACCGCG GCCCTCGCCC TGCTGATCGC GCTCCTCTCC CCCGGAGTCG CGCAGGCCGC CGGTCTCACC GCCACAGTCA CCAAAGAATC CTCGTGGGAC AACGGCTACT CCGCGTCCGT CACCGTCCGC AACGACACCT CGAGCACCGT CTCCCAGTGG GAGGTCGTCC TCACCCTGCC CGGCGGCACT ACAGTGGCCC AGGTGTGGAA CGCCCAGCAC ACCAGCAGCG GCAACTCCCA CACCTTCACC GGGGTTTCCT GGAACAGCAC CATCCCGCCC GGAGGCACCG CCTCCTTCGG CTTCATCGCT TCCGGCAGCG GCGAACCCAC CCACTGCACC ATCAACGGCG CCCCCTGCGA CGAAGGCTCC GAGCCGGGCG GCCCCGGCGG TCCCGGAACC CCCTCCCCCG ACCCCGGCAC GCAGCCCGGC ACCGGCACCC CGGTCGAGCG GTACGGCAAA GTCCAGGTCT GCGGCACCCA GCTCTGCGAC GAGCACGGCA ACCCGGTCCA ACTGCGCGGC ATGAGCACCC ACGGCATCCA GTGGTTCGAC CACTGCCTGA CCGACAGCTC GCTGGACGCC CTGGCCTACG ACTGGAAGGC CGACATCATC CGCCTGTCCA TGTACATCCA GGAAGACGGC TACGAGACCA ACCCGCGCGG CTTCACCGAC CGGATGCACC AGCTCATCGA CATGGCCACG GCGCGCGGCC TGTACGTGAT CGTGGACTGG CACATCCTCA CCCCGGGCGA TCCCCACTAC AACCTGGACC GGGCCAAGAC CTTCTTCGCG GAAATCGCCC AGCGCCACGC CAGCAAGACC AACGTGCTCT ACGAGATCGC CAACGAACCC AACGGAGTGA GCTGGGCCTC CATCAAGAGC TACGCCGAAG AGGTCATCCC GGTGATCCGC CAGCGCGACC CCGACTCGGT GATCATCGTG GGCACCCGCG GCTGGTCGTC GCTCGGCGTC TCCGAAGGCT CCGGCCCCGC CGAGATCGCG GCCAACCCGG TCAACGCCTC CAACATCATG TACGCCTTCC ACTTCTACGC GGCCTCGCAC CGCGACAACT ACCTCAACGC GCTGCGTGAG GCCTCCGAGC TGTTCCCGGT CTTCGTCACC GAGTTCGGCA CCGAGACCTA CACCGGTGAC GGCGCCAACG ACTTCCAGAT GGCCGACCGC TACATCGACC TGATGGCGGA ACGGAAGATC GGGTGGACCA AGTGGAACTA CTCGGACGAC TTCCGTTCCG GCGCGGTCTT CCAGCCGGGC ACCTGCGCGT CCGGCGGCCC GTGGAGCGGT TCGTCGCTGA AGGCGTCCGG ACAGTGGGTG CGGAGCAAGC TCCAGTCCTG A
|
Protein sequence | MAKSPAARKG GPPVAVAVTA ALALLIALLS PGVAQAAGLT ATVTKESSWD NGYSASVTVR NDTSSTVSQW EVVLTLPGGT TVAQVWNAQH TSSGNSHTFT GVSWNSTIPP GGTASFGFIA SGSGEPTHCT INGAPCDEGS EPGGPGGPGT PSPDPGTQPG TGTPVERYGK VQVCGTQLCD EHGNPVQLRG MSTHGIQWFD HCLTDSSLDA LAYDWKADII RLSMYIQEDG YETNPRGFTD RMHQLIDMAT ARGLYVIVDW HILTPGDPHY NLDRAKTFFA EIAQRHASKT NVLYEIANEP NGVSWASIKS YAEEVIPVIR QRDPDSVIIV GTRGWSSLGV SEGSGPAEIA ANPVNASNIM YAFHFYAASH RDNYLNALRE ASELFPVFVT EFGTETYTGD GANDFQMADR YIDLMAERKI GWTKWNYSDD FRSGAVFQPG TCASGGPWSG SSLKASGQWV RSKLQS
|
| |