Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tfu_2176 |
Symbol | |
ID | 3580397 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobifida fusca YX |
Kingdom | Bacteria |
Replicon accession | NC_007333 |
Strand | + |
Start bp | 2552833 |
End bp | 2555475 |
Gene Length | 2643 bp |
Protein Length | 880 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637685881 |
Product | endoglucanase |
Protein accession | YP_290232 |
Protein GI | 72162575 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0658592 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGTCA CTGAACCTCC TCCCCGCCGA CGCGGCCGTC ACAGCCGGGC GCGCCGCTTC CTCACTTCCC TGGGAGCTAC CGCGGCCCTC ACCGCGGGCA TGCTGGGCGT CCCCTTGGCC ACGGGAACCG CCCACGCCGA ACCGGCGTTC AACTACGCCG AAGCCCTCCA GAAGTCGATG TTCTTCTACG AGGCCCAACG CTCCGGGAAA CTCCCGGAGA ACAACCGGGT CTCCTGGCGC GGCGACTCCG GGCTCAACGA CGGCGCGGAC GTGGGACTCG ACCTCACCGG CGGCTGGTAC GACGCCGGCG ACCACGTGAA ATTCGGCTTC CCCATGGCCT TCACCGCGAC CATGCTCGCC TGGGGCGCCA TCGAAAGCCC GGAAGGCTAC ATCCGCTCCG GCCAGATGCC CTACCTCAAG GACAACCTGC GCTGGGTCAA CGACTACTTC ATCAAAGCCC ACCCCTCGCC CAACGTGCTG TACGTGCAGG TCGGCGACGG CGACGCCGAC CACAAGTGGT GGGGTCCGGC CGAAGTCATG CCGATGGAGC GGCCCAGCTT CAAAGTGGAC CCCTCCTGCC CGGGCAGCGA CGTCGCAGCC GAAACCGCCG CGGCCATGGC CGCGTCCTCC ATCGTGTTCG CCGACGACGA CCCTGCGTAC GCGGCCACCC TCGTGCAGCA CGCCAAGCAG CTCTACACGT TCGCCGACAC CTACCGCGGC GTGTACTCCG ACTGCGTGCC CGCCGGAGCG TTCTACAACT CCTGGTCGGG CTACCAGGAC GAGCTCGTCT GGGGCGCCTA CTGGCTGTAC AAGGCCACCG GGGACGACTC CTACTTGGCG AAGGCCGAGT ACGAGTACGA CTTCCTCTCC ACCGAGCAGC AGACCGACCT CCGCAGCTAC CGGTGGACCA TCGCCTGGGA CGACAAGTCC TACGGCACCT ACGTGCTGCT CGCCAAGGAA ACCGGCAAGC AAAAATACAT CGACGACGCC AACCGGTGGC TCGACTACTG GACGGTCGGC GTCAACGGCC AGCGCGTGCC CTACTCCCCC GGCGGGATGG CTGTGCTCGA CACCTGGGGA GCCCTGCGCT ACGCCGCTAA CACCGCGTTC GTCGCCCTCG TCTACGCCAA GGTGATCGAC GACCCCGTCC GCAAGCAGCG GTACCACGAC TTCGCGGTGC GGCAGATCAA CTACGCGCTC GGCGACAACC CGCGGAACTC CAGCTACGTG GTGGGCTTCG GCAACAACCC GCCGCGCAAC CCCCACCACC GCACCGCGCA CGGGTCGTGG ACCGACAGCA TCGCCTCGCC CGCGGAGAAC CGGCACGTCC TCTACGGCGC CCTCGTCGGC GGTCCCGGCT CCCCGAACGA CGCCTACACC GACGACCGGC AGGACTACGT CGCCAACGAA GTCGCCACCG ACTACAACGC CGGATTCTCC AGCGCGCTGG CCATGCTGGT CGAAGAGTAC GGCGGCACCC CGCTGGCGGA CTTCCCGCCC ACCGAGGAGC CCGACGGACC GGAGATCTTC GTGGAAGCCC AGATCAACAC GCCGGGCACC ACGTTCACCG AGATCAAAGC CATGATCCGC AACCAGTCGG GCTGGCCGGC CCGGATGCTG GACAAGGGCA CCTTCCGGTA CTGGTTCACC CTCGATGAAG GCGTGGACCC CGCGGACATC ACGGTGAGCT CCGCCTACAA CCAGTGCGCC ACCCCGGAGG ACGTCCACCA CGTCTCCGGC GACCTGTACT ACGTGGAGAT CGACTGCACC GGGGAGAAGA TCTTCCCCGG CGGCCAGTCG GAGCACCGCC GCGAAGTCCA GTTCCGCATC GCCGGCGGCC CCGGATGGGA CCCCTCCAAC GACTGGTCCT TCCAAGGCAT CGGCAACGAA CTCGCCCCCG CCCCGTACAT CGTGCTCTAC GACGACGGTG TACCGGTGTG GGGCACCGCC CCCGAGGAAG GGGAAGAGCC CGGCGGCGGA GAAGGACCGG GAGGCGGCGA AGAACCCGGC GAGGACGTGA CCCCGCCGAG CGCGCCCGGC TCCCCCGCGG TCCGGGACGT CACCTCCACC AGCGCCGTGC TCACCTGGTC CGCGTCCAGC GACACCGGCG GCAGCGGCGT AGCCGGGTAC GACGTCTTCC TCCGCGCCGG CACAGGCCAG GAGCAGAAGG TGGGATCCAC CACCCGGACC AGCTTCACCT TGACCGGTCT AGAACCCGAC ACCACCTACA TCGCGGCTGT CGTGGCCCGG GACAACGCGG GCAACGTCTC CCAGCGGAGT ACCGTCTCCT TCACCACGCT GGCCGAGAAC GGCGGCGGGC CTGACGCCTC CTGCACGGTG GGCTACAGCA CCAACGACTG GGACTCCGGA TTCACCGCTT CGATCCGGAT CACCTACCAC GGCACGGCCC CGCTCTCCAG CTGGGAGCTC TCCTTCACCT TCCCCGCTGG CCAGCAGGTC ACCCACGGGT GGAACGCCAC GTGGCGCCAG GACGGGGCCG CCGTGACCGC CACCCCCATG TCCTGGAACA GCTCCCTGGC GCCTGGAGCC ACCGTCGAGG TCGGCTTCAA CGGCTCCTGG AGCGGCAGCA ACACACCTCC CACCGATTTC ACCCTCAACG GCGAGCCCTG CGCCCTCGCC TAA
|
Protein sequence | MSVTEPPPRR RGRHSRARRF LTSLGATAAL TAGMLGVPLA TGTAHAEPAF NYAEALQKSM FFYEAQRSGK LPENNRVSWR GDSGLNDGAD VGLDLTGGWY DAGDHVKFGF PMAFTATMLA WGAIESPEGY IRSGQMPYLK DNLRWVNDYF IKAHPSPNVL YVQVGDGDAD HKWWGPAEVM PMERPSFKVD PSCPGSDVAA ETAAAMAASS IVFADDDPAY AATLVQHAKQ LYTFADTYRG VYSDCVPAGA FYNSWSGYQD ELVWGAYWLY KATGDDSYLA KAEYEYDFLS TEQQTDLRSY RWTIAWDDKS YGTYVLLAKE TGKQKYIDDA NRWLDYWTVG VNGQRVPYSP GGMAVLDTWG ALRYAANTAF VALVYAKVID DPVRKQRYHD FAVRQINYAL GDNPRNSSYV VGFGNNPPRN PHHRTAHGSW TDSIASPAEN RHVLYGALVG GPGSPNDAYT DDRQDYVANE VATDYNAGFS SALAMLVEEY GGTPLADFPP TEEPDGPEIF VEAQINTPGT TFTEIKAMIR NQSGWPARML DKGTFRYWFT LDEGVDPADI TVSSAYNQCA TPEDVHHVSG DLYYVEIDCT GEKIFPGGQS EHRREVQFRI AGGPGWDPSN DWSFQGIGNE LAPAPYIVLY DDGVPVWGTA PEEGEEPGGG EGPGGGEEPG EDVTPPSAPG SPAVRDVTST SAVLTWSASS DTGGSGVAGY DVFLRAGTGQ EQKVGSTTRT SFTLTGLEPD TTYIAAVVAR DNAGNVSQRS TVSFTTLAEN GGGPDASCTV GYSTNDWDSG FTASIRITYH GTAPLSSWEL SFTFPAGQQV THGWNATWRQ DGAAVTATPM SWNSSLAPGA TVEVGFNGSW SGSNTPPTDF TLNGEPCALA
|
| |