Gene Information |
Locus tag | Cthe_1471 |
Symbol | |
ID | 4810621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1787724 |
End bp | 1789409 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106892 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001037893 |
Protein GI | 125973983 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
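The length fields in this table can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in exactly one stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates as listed for Cthe_1471 on NC_009012
length_bp = gene_length(1787724, 1789409)
print(length_bp, protein_length(length_bp))  # prints: 1686 561
```

Under those assumptions the result agrees with the listed Gene Length (1686 bp) and Protein Length (561 aa).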
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATGCA GATATATGGA TTATTTAATA AACTGTGAGT TGGACTCTTT GCCTAATGAA AAATCCGAAA AAATAAACAA ACATATAAGC ACCTGCATGG ACTGCAGAAA TTATCTGGGT GCACTTATGA TATCCAAAAA ATATATTGCC AAAGAACCGG AAATGGATAA ATATTTTTAC ATGCGGGTTA TCAATGCCAT AGACCCTGAC AGATACAAGA AATCGAAACT GACTTTTAAG ATTCTGTCTT CATTGGAAAG GCTAAAGCCT GCTTTTAAAG CGTCTTTGGG AACTTTGGCA ATCTTTGTGG CAGTAGCTTT GTTAATAACC GGCGGAATTT TTGACAATCT CGGCAACTGG ATTGCCAAAA GCAGCAATAA CCGCTCCAAT ACCGGTGAAA CAACAAACCT GACCTTTTTG CGTACAGACG GTCAGAATAT TGTTACCGGT ACCGGTGAAA TCTTCCACAT AAGAGGTGTT ACCCTTACAA ACAACTTTTG GGGCAACTGG GTCAACGGAG AATCGGAAAA ATTGCAAAGT CAAGGTATGG ACCCTATTAT ACGCCCTCTT GTACAGGATG CCTGGGTGCT CACTGATGAT GATTTTGAGC GTATTAAAGA CCTGGGCTGC AACACGGTTT TATATGATAT CAATTACCAG CTTTTTGCAG AAGACAATCC AAACAGGGAA GAAAATCTCA AAAAGCTCAA AGAACATATA AGGCGTTTTT CCTCAATGGA CATATATACG GCTGTTATGC TAATGGCTCC TCCGGGACTG GATTCGATCA ATGACGCCTA CGAAAAGTAC AAACACGGCT CAGAACGTAT AAAATCTGTG TTTGAGGATG ATACCTACTA CGAACAGTGG GTTGAGATGT GGAAGTATTT GGCCGAAGAA CTTAAAGACT TTAAAGGTGT GGCAGGATAT GGACTTATAA ACCAGCCAAG AGCCCCGAGT GAAAGTGAAG GTGGAATCGG GATATTCAGG GAACGCCTGA ACAATGTATG CAGAGAAATA CGTAAAATTG ACAAAAATCA TATCATATTT GTTCCCGAAT ATAACAGCAG AGAGGCCAAT CCCGGCGAAT CCTACTGGAA CGAAAAAACA AATAGTTATG TAATAGACAA CGGTGAGCAA GGTATTATCT GGGAAAGAGG TTTGGTAAAA GTTGATTCAT CAAACGTAGT ATACTTGTTC CACTTTTTCG AACCATACAA CTTTGTCAAT GACGGTGTCG GAGATTTTGA TGCCGAAAGC CTTGAAGCTC AAGTCAGAAA ACGTTATGAA TGGGCTAAAA ATGTCGGCAG GGCTCCGCTT CTTACCGAAT ACGGAATCTC CCGGGTAAAC AGCGTAGACA AACGTGTACA ATGGCTTGAA ACCGTTCACG ACATCTTTGA TAAATACGGT ATCTCGGCTT CATACTTCCA ATATAAAAAT GCCGTAGGTG CTTTTATAAA TGTGAAAACC GGTTTTAACG CTTTATACGG AGAATATGTC AGCTGGGATA GTGAAATCGG CCTGAATCCC TTTTACTTTG TAAATGAACA CGTTGCCACA TCCGCAAAAG AAAATCATTT TGATGAAGCA CTTAAAGAGT ATTACCTTAA AGGTAAAAAC CTGAAAAAAA TTTCAATACT GGACAATCAG CCCATTCTTG AAACATTGCA AAATTTTTGG AAATAG
|
Protein sequence | MKCRYMDYLI NCELDSLPNE KSEKINKHIS TCMDCRNYLG ALMISKKYIA KEPEMDKYFY MRVINAIDPD RYKKSKLTFK ILSSLERLKP AFKASLGTLA IFVAVALLIT GGIFDNLGNW IAKSSNNRSN TGETTNLTFL RTDGQNIVTG TGEIFHIRGV TLTNNFWGNW VNGESEKLQS QGMDPIIRPL VQDAWVLTDD DFERIKDLGC NTVLYDINYQ LFAEDNPNRE ENLKKLKEHI RRFSSMDIYT AVMLMAPPGL DSINDAYEKY KHGSERIKSV FEDDTYYEQW VEMWKYLAEE LKDFKGVAGY GLINQPRAPS ESEGGIGIFR ERLNNVCREI RKIDKNHIIF VPEYNSREAN PGESYWNEKT NSYVIDNGEQ GIIWERGLVK VDSSNVVYLF HFFEPYNFVN DGVGDFDAES LEAQVRKRYE WAKNVGRAPL LTEYGISRVN SVDKRVQWLE TVHDIFDKYG ISASYFQYKN AVGAFINVKT GFNALYGEYV SWDSEIGLNP FYFVNEHVAT SAKENHFDEA LKEYYLKGKN LKKISILDNQ PILETLQNFW K
|
| |
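The sequences above are printed in space-separated blocks of 10 residues, so they need cleaning before use. The following is a minimal sketch, not the database's own tooling, of how one might strip the spacing, estimate GC content, and translate the gene. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons); the helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; table 11 shares these
# assignments with the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove whitespace between 10-residue blocks and uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above
fragment = clean_sequence("ATGAAATGCA GATATATGGA TTATTTAATA")
print(translate(fragment))  # prints: MKCRYMDYLI
```

The translated fragment matches the first block of the listed protein sequence. Running `gc_fraction` on the full cleaned gene sequence gives a value to compare against the listed GC content.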