Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3112 |
Symbol | |
ID | 4809743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3672702 |
End bp | 3673694 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640108545 |
Product | glycosidase, PH1107-related |
Protein accession | YP_001039500 |
Protein GI | 125975590 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2152] Predicted glycosylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.204861 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAGCA GATTCGAAAC AAGGCTTATG GGTTCTGACA GGCATTCTTA CAAAGAACTT TTCAAAAGAT TCCCAGGTAA TCCGATCCTT TCAGCCAAAA ATTGGCCTTA TGCGGCCAAC ACGGTATTTA ATCCCGCAGC CACCATGTAT AACGGCAAAG TTTTACTGCT GATAAGAGTC GAAGACAGAA GGGGTTTTTC TCATCTGACA AAGGCAATCA GCGATGATGG TATAAGCAAC TGGATAATTG ATGACAAACC TACTTTGGAG GCAGAGCCTG AAAAATTTCC CGAGGAAGAA TGGGGAGTTG AAGACCCGAG AATTACATGG ATTGAAGAAT TGGGGAAATT TGCTGTTGTA TATACTGCCT ATTCCAAAGG AGGACCGCTG GTATCTTTGG CGCTTACGGA GGATTTTGAA AACTTTGAGA AACTTGGAGC CATTATGCCT CCCGAGGACA AGGATGCGGC ATTGTTCCCC AGGAGGATTA ACGGAAAATG GGTTCTTATA CACAGACCCA TTTCAATTCA TCATGGACCG GGAGCACATA TATGGATTTC CCGTTCCGAT GACCTCAAAT ATTGGGGTGA TCATCAGATA TTAATCAGAG CCAGAAAAGG CGGATGGTGG GATGCCAACA AAGTGGGACT GAATTGCCCG CCACTTGAGA CTCCTGACGG ATGGCTTATA TTATACCATG GAGTAAGACA GACCGCATCC GGATCCATTT ACAGGCTTGG ACTTGCACTT TTAGACCTTG AGAATCCTTC GAAAGTTTTG CGCAGAAGTG ATGAATGGGT ATTTGGACCG CAGGAGTTTT ATGAAAGAGA AGGAGACGTT GATGATGTGG TGTTCCCCTG CGGATGGGTT TATAATGAAA AGACAGGGGA AATTAAAATA TATTATGGTG CCGCTGATAC TTGCATTGCA ATGGCAACGA CAAATATTAG CGATTTGCTT GATTATATAA AAAGATGTCC TGAACCAAAA TAG
|
Protein sequence | MASRFETRLM GSDRHSYKEL FKRFPGNPIL SAKNWPYAAN TVFNPAATMY NGKVLLLIRV EDRRGFSHLT KAISDDGISN WIIDDKPTLE AEPEKFPEEE WGVEDPRITW IEELGKFAVV YTAYSKGGPL VSLALTEDFE NFEKLGAIMP PEDKDAALFP RRINGKWVLI HRPISIHHGP GAHIWISRSD DLKYWGDHQI LIRARKGGWW DANKVGLNCP PLETPDGWLI LYHGVRQTAS GSIYRLGLAL LDLENPSKVL RRSDEWVFGP QEFYEREGDV DDVVFPCGWV YNEKTGEIKI YYGAADTCIA MATTNISDLL DYIKRCPEPK
|
| |