Gene Information |
Locus tag | Cthe_1787 |
Symbol | |
ID | 4810032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2107797 |
End bp | 2109731 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640107201 |
Product | glycoside hydrolase 15-related |
Protein accession | YP_001038201 |
Protein GI | 125974291 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | [TIGR01577] oligosaccharide amylase |
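
The coordinate and length fields in this table can be cross-checked against each other. The sketch below is only a consistency check on the values listed above. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2107797
end_bp = 2109731

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1935
print(protein_length)  # 644
```

Both results agree with the Gene Length (1935 bp) and Protein Length (644 aa) fields.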
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.26438 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAACA CATACTTTAA TGATGCGATT ATCGGAAATT CCGGCATGCT TGTGTGTCTG ACACGAAATG GTGAGCTTAC AAGACTTTTT TGGCCCAACA TTGACTATCC GCAGCATTTT GAAAAAATGG CGACGGGAAT CTTTTATACG GGCCAGAAAA ACAGCACTTC CTGGTTTTAT GAAGACAACT GGCACCACAC TCAGTATTAT GTGGAAGACA CCAATATTCT AAAAACAATA TGTGAGGATG GCGGCAGAGG ACTTCGAGTG GAGCAGACGG ACTTTGTTCT GAAAGACAGG GATGTAATGG TAAGAAGATA TGTAATAGAA AATATTGGGC CAAATGAAGT GGATCTTGGT TTTGTTCAAT ATTCTTCAAC AGTCTCTACA ACTCCTGAAC TAAGAAGCAC TCTGTTTGAC TTTAATGTGG ATGCTCTTAT TCATTACAGG CATAATTACT ATATTTCAAT TTCATCAGAC AGTGAAGTTG TACAGTTTCA ATTGGGGAAT AACGCTTTTG ACTGTGCAAG GTACACGGAG CTTTACGGAT ATGACTCCAT AGGAATGATG AAGGATGGGG CCATGTCCTT TAATATTGGG AAGATTGAAC CAGGAGGAAA GAAAACTTTT AATCTCTTCA TCTGTGCCTC CCATACCTTA AAAGGCGTTA AAGAGTTGGT AAGATGGTGC AGAAAAATGA ATGTCGATGA GGAGTATGAG AAAACTCGCA AGTATTGGCT GGACTTTCTG AAAAATGCAA GATTGATTGT AACGGGAGAC AAGAATATTG ACAACTTGTA TAAAAGGTCT ATTTTGGTGT TTAAGCTTAT GTCCGATGAA CGGACGGGAG GATTGCTGGC TTCAGCGGAA ATAGATGAAG GATTTACAAG GTGCGGGCGC TATGCATACT GCTGGGGAAG GGATGCGGCA TTTATTACCG GCGCACTGGA TACCGCAGGG CTTACCGAAG CAGTGGACAA ATTCTACCAG TGGGCTGTTA TGACCCAGGA TGATGACGGC TCGTGGCAGC AAAGATATCA CATGGATGGA AACCTTGCCC CATCATGGGG ACTTCAGATA GACGAGACAG GTACCCTTAT ATGGGGTATG TTAAAACACT ATGAGGTCAC AAAGAATATA GATTTTCTTA AAAGCATGTG GGAGAGCATA AAAAAAGGTG TTGAATTTTT AACCCGGTTT ATAGACAGCG ACACAGGATT GCCTGCCCCA AGCTATGATT TGTGGGAGGA AAGGGTTGGA GAACACACTT ATTCCAGTGC CGCAGTATAT GCGGGCATTA AAGCCGGTGC TGAAGCGGCA CGAATTCTTG GTGCTTCCGA AGAATTAATT GAAAAATGGG AAAAGGCGGC TTCTGATATG AAAGCCTCCA TAGAGAAAAA TTTTTGGAGG GATGAGGCCG GAAGATTCAT CAGAAGTGTG CGCACTAAGC TCAATCCATG GGGAAGTGAA CATTCGCCCT ACACGACGGT AATAAAAGTC AATGAAAAAG GGTATTTCAG GGATGTTACT TTGGAAGACT GGACCATTGA TGTAAGCCTT CTTGGAGTTT CCATTCCTTT CGGTGTTTTT GATGTGCATG ACGAACGTGT GAAAAAAACA GTAGAAGCCA TTGAAAGGGC TTTGACTTCC CACCCGGTGG GCGGAATAAA AAGATATGAA AATGACAACT ATATTGGAGG AAACCCGTGG GTATTGGCAA CTTTGTGGGT GGCCCTGTAT TATATTGAGA TAAAAGAATA TGAAAAAGCA AAGGATTATT TGAGATGGGC GACCAAATCC 
TGTACCGCTT TGGGGCTTTT GCCGGAGCAG GTGAGCAAGG ACAACGGCGA GCCTTGCTGG GTAATACCTC TTACATGGTC CCACGCCATG TATGTATTGG TGCTTGCAGG ACTTAAAGAG GCGGGGGTTT TATAA
Protein sequence | MANTYFNDAI IGNSGMLVCL TRNGELTRLF WPNIDYPQHF EKMATGIFYT GQKNSTSWFY EDNWHHTQYY VEDTNILKTI CEDGGRGLRV EQTDFVLKDR DVMVRRYVIE NIGPNEVDLG FVQYSSTVST TPELRSTLFD FNVDALIHYR HNYYISISSD SEVVQFQLGN NAFDCARYTE LYGYDSIGMM KDGAMSFNIG KIEPGGKKTF NLFICASHTL KGVKELVRWC RKMNVDEEYE KTRKYWLDFL KNARLIVTGD KNIDNLYKRS ILVFKLMSDE RTGGLLASAE IDEGFTRCGR YAYCWGRDAA FITGALDTAG LTEAVDKFYQ WAVMTQDDDG SWQQRYHMDG NLAPSWGLQI DETGTLIWGM LKHYEVTKNI DFLKSMWESI KKGVEFLTRF IDSDTGLPAP SYDLWEERVG EHTYSSAAVY AGIKAGAEAA RILGASEELI EKWEKAASDM KASIEKNFWR DEAGRFIRSV RTKLNPWGSE HSPYTTVIKV NEKGYFRDVT LEDWTIDVSL LGVSIPFGVF DVHDERVKKT VEAIERALTS HPVGGIKRYE NDNYIGGNPW VLATLWVALY YIEIKEYEKA KDYLRWATKS CTALGLLPEQ VSKDNGEPCW VIPLTWSHAM YVLVLAGLKE AGVL
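
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It uses only the first 30 and last 15 bases listed above and a hand-built codon table. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons, so a standard table is used here. The function name `translate` is hypothetical, and the gene sequence is assumed to be given in coding orientation, as its leading ATG suggests, even though the gene lies on the minus strand.

```python
# Build the codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    # Drop the spaces used to group the sequence into blocks of ten.
    seq = "".join(dna.split()).upper()
    # Ignore any trailing bases that do not form a complete codon.
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 15 bases of the gene sequence above.
head = "ATGGCGAACA CATACTTTAA TGATGCGATT"
tail = "GCGGGGGTTT TATAA"

print(translate(head))  # MANTYFNDAI
print(translate(tail))  # AGVL*
```

The first output matches the opening ten residues of the protein sequence, and the second matches its final four residues followed by the stop codon.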