Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0639 |
Symbol | |
ID | 4808168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 789712 |
End bp | 790914 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640106053 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001037067 |
Protein GI | 125973157 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0296] 1,4-alpha-glucan branching enzyme |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.401976 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATATC CGTTTTATTA TTGTTGCGAC AATCTTGATA TTAAAAGTAT ATCTGTTATA GGAGATTTCA ATAACTGGGA CGGCGGCAAA AATATTTTGA AAAGGCATGA TGACGGTACA TGGATGACGG AAATCGAACT GTCTCCCGGC AAATATAGAT ACAAATTTTT AATAAATGAC AGCATACTTT TTAATGACCC AAACGCCCTG ATTTACACCT ATGATGACAA GGGCGGCACT GCGTCTTTTA TCATAATTGA TGAAAGCAAC AAGAGGATTA TCAATACGAA ACCCGCTTCG CTCAATATTG AAAGTTACAG CTTTTTGAGC GTTGACAAAG AACTTTCATA CTGTGAAAAC GGTGAAAGAA AATGCTATAA GGACAAACAC TCAAGTATTA TGCTGAAAAT CGGTTTTTCC GACATAACGG GCTTGCATAC GGTAAACGCA ATATGGTATG CTCCCAATAA CGAAATCCAT GAGATTCAGG AAAGTGTCCT AAACGACGGC GAAAAAAATG AAAACAGGAA AAATGAAGCT TTATTTCAGA TCAATATAAA TGATGATACA ATTGCGGGCG AATGGAAACT GCAAATATTT ATAAACGGCA GCCTCGCTTT AGAAGACAGT TTCACAGTAG AATTAAAACA ACCGGAAGTC TTGGATAAAA CGGCTTTGGA CGATGCTGCC ATGCTGTCCG CGGACCTGGA AGTACCTGAT TTCTTAATCG AAGAGGATAT GCAGAACGAG TTTGAAGCAC CTGAATTTGA AGTGGAATCC AATTTGCAAA CCAACGCCGA AAAAGAAGAA ATTTCGGAGG AAGACTTGTT GGGACTCTTT GATGATATTA AAGAAATAGA TATGCCAGAG AAAGATACAT CTGGCAATCA AAAAACTCCT GCCCCGGATG AAACTGAAAT GCCGGCCGAC GGCAATATTA TGAATTTGTT TGATGAAATT AGAATTTCGG AATTAAAAGA CGAAAAAGCA CAGGATACGG AAAACAATAC CGAAAGTACT TCAAAAGAGG CGGAAGAATC CGACAGTGCT ATTGATGAGC TGCTTGAACT AAGAGAGTTT ATTGATTCCT CTCAGAATTC CGAAAAAGAC TCAATTGAGT CCTCCGGCGG CAAATCCGGG GAAGAAGCAA ATGAAGAAGA TATCAGTGAC ATATTTATAG ATATAAAAAA TAGAACTGAT TGA
|
Protein sequence | MKYPFYYCCD NLDIKSISVI GDFNNWDGGK NILKRHDDGT WMTEIELSPG KYRYKFLIND SILFNDPNAL IYTYDDKGGT ASFIIIDESN KRIINTKPAS LNIESYSFLS VDKELSYCEN GERKCYKDKH SSIMLKIGFS DITGLHTVNA IWYAPNNEIH EIQESVLNDG EKNENRKNEA LFQININDDT IAGEWKLQIF INGSLALEDS FTVELKQPEV LDKTALDDAA MLSADLEVPD FLIEEDMQNE FEAPEFEVES NLQTNAEKEE ISEEDLLGLF DDIKEIDMPE KDTSGNQKTP APDETEMPAD GNIMNLFDEI RISELKDEKA QDTENNTEST SKEAEESDSA IDELLELREF IDSSQNSEKD SIESSGGKSG EEANEEDISD IFIDIKNRTD
|
| |