Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2232 |
Symbol | |
ID | 4809970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2659417 |
End bp | 2660415 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640107638 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_001038627 |
Protein GI | 125974717 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCAAGG ATAAAAAGAT ATTGGTTATC GGAGGGACAG GCTCCATCGG ACAAGCTCTG ATTGAACGTA TTCTAACTGA AAATCCTGCC GTAATCAGAG TTTACAGCAG AGACGAGTAC AAGCAGTTTT TGCTTGCGGA AAAGTTCAGG GAAAACGCCA ATATCAGATA TCTTATAGGC GACGTAAGAG ATGAGGACAG ACTCGACAGG GCAATGAATG ATATCGATAT TGTGTTTAAT CTTGCGGCCT TAAAACACGT ACCTGCATGT GAGTACAACC CGTTTGAGGC TGTCAGGACC AATGTTATCG GAAGCCAGAA TGTTATAGCC TGTGCCATGG CAAACAAGGT GAAAAAAGTA ATTTATACGA GCAGTGACAA GGCTGTTTCT CCAACCAATA CCATGGGTGC AACAAAGCTT CTGGCAGAAA GACTCATGTC ATCATCCAAC TATTCAAGAG GAAACACGGG TACGGTGTTT GCTTCTGTCA GATTCGGAAA TGTAATGGGC ACAAGAGGTT CAGTGATTCC GCTTTTCAAG CAGCAGGTTT TGGAAAAGGG ATATATTACG GTGACCGAAC CGGAGATGAC GAGGTTTATG ATGTCCCTCT CGCAGGCGGT GGAACTTACA ATTAAGGCAT GCAGCATTGC AAGAGGCGGG GAAGTTTTTG TGCTGAAAAT GCCTGTAATC CGGTTAAAAG ACCTGGCAGA AGTGATTATT GAGGATATGT GTAAAAAGAA CAATATTGAT CCCGAAAAGA TAGAGATTAA AAAGATTGGA CTGAGACCCG GAGAAAAGAT GTACGAAGAA CTTATGTCGG AAGATGAATC GACAAAAGCC GTCGAACTGA ATGATATGTA TGTAATAATG CCACCTTTTG AAAATAGTTA CTATATAAAT GCCAGAAAGG CGGTTATAGG CAATTACAGT TCACAAAAAG AAAAAACCCT GACCAAGTCA CAAATAAGGG AACTGCTTGT CAGGGAAAAA TTAATTTGA
|
Protein sequence | MFKDKKILVI GGTGSIGQAL IERILTENPA VIRVYSRDEY KQFLLAEKFR ENANIRYLIG DVRDEDRLDR AMNDIDIVFN LAALKHVPAC EYNPFEAVRT NVIGSQNVIA CAMANKVKKV IYTSSDKAVS PTNTMGATKL LAERLMSSSN YSRGNTGTVF ASVRFGNVMG TRGSVIPLFK QQVLEKGYIT VTEPEMTRFM MSLSQAVELT IKACSIARGG EVFVLKMPVI RLKDLAEVII EDMCKKNNID PEKIEIKKIG LRPGEKMYEE LMSEDESTKA VELNDMYVIM PPFENSYYIN ARKAVIGNYS SQKEKTLTKS QIRELLVREK LI
|
| |