Gene Information |
Locus tag | Cthe_1085 |
Symbol | |
ID | 4811383 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1291979 |
End bp | 1293271 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106507 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_001037510 |
Protein GI | 125973600 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
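
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported 1293 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1291979, 1293271)
print(length, protein_length(length))  # 1293 430
```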
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATCG CCTTGATATG CACCGAGAAA CTTCCCGTTC CTCCGGTCGC GGGAGGCGCG GTTCAGCTAT ATATAAGCGA AATATTGCCA TATTTAAAAG AACGTCACAA TATAACAGTT TTTTCTAAAA TTCATCCCGG ACTTTCCCCG GATGAAGTGG TCGACAATGT AAGATATATC CGGGTACCGG CCGCCAATGC TTCGAAGTAT GTAAAGAATG TAAAAGACCT ACTGGATGAG AGTTTTGAAC TGATACACAT TTTCAACCGC CCGAAGTGGG TACTGGACTT TAGTGAAAAA CTTCCTTCGG CAAAGTTCAG CCTTAGTCTT CACAATGAAA TGTTTTTGCC GGATAAAATA CCTTATGAAA AGGCTGTTGA ATGTATAAAC AGAGTTGAAT TCATAAACAC GGTGAGCAAG TTCATAGCCG ACGGTGTAAA ACAGCTTTAT CCCATGGCAG AAGACAAGTT AAGAGTGGTT TACTCGGGAG TCAACATTGA AAAATACAAG CCCAACTGGT CACCGGAAGG AATTTGCAAC AAAGAGCTTC TGAAAAAGAA GCTTGGAATA GAAAACAAGC GGGTAATACT TCATGTCAGC AGATTAAGTC CAAAAAAAGG TACCCATATA GTTCTGTCTG CCATGAAAAA AGTTATGGAC TGTTTTGATG ATGTTGCTTT GGTAATAATC GGGAGCAAAT GGTACGGTAA AAATGAAGAA GATGATTATA CAAAGCAATG CAAGGCCCTT GCAGAACAAT TAAGCGGTCC GGTTGTTTTT ACAGGCTTTA TTCCTCCGTC TGAAATTCCG CCTTATTATA ACGTGGGTGA TATATTTGTA TGTGCATCCC AGTGGAATGA GCCCCTGGCA AGGATACATT ATGAGGCAAT GGCTGCGGGC CTCCCCATTA TTACAACCGA CCGGGGCGGA AATGCAGAAA TATTCGAAGA CAATGTCAAC GGCATTATAA TAAAGGACTA CAAAAATCCG GACTCTTTTG CCGACAATAT AATCTATCTT CTGAACAATC CTCATACAGC CCTGGAAATG GGCAAAAAAG CTTTTGAGTC CGCACTCTCC AGATTCACCT GGAAGAAAGT GGCGGATGAG GTTTTGGCTC CAATCCAAAA CTTCGACCAA AGAATAACTG TTAATGACAA TAAAACACAG AGTGGCCTGG CAAAAGAAAA TATAATTGAA GAAGATATAA TGAAAGAAAA TAACGATGAA AAAGCAACGG AAGAAAAAAG CACAGAAGAG ATTGAAACTT TTTTCGACGA CACAAATTTT TAA
|
Protein sequence | MKIALICTEK LPVPPVAGGA VQLYISEILP YLKERHNITV FSKIHPGLSP DEVVDNVRYI RVPAANASKY VKNVKDLLDE SFELIHIFNR PKWVLDFSEK LPSAKFSLSL HNEMFLPDKI PYEKAVECIN RVEFINTVSK FIADGVKQLY PMAEDKLRVV YSGVNIEKYK PNWSPEGICN KELLKKKLGI ENKRVILHVS RLSPKKGTHI VLSAMKKVMD CFDDVALVII GSKWYGKNEE DDYTKQCKAL AEQLSGPVVF TGFIPPSEIP PYYNVGDIFV CASQWNEPLA RIHYEAMAAG LPIITTDRGG NAEIFEDNVN GIIIKDYKNP DSFADNIIYL LNNPHTALEM GKKAFESALS RFTWKKVADE VLAPIQNFDQ RITVNDNKTQ SGLAKENIIE EDIMKENNDE KATEEKSTEE IETFFDDTNF
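
The sequences above are printed in space-separated blocks of ten characters. A hedged sketch of how such a field could be cleaned, checked for GC content, and translated: it assumes the codon assignments of translation table 11 match the standard genetic code (the two differ in the permitted start codons, which does not matter here because the gene begins with ATG). The helper names are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGAAAATCG CCTTGATATG CACCGAGAAA"))  # MKIALICTEK
```

Passing the full gene sequence field to `translate` and `gc_percent` should allow a comparison against the protein sequence and the GC content listed in this record.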
|
| |