Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3114 |
Symbol | |
ID | 4809677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3674971 |
End bp | 3676146 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640108547 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_001039502 |
Protein GI | 125975592 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTCGA ACAGAATTCG GAATGTAGCG TTTCTGAGTA CTTATCCGCC AAGGGAGTGC GGACTTGCAA CTTTTACCGA TGATTTGGTA AGGGAGCTGG ACAAGGTTGA ACTTATAAAC AACCCAAAAG TTATTGCTGT AAGTGATAAT GATTACAGTT ACGGCAGCAG GGTGATTATG GAGCTTAAGC AGCACGAAAG GGAAAGCTAT ACAAAGATTG CTGAAGAGAT TAACAACTCG GATATTGAAC TTCTTGTTAT AGAGCATGAG TACGGTATAT TCGGAGGAGA AGACGGGGAA TATATTCTGG ATCTTGCGGA AAAAATTCAA ATTCCCTTTA TTCTCACGGT GCATACCGTA CTTCCCAGTC CCAAGGAAAA ACAAAAGAAA ATACTTGAAG TGCTGGGAGA AAAGAGCGCA AGGGTAGTTA CCATGGCTAA AAATACGATA CCTATACTTG AAAAAGTATA TGGTATTGAC CCGGCAAAGA TTGAAGTAAT ACACCATGGT GTACCGTATA AAATTCTTGA ACCCAGAGAA AAGCTAAAGA AAAAATTCGG GCTTGAAAAC CGCACTGTAA TAAGTACTTT TGGGCTGATA AGTCCGGGCA AAGGTTTGGA ATACGGAATT GAAGCTGTTG CAAAGCTGGC AAAGAAGTAC AAAGATATTG TTTACCTGAT TCTTGGACAG ACACATCCTT GTGTAAAAAG GGAGTTTGGC GAGGTTTACA GGGAAAAACT TGTGCAAATG GTTGAAGAAC TTGGTGTAAA AGAGCATGTA TGGTTTGTAG ACAAATATCT TACCAGGGAT GAAATTATGA ACTATTTGCA GCTATCGGAT ATCTACATGA CGCCGTATCT CGGAAAAGAC CAGGCGGTAA GCGGTACTTT GGCTTATGCG GTAGGATACG GCAGAGTAAT TATATCTACT CCGTACAGCT ATGCCAAGGA AATGCTCGCA GAGGGAAGAG GACTTTTGGC AGAGTTTGAG GATGCAGATT CTTTGGCAAA ACATATTGAA TATGTTCTGG ACAATCCCGA GGCAAAGAAA GAGATGGAGA GGCGAACATT AAGTCTTGGA AGAACCATGA TGTGGGAAAA TGTGGCAAGT TGCTATTCCA GGCTTTTTAT CGACACTCTT GAAGAAACAA AGCTCTCGGG GAGTATGATA GGATGA
|
Protein sequence | MASNRIRNVA FLSTYPPREC GLATFTDDLV RELDKVELIN NPKVIAVSDN DYSYGSRVIM ELKQHERESY TKIAEEINNS DIELLVIEHE YGIFGGEDGE YILDLAEKIQ IPFILTVHTV LPSPKEKQKK ILEVLGEKSA RVVTMAKNTI PILEKVYGID PAKIEVIHHG VPYKILEPRE KLKKKFGLEN RTVISTFGLI SPGKGLEYGI EAVAKLAKKY KDIVYLILGQ THPCVKREFG EVYREKLVQM VEELGVKEHV WFVDKYLTRD EIMNYLQLSD IYMTPYLGKD QAVSGTLAYA VGYGRVIIST PYSYAKEMLA EGRGLLAEFE DADSLAKHIE YVLDNPEAKK EMERRTLSLG RTMMWENVAS CYSRLFIDTL EETKLSGSMI G
|
| |