Gene Information |
Locus tag | Cthe_2341 |
Symbol | |
ID | 4808975 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2791172 |
End bp | 2792338 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640107748 |
Product | glycosyl transferase family protein |
Protein accession | YP_001038736 |
Protein GI | 125974826 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
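The coordinate and length fields above are mutually consistent, which the short check below illustrates. It is a minimal sketch that assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2791172
end_bp = 2792338

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1167
print(protein_length)  # 388
```

Under these assumptions the results match the listed Gene Length (1167 bp) and Protein Length (388 aa).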
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGATAT TCATTAAAGT TTTATTTTAT GTAAGCGGAT TTATCATATT TTGGGCCATG ATAGGATATC CCGTATCACT TAAATTAATT GGAAAATGCT ATAAATCGCG TAAGTTGGAA AAAGACTATA ATCACCAGCC TACTGTAACG GTTATGGTTG TTGCACATAA CGAGGAAAAA GTTATACTGG AAAAACTGAA TAATATTCTG GAACTGGACT ATCCTCAAGA CAAAATTGAG ATTCTGGTTG CTTCCGACAA CAGTACAGAT CAGACCAACA ATATTGTAAA AGAATTTATT AAAAAGCATC CCGAGCGAAA AATCAGGCTC TATGAGGTTA AAGCCCGGAA GGGAAAAACA AATGCGCAAA ATGAAGCTCA AAAGACTGTA ACAACGGAAT ACCTGGTTAT GACGGATGCC AACTCAATGC TTGACAGAAA TGCGGTAAAA GAATTAATGG CGGCGTTTAC ATCGGATGAT ATTGCGTATG TTTGCGGAAG GCTATCAATT GTGAATCGGG AAGCCAGCGA TGTCAGCAGT GCGGAGGCCG GTTACTGGGA CAGTGACCTT GCAACCCGTG AAATTGAAGG AAGAATTCAG ACAATAACGG CCGGAAACGG TGCTCTGTAT GCTTGCAGAA ACAGCGAATA TCATGATTTT GATCATATAC AATGCCATGA TGCTGCAATG CCCCTATATT ATGCGTTAAA AGGAAAAAGG GCCATATGCA ACCACGATGC TGTGGCATAT GAAAAAGCGG GAGAAGTAAT AGAGGATGAA TTTAAAAGAA AAGTACGTAT GAATCGTACG ATATTAATGG CTATTTTGCC TGATATAAGG ATACTTAATG TTTTTAAATA CAAGTGGTTC TCATACTTTT ATTTCGGACA CAGGACATGC AGGTATCTGT TATGGATAGC ACATTTAATT GTGCTGCTTT CCAATGCTTT ATTGCTGGCA AATTCAAAAT TTTATTTATT AACTTTTACC GGACAGTTGC TTTTTTATTT GATTAGCCTG ATAGGAACTG TCACCAGGAC AAAAAATAAA TATGTATCTC TTATTTATTA TTATACAGTG ACGATAATTG CCCAGTGGTT TGGAGTTTAT AATATTGTAA CCGGGAGGGC AAAACCCTTC TGGGAGAAAG CGGAGAGCAC AAGATAG
|
Protein sequence | MGIFIKVLFY VSGFIIFWAM IGYPVSLKLI GKCYKSRKLE KDYNHQPTVT VMVVAHNEEK VILEKLNNIL ELDYPQDKIE ILVASDNSTD QTNNIVKEFI KKHPERKIRL YEVKARKGKT NAQNEAQKTV TTEYLVMTDA NSMLDRNAVK ELMAAFTSDD IAYVCGRLSI VNREASDVSS AEAGYWDSDL ATREIEGRIQ TITAGNGALY ACRNSEYHDF DHIQCHDAAM PLYYALKGKR AICNHDAVAY EKAGEVIEDE FKRKVRMNRT ILMAILPDIR ILNVFKYKWF SYFYFGHRTC RYLLWIAHLI VLLSNALLLA NSKFYLLTFT GQLLFYLISL IGTVTRTKNK YVSLIYYYTV TIIAQWFGVY NIVTGRAKPF WEKAESTR
|
| |
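The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is an illustrative example, not the database's own method. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TAG even though the gene lies on the minus strand), and it relies on translation table 11 assigning the same amino acids to codons as the standard genetic code. The alternative start codons that table 11 permits are not handled, which does not matter here because the gene starts with ATG. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11
# uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence from the record.
prefix = "ATGGGGATAT TCATTAAAGT TTTATTTTAT"
print(translate(prefix))    # MGIFIKVLFY
print(gc_fraction(prefix))  # 0.2
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same functions would allow the complete protein sequence and the reported GC content to be compared in the same way.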