Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0876 |
Symbol | |
ID | 4810494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1050392 |
End bp | 1051684 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640106292 |
Product | glycosyl transferase family protein |
Protein accession | YP_001037303 |
Protein GI | 125973393 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000359561 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTCAA GGTTTATAGA CTACATCAAA AAAAACAAAT ATCTTGTCAT AATATTTATA GGGGTAATAC TCAGGCTCGT ATGGATTTTT GCCATGCCCA CGTATCCCGA AACCGACTTT ATGTGGTATC ATGTAAAGGG AAAGGAAATT GCTGAAGGAA AAGGATTTTT AAACGGAATT TATCCCTATT ACACCGGAAG GCCGGGATTT CCCACAGCTT TCAGACCCAT AGGCTATCCC GGGACTTTGG CAATCCTCTA TTTCATATTC GGTGCCCACT TTATAGTGGC AAAACTGTTT AATATTGTGC TTTCCACTCT TATCATGTTT TTGACTTACA AGCTTGCCGA CAAGTTTTTC GGTAACAAAA TTGCTCTTTT GACTTTGCTC CTTTATGCCT TGTCGCCCTT AGCAATAGCA TACAACAGTA TCATATGCTC GGAAATTCTG TTTTCTGCGC TGTTAATGTT GTCTGTTTAT CTGTTTTTCA ATAAAAACAA TCCCCTTCTC ATTGGGCTTC TAATTGGTTA TTTAACTCTT GTAAGACCCA TAGGAGTGTT TATTCCGTCA ATATTTGTAC TGTATGAATT TATCCGAAAA GACGTGGGAC TTAAGCATAA AATCAAATAT GTTGCAGTTT TTGCAGTAGC AGTGGGATTG GTAATTGCTC CATGGATAAT AAGAAATTAC ATTGTTTTCG GCGAACCAAT CTTCTCCACC AACGGCGGCT ATGTTTTCTA CGTAAATAAC AACGACTATG CAACCGGTTC GTGGAGTGAC CCCTTCAAGT ACCCCGACAG TCCGATGCTG AAGTACAAAA CGGAAGACGG ATTTGATGAA TTGGCAATTC ACAAACTTGG CAAACAACTT GCTAGAGAAT GGATAAAGAA AAATCCCAAA AGATTCATTG AGCTGGCATT TCTCCGTATT GCCAATTCAT ACTGGTTCAA AACCGAAGAT ATAATGTGGG CGTTTACCAT AGGCATCAAC CAGTGGCACC CTGTAACTTC TAAGGCTGTC AAGCTTCAAA AACTTTTATA CCGGCCTTTT TACATCGTAC TTTTCATATT CATTATATAT GCTTTGATAA GGTTTATACG ACAGAGAAAA ATTGACTTTA CCACATTCAT CCTTCTTATA TTTCTCTACT TTAATGCAAT GATGTTTGTG CTGGAGGGAA ACTCAAGATA TGTTTTTCCT CTTCACCCGA TTTACACGAT AGGCGTATCC TTTGTTATAT ACAATGTACT TAAAAAGCTG CTGCCTGAAC GTTTTTCAGC AGTTTTGTCT TAA
|
Protein sequence | MISRFIDYIK KNKYLVIIFI GVILRLVWIF AMPTYPETDF MWYHVKGKEI AEGKGFLNGI YPYYTGRPGF PTAFRPIGYP GTLAILYFIF GAHFIVAKLF NIVLSTLIMF LTYKLADKFF GNKIALLTLL LYALSPLAIA YNSIICSEIL FSALLMLSVY LFFNKNNPLL IGLLIGYLTL VRPIGVFIPS IFVLYEFIRK DVGLKHKIKY VAVFAVAVGL VIAPWIIRNY IVFGEPIFST NGGYVFYVNN NDYATGSWSD PFKYPDSPML KYKTEDGFDE LAIHKLGKQL AREWIKKNPK RFIELAFLRI ANSYWFKTED IMWAFTIGIN QWHPVTSKAV KLQKLLYRPF YIVLFIFIIY ALIRFIRQRK IDFTTFILLI FLYFNAMMFV LEGNSRYVFP LHPIYTIGVS FVIYNVLKKL LPERFSAVLS
|
| |