Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3171 |
Symbol | |
ID | 4809621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3746814 |
End bp | 3749261 |
Gene Length | 2448 bp |
Protein Length | 815 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640108604 |
Product | S-layer-like domain-containing protein |
Protein accession | YP_001039559 |
Protein GI | 125975649 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1361] S-layer domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.412777 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAAA TTAAGATTGT GAATATTCTG ATACTTTTGG TTCTTTTAAT CAGTTTTAAT TTGGAGACAG TCAATGTGTT TGCCGGTCCA AATGATATTA TAATTACAAA GGTAGTGCAA AAAGAGGAGA ATGTTTCAGC CGGTAAAAGT TTTAAACTCG AGGTTTACTA TAAAAATGTG TTGGGTGTAC CTTTGAAAGA TGTTTATATT TCCGTAGATA AAAGTTCTTC TTTTTATATT GATAATGATC ATTATCAAAC CGAGTACCTG AAAGACATGG CTGTTGGGGA CGGAGAAGAA CCTATAATAT TATATCTGGT CTATAAAGGT ACAGGAAACG AATTAACTTT GATATTTGAT TATTTAAAAG AAGGTGCAAC CGATCGAGAA CAACTTTCAC AAACCCTGTT TCTTAGCGTT AAAAAAGAAA AAGAACAAAC TTCCGGTGGT TCACAAACCA ACACGGCCGA ATACAAGCCG AATTTCAGAA TTGTGGGAAA GATAAGCAGC AAACAGGAAG GAAAAAATGT TTCTGTTGAA TTTCCCATTA AAAATGTGTC CAATTTTACC GCCAAAGACA TTCAAATAAC CATGTCAGCC GATTCGGCGG ACTCACCTTT TTCGGCGCCG ATGGGGCATC TTTCCGTATC CGTTGACGAG ATTAAACCCG ATGCCGAAAA AAAGATAAAG CTTGACCTTG CTGTAAAACC GAATACCAAA AGCGGTATAT ATCCCTTGAA ACTTGAGTTT AAATACGGCA ATTTGTATGG TGATTCATTC TCTTCATCGG AAGTTATATA TGTTGACATT GAAAACAATG ATAAAAGCCC AAGCCTCATT TTAAAAGGTG TGGAAATGCT TCCCCAAAAA CCTGCACCGG GGGACAGGTT CAGTGCTTCC ATAGAACTTG AAAACCTCGG AACTCTTGGA GCAAAAGACG TAAAGGTCAC CTTGAAGGGA TTAACGGTTG ACGGGATTTA TTCTGAGCTT GTGGGAGTCA ATTATTTGAA AACCATTGAG GGAGGCAGGA CGGGCAAGCT GAATTTCAGC CTTGTTGCGT CCAATAAAAT AAATGTTCAA AGTTTTCCGC TTGAAATAGC CGTTGACTAT AAAGACGAAT TTGGCAATTC ATATGCCGAA AGCTTTATAT ATTATGTGCC CATAAAGCAA AAAAGCGAAG GAAAAGCTTC ATTGAAGATT GACAACATTA CTTCTCCGGC GACAGTTGTT GCACCGGATG AGGATTTTAA AGTTGGCTTT GATATTGTGA ATGACGGAAC AAAAGAACTT TCCGACTTAA AGGTGTCGGT AACGGCTGAA AACGGCATTA TCTGCAAATC ACAAAGCATA ACTGTTGTTG ATTCCTTGAA AGTTGGGGAA AAGAAAAGCT TTGAGTTTCT TTTCACTGCC TTGACCGATG CAGTCACTAA AAACTACCCT ATTGCCATAA ATGTGGAATA CGATGATGAA GGTTCTTCGG GAGGAACAAA AGAAAAGCGG ACTGTTACAC AATATGTTGG AGTTTATGTT GAAAATCCAA AGGAAGAGGA GAAAAAAGAA AATACTTCCA CTCCGAGGCT TATAATTGAC CGGTACAGTA TATCCACGGG GCAGGCGATA GCGGGAAAGA GCTTTGAAAT TGAGCTTGGC ATATTGAACA CCCATAAAAA TATGAATGTT GAAAATATTG CTGTTTCTTT CCTTGCGGAT GAAGGAGTGT TTTTGCCGGC GGCGGAAAGC GGCAGCACCA TATTCATTGA CCAAATAAAA GCCGGAGAAA GAGTTGTTAA AAAGATGACT TTTGCGACCA AATATGATGC TGTGCCAAAG AGTTATTTGC TCAATATAAA CTTTGAATAC GAAGACGAAC AGAACAAGGC ATATACCTTG AAAGAAAGCA TCAGCATACC TGTTATTCAG GAACAGAGGC TTGAGATAAG TGAAATACAG ACGGGAATGG ATGCAGTTGT GGGACAGCCG GTTTCTGTAA ATTTGAATTT TTATAACATG GGCAAGTCAA CATTGAACAA TCTTATGGTA AGGTGCAAGG GAGATTTTGA GCTGCAGCCC AGTTCAGAAT ATTTTGCGGG TAATTTTGAA CCCGGCAGAA GCGACTATTA TGAAGCATAT ATTGTACCCA ACAAGGAAGG ACAGGTAAAG GGAAGCATTA TTTTCACATT TGAAGATAAC AACGGCGAGG TTAAAGAGAT TGAGAAAGAG TTCGAGATTT TTGTCCAGGG TCAGCCTTCA GTAATGAAAG GTGATGTTAC GATAGTGGAG CCCGGCATGG CGGAAGCGGG AATGAAGTTT GGAAAAGCAG GTTTTCCCGT GCGCAGACTG TTAATCCTTG CAGGTGTTTC AGTGCCGGTG ATAGCAGGTG TTGTGGTTCT CATAATAATT CTGGCAAAAA GAAAGAAAGC GAGAGCTGAT TTGTATGAGA ATATCTGA
|
Protein sequence | MSKIKIVNIL ILLVLLISFN LETVNVFAGP NDIIITKVVQ KEENVSAGKS FKLEVYYKNV LGVPLKDVYI SVDKSSSFYI DNDHYQTEYL KDMAVGDGEE PIILYLVYKG TGNELTLIFD YLKEGATDRE QLSQTLFLSV KKEKEQTSGG SQTNTAEYKP NFRIVGKISS KQEGKNVSVE FPIKNVSNFT AKDIQITMSA DSADSPFSAP MGHLSVSVDE IKPDAEKKIK LDLAVKPNTK SGIYPLKLEF KYGNLYGDSF SSSEVIYVDI ENNDKSPSLI LKGVEMLPQK PAPGDRFSAS IELENLGTLG AKDVKVTLKG LTVDGIYSEL VGVNYLKTIE GGRTGKLNFS LVASNKINVQ SFPLEIAVDY KDEFGNSYAE SFIYYVPIKQ KSEGKASLKI DNITSPATVV APDEDFKVGF DIVNDGTKEL SDLKVSVTAE NGIICKSQSI TVVDSLKVGE KKSFEFLFTA LTDAVTKNYP IAINVEYDDE GSSGGTKEKR TVTQYVGVYV ENPKEEEKKE NTSTPRLIID RYSISTGQAI AGKSFEIELG ILNTHKNMNV ENIAVSFLAD EGVFLPAAES GSTIFIDQIK AGERVVKKMT FATKYDAVPK SYLLNINFEY EDEQNKAYTL KESISIPVIQ EQRLEISEIQ TGMDAVVGQP VSVNLNFYNM GKSTLNNLMV RCKGDFELQP SSEYFAGNFE PGRSDYYEAY IVPNKEGQVK GSIIFTFEDN NGEVKEIEKE FEIFVQGQPS VMKGDVTIVE PGMAEAGMKF GKAGFPVRRL LILAGVSVPV IAGVVVLIII LAKRKKARAD LYENI
|
| |