Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0327 |
Symbol | |
ID | 4808476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 413675 |
End bp | 416248 |
Gene Length | 2574 bp |
Protein Length | 857 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640105741 |
Product | S-layer-like domain-containing protein |
Protein accession | YP_001036758 |
Protein GI | 125972848 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAA TTTGGGGCAG AAAAGTTTTG GGATTTGCGG TTGGTGTATG CTTACTTTTA ACTATGATTT GCCAAAATGT GGCTTTTGTA TCGGCAGAAC CTGAGGAAAA AACTGTTGTC GTAAATTTCG AAGAAAGCTT GAATGAAACT ATGTCAAAAA CCATTGAAAT ACTAAATCTT TTTGACATTT CCGAGATTGT TGTCGATAGC GGAAAAGTGT CTTACAGCAG AGAAGGAGAC AAAGTAACGG TTACTGTTTC GGAAGGCGTT TATAGAGTAG GCCCGCATAC ACAAAATGTT ACTTTAACTA TAGAGGATGA TAAAGGTATT TTTGATGAGA AAACATCCTA TAATGTTGAC GGCTATGAAG GAATCCTTAC GAAAATAGAT TCTGGTTATG ATGAGGCAAA AGGACTGTAC TGGGTAAAAT ATCAAGGTAG TGTAACTAAA GAAAACTGCA AATTTTACCA ATATACAGTA ACAATCAAAT ATACGGAAAA TGTTCGTCCA GAAGTGTATC TTACTGAGCC GAAACGAGGC ATGATTACTA ATGGTAAGAT TAATGTTCAG GGTTATGTCA GAGATGAAAA CATAGGTGAT GAGCTGAAAT TGTTCTTTAG CTTTGACAGT TACGATGAAA GCATGACAGG AAATCTTCTG AATGAGAATT CCATTATATC CGACGGAACC TGGCAGGAGA TAAGCGGAAC TATAGATTTA TCACCTTTCA ATCTTAAAGA CGGCGATCAC GACTTTTATT TTTGGGCAGT GGATAAAAGG GGAGTAAGAT CGGTTGGCGA GATCATAAGG TTTACGCTTG ATACAGTACC GCCTGAAGCG CCTGTTCTTA CTCCGGATAA AACAGAGTCT ACGAATCAGA GTGTTGTAGT GTCAGTATAT TATCCACCGG ATGCTGTAGG CAGGGAGATA AAGATAAATG ACGGACCTTG GATGCCGATA ACCGATATAA CCAAAAATGA TCAGATAATA ATGGATGAGA ACGGCAAGAT AGAGGCAAGG GCAATTGATG AAGCGGGAAA TATTTCAGAG GTGGCGGAAC TTGAGATAAA GAATATTGAC AAAATTCCTC CGACAGCACC GACAATCAAT ACAAGTGCTG ATGAAACTAC AGAGCAACCG ATTAAGGCGA CGATAGTGCC GGGAGTTGAT AATGAGTCAG GTGTGGATCG AACCGAATAT TGTTTAAGAG GAGCAAGTAC AAAAGATTGG GAGAAATACG ATGAAGGAAC CGAAATAACA ATAACTGCGT TGGGAGAAAC AGAAATTTGT GCAAGAACAA TTGACAATGC CGGAAACATC TCCGCTGAAA CAGTTAAGAA AGTTACAATA AAGAAGAAAG AGGACAGTGG CGGTAACAAC GGTGGAAGTG GCGGCACAGG CGGAAACAGC GGTAACAACG GTGGCAGCGG TGGCACAGGC GGAAGCGGTA GTAGCGGTGG AAGCGGAAGT AACAGTGGTG GCGGAAATAA TGACGGAAAT GGCAACGACG GGAAAAAAGA CGATGAAATA CTGCAGCCCG AACCCAATAT TCCAGGCGCA GGAGGCAGTC CTGTGGATTT GTCCGTGTTT ATAAGTGCGG ATAAATCAAA ATATGAAGAA GGTGAAGTAA TTACTTTCAA TATTACATAC AAAAACAAAA CCAATGTTCA GGCAAACAAC GTTATTGTGA AAGCAGGAAT ACCGGCAAAC ACAACTGTTG AGGATATAGC CGGAGGTACT CAAAATGGAA ATGACATTGA ATGGAAAATT GAATCGCTTA AAGCAAACTC TTCAGGCAAG ATTCAATACA AAGTCAAGGT GAATTTGCTT GAGGTGCCGG AAATAAGTTC TTCTGCTACT GCTTCAATAA CTGCAAGTGG AACTCTTATT AACAAGGATG ACGATGAATC AAGAACTATA TTCCTTCTTT ATTCGAACCG TTTTGGTGAA AACTTTCACG GCAAATATAT TACAGGCTAT GAGGACAATA CATTCAGACC GTTGAATAAT ATAACAAGAG CTGAAGTGGC AACAATTATG ACCAACATTT TGGGATTGAA GCAGGAGGTT GCAGGAGGCA AAACATATAC AGATTTGTCA AAGAGCCATT GGGCATATAA CAATATAATT GCGGTAACCG AAAAAGGTTT GTTCACAGGA TATGAAGACG GTTCGTTCCG TCCGGACAAC TTTATCACAA GGGCGGAATT TGCTACGGTG CTGGCTAATT ATTTGGGACT TAAGAATGTT GAGCATGATG AGTTGAACTT TGCGGATATC GAAAATCACT GGGCTAAGAA CTTTATAGAG GAAATATACA GAGTAAGATT GATAGAAGGT TATCTGGAAA ATGGCTTAAG ACTGTTTAAG CCTGACAACT ACATAACCAG AAGTGAAGCG GTGACAATAA TAAACAAGAT GCTGTTCAGA GGTCCGCTTG AAGGAGCAAA GGTGCCGTTT ACCGATGTTG AGGAAGGATA CTGGGCTTAC GGACATATAT TGGAAAGCTC TATAGATCAT TACTACGTAA GAAATAAAGA TCAGAGCGAA ACAATAGTAA ACAAGAAACA GTAA
|
Protein sequence | MKKIWGRKVL GFAVGVCLLL TMICQNVAFV SAEPEEKTVV VNFEESLNET MSKTIEILNL FDISEIVVDS GKVSYSREGD KVTVTVSEGV YRVGPHTQNV TLTIEDDKGI FDEKTSYNVD GYEGILTKID SGYDEAKGLY WVKYQGSVTK ENCKFYQYTV TIKYTENVRP EVYLTEPKRG MITNGKINVQ GYVRDENIGD ELKLFFSFDS YDESMTGNLL NENSIISDGT WQEISGTIDL SPFNLKDGDH DFYFWAVDKR GVRSVGEIIR FTLDTVPPEA PVLTPDKTES TNQSVVVSVY YPPDAVGREI KINDGPWMPI TDITKNDQII MDENGKIEAR AIDEAGNISE VAELEIKNID KIPPTAPTIN TSADETTEQP IKATIVPGVD NESGVDRTEY CLRGASTKDW EKYDEGTEIT ITALGETEIC ARTIDNAGNI SAETVKKVTI KKKEDSGGNN GGSGGTGGNS GNNGGSGGTG GSGSSGGSGS NSGGGNNDGN GNDGKKDDEI LQPEPNIPGA GGSPVDLSVF ISADKSKYEE GEVITFNITY KNKTNVQANN VIVKAGIPAN TTVEDIAGGT QNGNDIEWKI ESLKANSSGK IQYKVKVNLL EVPEISSSAT ASITASGTLI NKDDDESRTI FLLYSNRFGE NFHGKYITGY EDNTFRPLNN ITRAEVATIM TNILGLKQEV AGGKTYTDLS KSHWAYNNII AVTEKGLFTG YEDGSFRPDN FITRAEFATV LANYLGLKNV EHDELNFADI ENHWAKNFIE EIYRVRLIEG YLENGLRLFK PDNYITRSEA VTIINKMLFR GPLEGAKVPF TDVEEGYWAY GHILESSIDH YYVRNKDQSE TIVNKKQ
|
| |