Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2155 |
Symbol | |
ID | 4811203 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2562995 |
End bp | 2564890 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640107559 |
Product | S-layer-like domain-containing protein |
Protein accession | YP_001038551 |
Protein GI | 125974641 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAAAG GTTCCGGTAC CTTGTCCGTT TCGATTAATG AGTATAGTTA TTCTGTCGGA AATAATGTGA GAATAGTGTG CTATAATGAC ATTGCTTTCG AAAAAGGGGA TGTATTTACG GTTACCGTGC CGGCGTTCCG TTCAGATGAA TTTTCGGACG CGGCATTAAA TGGTTCTAAA GTTGAATATT TGCTGGTTGA TAAAAAGGGA AATGTAATTG CTCCGGATAT TTCTCTTTCA GGTGCTGACG CTGAAAATGC ATATTTTGAC ATTAATGTAT ACGGCGATGG TTACGGTTTT GTTTTTGGAA GTGAAAGAAT AAGTTTGGGC GATTATGCTG AAATCACTGC CGTTCTTTAC CAAGGATATG AATTTGAAGG TTGGTATGTG GAAGGTAAAA AAATATCATC GGAAGCTAAT TATCGCTTCA GACCGGAAAA AGACATGGAT ATATATGCTA AATTTATGCC TAATGATATT TTCACACAAA ATGTCAAGGT AAAGGTAAAA ACTGATAAAA ACAAATATGC TGTAGGTGAA ACGGTAAAAG TAGATGGAAT TATAACAAAT ATGCATCCCA GCGAAACGAT GAAAGATGTT CAGGTAATTA CTGTAGTTTA TGATGAAGTT GAACATAAAA GATGGGAAAA TAATAAATCC GGTTTGACTT TGGCACAGGG CGAATCAAAA ATTGAGAGTG CTATCTGGTC AACAGAAAAT GCAAAGCCGG GAAAGTATAA AGTTGAGGTC GCAGCTTATA TATCAAAAGG CAACACACCC GTTTTTATTT TTGATGAAGT TGAATTTGAA ATAGTTGAAG AGAAAACAGC CCAAACAACA CCTGATCCTA CCGTGACTTC GACTCCTGAA AATGACGAAG AGTCTGAAGA CAGTGAAGAA AGACAGATGC AAAAAGACAT TTTTATCCTT GTGACTACGG ATAAGAGAAT ATATTCGGGA AATGACGTGA TTACATATAA AGTTGAATAT TACAATTTAA CGGATTCACA AACCGGCGAA TTTGAACTTT CCACCCAGGT TCCTACATAT ACGGAAGTAT TGGAAGCCGG TCGCGGAGTA GTCGAAGGCA ATACAATAAA GTGGAAAATA TCGAATTTGA ATAAAAATGG GTCCGGTGAA ATAACGTATA AAGTAAAGAT TAATGAGATA CCCAACTCTG AAGTAAAAAT AAAGAACAAA TTTGAAGTAA AAGATGAAAA GCTGATTAAT CCTAAAAATG CATTATCAAC CATTGAAGGC ATGGCCAGGA CGGGAAGGCA TGGAAACATA ATTCATAAAT CCTATATAAA TGGATATCCG GATAATTCAT TCAAACCTGA CAATCACATA ACAAGAGCAG AAATAGCAAC CATATTGGCA AATGTTATGG GACTTTCCGT TCCGGACAAT ATAGAAGAAA CAGGGTTCAA AGATGTGGAT AAAAACCATT GGGCGGCAAG ATATATAAAA GCTGTAACCG ATGCCGGTAT TTTCAAAGGA TATGAAGATT CAACTTTCAG ACCGAATGAG ACCATAAGCA GGGCGGAACT GGCAACGGTT ATATTTAAGT GTCTGGACTT GGATGAGAAA AAGGCCGTAA AATCAAGCTT CACAGATACA AAGGGACATT GGGCGTCGGA TTTCATTGAA GAAGTCAGCC GAAACAAGAT AATCAGAGGA TATGAAGACG GAAGTTTTAA ACCAAACGGC AAGGTTACCC GGGCGGAAGC GGTTGTTATG ATTAATAATA TGTTGTACAG AGACCCAATA AATGTAGACT CAAGTAAATT TACGGATTTG GATAAGAAAC ATTGGGCCTT TGGACATATA GAGGCTGCGG CCGGAGATTA TGAGTTTAAA ATAGATGAGG ACGGCGGCAA AGTACTTGTT AATTGA
|
Protein sequence | MAKGSGTLSV SINEYSYSVG NNVRIVCYND IAFEKGDVFT VTVPAFRSDE FSDAALNGSK VEYLLVDKKG NVIAPDISLS GADAENAYFD INVYGDGYGF VFGSERISLG DYAEITAVLY QGYEFEGWYV EGKKISSEAN YRFRPEKDMD IYAKFMPNDI FTQNVKVKVK TDKNKYAVGE TVKVDGIITN MHPSETMKDV QVITVVYDEV EHKRWENNKS GLTLAQGESK IESAIWSTEN AKPGKYKVEV AAYISKGNTP VFIFDEVEFE IVEEKTAQTT PDPTVTSTPE NDEESEDSEE RQMQKDIFIL VTTDKRIYSG NDVITYKVEY YNLTDSQTGE FELSTQVPTY TEVLEAGRGV VEGNTIKWKI SNLNKNGSGE ITYKVKINEI PNSEVKIKNK FEVKDEKLIN PKNALSTIEG MARTGRHGNI IHKSYINGYP DNSFKPDNHI TRAEIATILA NVMGLSVPDN IEETGFKDVD KNHWAARYIK AVTDAGIFKG YEDSTFRPNE TISRAELATV IFKCLDLDEK KAVKSSFTDT KGHWASDFIE EVSRNKIIRG YEDGSFKPNG KVTRAEAVVM INNMLYRDPI NVDSSKFTDL DKKHWAFGHI EAAAGDYEFK IDEDGGKVLV N
|
| |