Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2877 |
Symbol | |
ID | 4809157 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3401397 |
End bp | 3403154 |
Gene Length | 1758 bp |
Protein Length | 585 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640108296 |
Product | S-layer-like domain-containing protein |
Protein accession | YP_001039268 |
Protein GI | 125975358 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGCGAG TACTGTCAAT TATTTTGGTT CTCGGCATTA TGATGCTAAG TACAGCTGCG GCGGCAGCCA CAGGATACGA CACGGTTTTT TCAGACATTT CGGGGCATTG GGCAAAGGAC ACAATTGAAA GAATGGCAAA CTTGGGAATA GTCAAAGGTG TTGGCAACGG TATGTTTTTG CCGGATAGGG AGATAAAACG GTCCGAATTT ATTGTGGCTT TGCACAAAGC AGCGGAAATT AAAATTAACT ACTTCAAGGC TCCGGACATC AATGAATTTT TCGATGATGT AAAAAACGAA GACTGGTATG CTTCGACTCT GTATGATCTG GCATCACTAA ACATAGTTGA CGACAGGGAG AAGTTTAGGC CCAATGACCT CATAACCCGT GAGGAAATGG TGCATTATCT TGTAAATACA TATAAATACA AGCTTAATAT TGTTTTGGAT CAAATTGATG AGAAAGACAA GTGTTTTGAT GATGAGGAGA GTATTCAGAA GCAGTATAAA GAATCTGTGA AGTATGCATT TAAACTGGGA TTTGTCAGGG GAAGGAGCAA TGGGAAATTT GTCCCCAAAG GATACAGCAC AAGGGCTGAA GCCATGATAG TTCTGGAAAA ATTAATGCAG GCTTTGAAGG AAAATATAAA GGCCGAGGTT GAAGTTATTC CTTCGTTTGA AAAACATGAA GACGGATATA AAATGGCGCT CACAATTAAA AACAACAGCA AAAAAGATGT TGTAATCCAA CACTTTTCCG GGCAAAAGTA TGATTTTGTA CTGCTTGATG ATAAAAAAGA AGAACTGTAC AGGTGGTCTG GAGACAGAGC ATTTGTTGAG ATTTTGACAA GTACAGAGAT TCCGGCGGGA AAGACAGTGG AATTTTCGGA AATTCTTGAT GCAAAGACTT ACGATGAAAT CAGCGGCAAG GCATATTATT TTAAAGCCGT GATAGTAGGA AGCAGTGAGG ATTTTGAAAT AAATGAGGAT GGATATTATT TAAGCCTCAA AGAAGAAAAA GATAATAAGC TTGAGATAGT ACCAAGCTAT AAAAAAGGCG AAAAGACCTT TACAATGAAG CTTTCAATAA AGAATACATC GAAAAAACCA ATAACAATCA ATCACACATC TGGGCAAAAG TTTGACTTTA AATTGCTCGA TGAAAATAAA GAAATAATTT ACACCTGGTC TGCTGACAAG ATATTTATAA TGATGGAAAC TCAAACGGTA ATAGATCCGG GAAAGACAGT GGAGTTTGCC GATGAGCTGG ATATGGAAAG CTTCGGGGAT ATTGTCAAAA AGGCAAGGTA TTTGAAGGCA TATATTGTGG GAGCAAGTGA GGATTGTGAA ATAGAAGAAG ACGGATACGA GGTAGAGATA ACAGAGAGTA AGGAAAACAG TCTTGTAGTT GTGCCGGAAT ATGAAAAGAG CCAGAATACT TTCACGATGA AGCTCAAGCT CAAAAATACA TCCGACGGGG ATATAACTAT TAACCACTCA TCAGGACAAA AGTTTGATTT CAAACTGTTG GACAAAAATA AAGAAATTCT CTATACCTGG TCCGCCGACA AGGGATTTAT AGGTGTACTG ACCGAGACGG TGATAGACGC CGGAAAGACA GTGGAATTCG AAGAAAAACT TGATATGGAA AACTATAAAG ATGTTATTGG AAAAGCAAAA TATTTGAAGG CATATATTGT AGGTACAAGT GAAGATTGTG ACATAGAAAA AGACGGATAC GAGATAGAGA TAAAATAA
|
Protein sequence | MKRVLSIILV LGIMMLSTAA AAATGYDTVF SDISGHWAKD TIERMANLGI VKGVGNGMFL PDREIKRSEF IVALHKAAEI KINYFKAPDI NEFFDDVKNE DWYASTLYDL ASLNIVDDRE KFRPNDLITR EEMVHYLVNT YKYKLNIVLD QIDEKDKCFD DEESIQKQYK ESVKYAFKLG FVRGRSNGKF VPKGYSTRAE AMIVLEKLMQ ALKENIKAEV EVIPSFEKHE DGYKMALTIK NNSKKDVVIQ HFSGQKYDFV LLDDKKEELY RWSGDRAFVE ILTSTEIPAG KTVEFSEILD AKTYDEISGK AYYFKAVIVG SSEDFEINED GYYLSLKEEK DNKLEIVPSY KKGEKTFTMK LSIKNTSKKP ITINHTSGQK FDFKLLDENK EIIYTWSADK IFIMMETQTV IDPGKTVEFA DELDMESFGD IVKKARYLKA YIVGASEDCE IEEDGYEVEI TESKENSLVV VPEYEKSQNT FTMKLKLKNT SDGDITINHS SGQKFDFKLL DKNKEILYTW SADKGFIGVL TETVIDAGKT VEFEEKLDME NYKDVIGKAK YLKAYIVGTS EDCDIEKDGY EIEIK
|
| |