Gene Information |
Locus tag | Cthe_2170 |
Symbol | |
ID | 4810883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2581757 |
End bp | 2582866 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640107573 |
Product | hypothetical protein |
Protein accession | YP_001038565 |
Protein GI | 125974655 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Number of covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Number of covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATGCGT TCATAACCAT AAAAAAGTCC ATACTTATAG GTACCGTTTT TGCAATTACA ATTTGCGCCT TCAGTATAGC CCTGTATAAT GCCTCTCCGG CTCTGCCAAG CGTAGCCATA AGCGAAGAGC ATATAATTAT GCTCGATGAA ATGTTTAAAA TCAGAAGCAA AGCCTTCCTG GATAACGACT TGAAACTGAT AGAAACACTC TACAACAAAA AAACAAAGTT TGGGATATGG GCACTTGAAC ATGAGAAAAA AAAGATGAAA TACCTCCATG ACTGGGCTGA CAAACAAGGC ATAAAATTTA CCGATATAAA ACCCAAAGTT AAAATCTACC AGGTAAAGGA AAGGGGCAGC GGCTATTTAA TTAACTTCAT AGCCTCTACA GAGTATCATT ACCACTACGA AAATCAGCCC GAAATCACCA ATTTTTTCAG AATCGGAACC TACCATTCCA TGAATCTGGA TAATGTTGAC GGTCAATGGC TTATTTCAAA AGAATGGTAT ACTGATCCCT TTGCCGACTC ATTAAACACC GACAATATAA AAGTGGACGA ATTTAAACAG TATTTATTGA ATTCTTCTCC CAGAGATTTC TCCAAGCTAA ACAAAAGGCG CATAAGTGCG GTGGAATATG CCGACCGCTA CTGTGGGGCC GCCGCCGATG AGCAATACGG CTATTCATAC AATAAAAAAT ACAAAAATTA CAACCCTTTA GGCGGAGACT GCGCAAACTT TGCCTCCCAA ATTCTCTATG AAGGAGGCAA ATTCAAGCAG ACCGGAGCAT GGAGGTATGA AAAAGACGGA AGCAAAGCAT GGGTCAACGC CCATGCTTTC AACAGCTACA TGCTCTACAG CGGCAGAGGT TCATTAATCG CAAGAGGTAC CTACAATCAA GTTTTCAAAG CTTCATTCAA GCTTCTGCCG GGGGACTACA TAGCCTACGA AAAGAAAGGA AAAGTAGTCC ACATATCCGT TGTCACCGGC GCCGATTCAA AGGGCTATAC CCTGGTCAAC TGCCACAATA CAGACAGATA CAGAGTCCCG TGGGATTTAG GATGGAGTGA CAAAGGTATT AAATTCTGGC TGGTTCGTGT AAACTACTAA
|
Protein sequence | MYAFITIKKS ILIGTVFAIT ICAFSIALYN ASPALPSVAI SEEHIIMLDE MFKIRSKAFL DNDLKLIETL YNKKTKFGIW ALEHEKKKMK YLHDWADKQG IKFTDIKPKV KIYQVKERGS GYLINFIAST EYHYHYENQP EITNFFRIGT YHSMNLDNVD GQWLISKEWY TDPFADSLNT DNIKVDEFKQ YLLNSSPRDF SKLNKRRISA VEYADRYCGA AADEQYGYSY NKKYKNYNPL GGDCANFASQ ILYEGGKFKQ TGAWRYEKDG SKAWVNAHAF NSYMLYSGRG SLIARGTYNQ VFKASFKLLP GDYIAYEKKG KVVHISVVTG ADSKGYTLVN CHNTDRYRVP WDLGWSDKGI KFWLVRVNY
|
| |
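The length fields in this record can be cross-checked against its coordinates. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon; the function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced here.

```python
# Values copied from the record above.
START_BP = 2581757
END_BP = 2582866
GENE_LENGTH_BP = 1110
PROTEIN_LENGTH_AA = 369


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; grouping spaces are ignored."""
    bases = sequence.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    computed_gene = gene_length(START_BP, END_BP)
    computed_protein = protein_length(computed_gene)
    # Compare the computed values with the fields stated in the record.
    print(computed_gene, computed_gene == GENE_LENGTH_BP)
    print(computed_protein, computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions the coordinates give 1110 bp and 369 aa, matching the Gene Length and Protein Length fields. Passing the full Gene sequence string to `gc_percent` would give a value to compare against the reported GC content.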