Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2855 |
Symbol | |
ID | 4809135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3373261 |
End bp | 3374499 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640108275 |
Product | hypothetical protein |
Protein accession | YP_001039247 |
Protein GI | 125975337 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000208059 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGATC CGTTATGTGA TAAAAATTAT TTATTAAAAA CAATAGAACT TAGAAAGAAA TATATTTGTG AAATGAAAGG AGAAATTGTT CAATTAAAAT CTGATATAGA AAAGGGGATT CAGAGATATC CTAGAGATAA TCAAAGTATA ATTTTTGCTA GATTTGCAAT AATGTTTATG TATGGTATGG ACATGCTTTT AGCAAAATAT TCCTTGGGCA ATCACCCTGA TACAATGATA GATGACTATT TAGACAACAT AACATATTTA GAGAATTGCG GTGAAGAAGA GGCCGGCTAC ATTAACCTTT TATGGATGGT TGGACTGGGT ATCCTTTTGG AAATGGATAA AGAAGTGTTA AAAAGACTGG CAAGAGTTAT AGAAAGGCAA AGAATAGAAG ACGCACTTAT GGATTTTCTA TTGAAATCCT GTGATATAGG TTGGAATCAC AGTACAACGA AATATGAAAA AAAGAACCCG TATGAAAAGA CAGCAGAGAT TATAAAAATA GCATTACACG ACAAAGACAA GGAAGCGGCA TCAAAAAGGC TTGAAAAATA CATGGGAAAA GAATGGTTCA AGGGACATTA CGACTTTGGG TGGAGGAATG CCCATAAGGA ACCTGGCTAT TATGGTTTTT GGAGTTTTGA TACAGCGGCA CTGGCCAAGA TACTGGGACT GGATGACAGT GCGTTAAAAG ACAACAACCA TTATCCTTAT GATTTGGCAC ACTATAAAAA TGGAATGACC TTTGATTTGA GTTGGTATAG TGTACCAAAG GAAGAGGAAG ATAAGGAAGA AGAAACGGTG GTATATGGTA TACCGGGTAA TCCTGAGTTG GAGAGAATAA TACCTGGGAG ATTCCACAGT TTTGTAAATG AGATAATAAA TGATTATAAA ACACTGCCGG ACGAAGAATT TTGGAAGAAA TACAATTTGA AAGAAATCTG GTTTGATGTG GAGGAGTATA AGGAGGATAA TAAAGATAAG AATTTGCTAG GAACGATTAT AGTATTCATG CTTGTGGACA AAGATTATAT TTTGCAGTTG GATTATAAAG AAGAGTTAAT AGACTATATA GAGAATATAC ATAATTACTG GGCCAAGAAA GAAGTTAAGC TTATAAGCTT TGAATTAGAC AATGACCAGC AGTACTATGC ATATGTGCCG AAGGATGCGG AGGTTGGTTC GTTGTATGAG GTAAAACTGA CAGAAGTGGA GAAAATAGAG GAGGTTTAG
|
Protein sequence | MRDPLCDKNY LLKTIELRKK YICEMKGEIV QLKSDIEKGI QRYPRDNQSI IFARFAIMFM YGMDMLLAKY SLGNHPDTMI DDYLDNITYL ENCGEEEAGY INLLWMVGLG ILLEMDKEVL KRLARVIERQ RIEDALMDFL LKSCDIGWNH STTKYEKKNP YEKTAEIIKI ALHDKDKEAA SKRLEKYMGK EWFKGHYDFG WRNAHKEPGY YGFWSFDTAA LAKILGLDDS ALKDNNHYPY DLAHYKNGMT FDLSWYSVPK EEEDKEEETV VYGIPGNPEL ERIIPGRFHS FVNEIINDYK TLPDEEFWKK YNLKEIWFDV EEYKEDNKDK NLLGTIIVFM LVDKDYILQL DYKEELIDYI ENIHNYWAKK EVKLISFELD NDQQYYAYVP KDAEVGSLYE VKLTEVEKIE EV
|
| |