Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2826 |
Symbol | |
ID | 4809663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3342816 |
End bp | 3344054 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640108246 |
Product | hypothetical protein |
Protein accession | YP_001039218 |
Protein GI | 125975308 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000022412 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGATC CATTATGTGA TAAAAAAGAT TTGATAGAAA CGATAGAATT TAACCAAAAG GCTATTTTAA AAATGAAAGA AAAAATTATT AATCTGAAGG CCGACATAGA GAATGGTATA CAAAGATATC CAAGAGATAA TCAAAGTATA ATTTATGGTA CATTTAAATT AATGTTTATG TATGGAATGA GTACACTGAG AGCAAAATAT TCTTTGGGAA ATGAGCCGGA TGCAATGATA AATGATTATT TAGATAATAT AACGTATTTA GAGAATATGG GAGAAGAAGA AATAGGATAT ATTTTTCTTT TATGGATGGT GGGACTGGGT ATCCTTTTGG AAGTGGATAA AGAAGAATTG AAGAAGTTGG CGAAAGTTAT AGAGAGACGA AAAACAGAAG ATGCACTTAT AGATTTTCTT TTGAAATCCT GTGATATAGG TTGGAACCAC AGTACAACGA AATATGAAAA AAAGAACCCG TATGAAAAGA CAGCAGAGAT TATAAAAATA GCATTGCACG ACAAAGACAA GGAAGCGGCA TCTAAAAGGC TTGAAAAATA TATGGAAAAA GAATGGTTCA AGGGGCACTA TGACTTTGAA TGGAGGAATG CGCACAAGAG GCCGGGGTAT TATGGTTTTT GGAGTTTTGA TACAGCGGCA CTGGCCAAGA TACTGGGACT GGATGACAGT GCACTGAAAA ACAACAACCA TTATCCTTAT GATTTGGCAC ACTATAAGAA GGGAATGACC TTTGATTTGA GTTGGTATAG TGTACCAAAG GAAGAGGAAG ATAAGGAAGA AGAAACGGTG GTATATGGTA TACCGGGTAA TCCTGAGTTG GAGAGGATAA TACCTGGGAA GTTTCACAGT TTTGTAAATG AGATAATAAA TGATTATAAA ACACTGCCGG ACGAAGAATT TTGGAAGAAA TACAATTTGA AAGAAATCTG GTTTGATGTG GAGGAGTATA AGGAGGATAA TAAAGATAAG AATTTGCTAG GAACGATTAT AGTATTCATG CTTGTGGACA AAGATTATAT TTTGCAGTTG GATTATAAAG AAGAGTTAAT AGACTATATA GAGAATATAC ATAATTACTG GGCCAAGAAA GAAGTTAAGC TTATAAGCTT TGAATTAGAC AATGACCAGC AGTACTATGC ATATGTGCCG AAGGATGCGG AGGTTGGTTC GTTGTATGAG GTAAAACTGA CAGAAGTGGA GAAAATAGAG GAGGTTTAG
|
Protein sequence | MRDPLCDKKD LIETIEFNQK AILKMKEKII NLKADIENGI QRYPRDNQSI IYGTFKLMFM YGMSTLRAKY SLGNEPDAMI NDYLDNITYL ENMGEEEIGY IFLLWMVGLG ILLEVDKEEL KKLAKVIERR KTEDALIDFL LKSCDIGWNH STTKYEKKNP YEKTAEIIKI ALHDKDKEAA SKRLEKYMEK EWFKGHYDFE WRNAHKRPGY YGFWSFDTAA LAKILGLDDS ALKNNNHYPY DLAHYKKGMT FDLSWYSVPK EEEDKEEETV VYGIPGNPEL ERIIPGKFHS FVNEIINDYK TLPDEEFWKK YNLKEIWFDV EEYKEDNKDK NLLGTIIVFM LVDKDYILQL DYKEELIDYI ENIHNYWAKK EVKLISFELD NDQQYYAYVP KDAEVGSLYE VKLTEVEKIE EV
|
| |