Gene Information |
Locus tag | Cthe_0187 |
Symbol | |
ID | 4808675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 225552 |
End bp | 226724 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640105598 |
Product | hypothetical protein |
Protein accession | YP_001036621 |
Protein GI | 125972711 |
COG category | |
COG ID | |
TIGRFAM ID | |
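The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive (the convention under which the listed 1173 bp comes out) and that the final codon is a stop codon not counted in the protein length. The function name is illustrative and not part of any database API.

```python
def gene_and_protein_length(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa).

    Assumes a 1-based, inclusive interval and a CDS whose last codon
    is a stop codon (so it is excluded from the protein length).
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


gene_len, protein_len = gene_and_protein_length(225552, 226724)
print(gene_len, protein_len)  # 1173 390
```

Both values match the Gene Length and Protein Length rows of the record.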
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00032944 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGC TGGGATGTCT TGTTATTGCT TTACTGGTTG CATTTTTGTT CGGATGTCAG GTACCGACGG ATAAAACAAC TGAAAATGAA AACGGTGTGA ACAGCCCGGA TATTGATAAA GAAGTTATGA GCTTGAATGA CGCAATAGAT GAGAAAGAAA ATGGGGAAAG AAATGAGGAA GGTGCTGCAG GTGAGAGCGG CAAAGAATCA AGTAAATCTC CGGAGGCTTG GAAGGAAAGT GTATTAAGCA CTGTGGAGCC GTCGGTGGAA AATGGTGAAA GTAAGGAAAA AAGCAAGGAA AAAACGAAAA TTGCTGTGAC CGTGTATTAC AGGGACAGCG ACAACTGCCT CATTCCTATA ACGAGAGATA TTTATAAAGA AGAGGGCATT GCCAAAGCCG CTTTAAGATG CATGGTAGAC AATGATTCAA ATCGCAGTGC GGTGCAATCT CTGGGACTTT ATCCTGTTTT GCCGGAAGGT ACCGAAATAT TGGGAATGAA CATAAAAGAC GGTATCGCAA CGGTTGACTT TAATAGCCGT ATTTTGGACT ACAAAGACAA AACAGCTGAG AATAATATAG TGGCGGGTGT TGTATACTGT CTTACTGAGT TTAGCACAAT TGACGGCGTA AAAATATTGG TTGAAGGCAT GGAAAAGGAA AAACTGCAAT ATGGCGCCGA TATATCCGGT GTTCTTGACA GAGACAATGT GCTTATAAAT TCGGATAAGG TAAACCTTCA GGACAAACTG AAAAAAGCGG ACATTTACAT GTTTAAGTAC ATAAAAGACA AAAATGAGTT TATTATTCCG ATTTCAATGG AATATATCGG TGTTCCGGAG GAAGAACTTC CGGCGCAGAT AATAAGAATG CTGGCTGAAA AGCCTGCCAA CGAGAGAATT TATTCTTTGA TTCCGTCGGG AGTCGAGGTG CTTGGCAGCA GGATTGAGGG AAACCTTCTT ATATTGGATT TTAACAAAGA AATAAAAAAC TACGGGGGCA CGGCCAGGGA AGAAGGTATT TTGAAGCAGA TACTGTATAG TATGAAGCAG CTTAAAAATA TTGAGAAAAT ACGAATTCTT ATTGAGGGCA AAAAAGACCA TCTTCCCGAA GGTACCGACA TATCCGGCGA GATGTTGCTT CCTGTGGAAA TTAACAGACA AAAAGACTTG TAA
|
Protein sequence | MKKLGCLVIA LLVAFLFGCQ VPTDKTTENE NGVNSPDIDK EVMSLNDAID EKENGERNEE GAAGESGKES SKSPEAWKES VLSTVEPSVE NGESKEKSKE KTKIAVTVYY RDSDNCLIPI TRDIYKEEGI AKAALRCMVD NDSNRSAVQS LGLYPVLPEG TEILGMNIKD GIATVDFNSR ILDYKDKTAE NNIVAGVVYC LTEFSTIDGV KILVEGMEKE KLQYGADISG VLDRDNVLIN SDKVNLQDKL KKADIYMFKY IKDKNEFIIP ISMEYIGVPE EELPAQIIRM LAEKPANERI YSLIPSGVEV LGSRIEGNLL ILDFNKEIKN YGGTAREEGI LKQILYSMKQ LKNIEKIRIL IEGKKDHLPE GTDISGEMLL PVEINRQKDL
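The relationship between the two sequences above can be reproduced with a minimal sketch that strips the 10-character spacing, computes GC content, and translates the coding sequence. It assumes translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in their permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, chosen for illustration.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments
# are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
prefix = "ATGAAGAAGC TGGGATGTCT TGTTATTGCT"
print(translate(prefix))  # MKKLGCLVIA
```

The printed residues are the first ten of the listed protein sequence. Passing the full Gene sequence field to `translate` and `gc_percent` would allow the remaining residues and the GC content row to be checked in the same way.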
|
| |