Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1204 |
Symbol | |
ID | 4809896 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1435506 |
End bp | 1436774 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106627 |
Product | hypothetical protein |
Protein accession | YP_001037629 |
Protein GI | 125973719 |
COG category | [S] Function unknown |
COG ID | [COG2718] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02877] sporulation protein YhbH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAAGGAT TAAACCCCTT AATCCCGCAA AAATCAAACC GGGGGGTGAT TGATGTGGCT ATTTTCAGAG AATATGTGAG CGGCGGAAAA GACAGAGCTG CTGAAGACAG GCGGCGCCAC AGGGAATTGG TTGAAGAATC CATAAAAAAG AATATAGGTA ATATCATTGC AGAGGAAAGT ATTATCGGTC AGAGCAAGGA CAAAAAAATA AAAATACCGA TAAGAAGCAT TAAAGAATAT CAGTTTGTAT ACGGGAAAAA CGGTCCTTCG GTGGGTTCCG GAGACGGAAC GGAAAAGCGC GGGGAAAAAA TAGGCGAGGA GAAAACGGCT AACGGCGGAC AGGGAGTTGG GCAGGCCGGA AACCAGGAAG GGGAAGAAAT TTATGAGACA GAAATTACAA TTGAGGAACT CATAAATTAT CTCTTTGATG ATTTGAATCT TCCGGATATA GATAAAAAGA GGATTGCGGA GCAGGAATCC ATAAGAAGCT ACAAGAACCT GGGTTATCAG CGAAAAGGAA TACCCCCAAG ACTTGCCAAG AAGCGTTCCG TTATTGAAAA GATAAAAAGA AAGCAGGCAT ATCTGAGAAA CAGCAGAGAA TTGGGCGATT TGGACGAAAG TGCCGAAGAG GATATCGCCG CGCAGGAGAC CTTGGACGGT GTCAGAAAGA GATTTCCTTT CAGCGAGGAT GATTTGCGAT ACAGAAGGGT CAGAGAGGAC CGCAAAAAAG ATTTCAATGC AGTGGTTATC TGTATTATGG ATGTTTCCGG TTCCATGGAC CAGACAAAGA AGTATCTTGC CCGAAGCTTT TATTTTTTGC TGTACCAGTT TATAAGATTG AAATATGCCA ATGTTGATGT TGTTTTTATA GCCCATACCA CCACTGCGAA AGAAGTCAGT GAAGATGAAT TTTTTCACAG AGGCGAATCG GGAGGAACAT ATATCAGCAG CGGCTATGAA AAAGCTCTTG AAATAATCGA GCAGAGATAC AACCCCAACA GTTGGAATAT ATATGCTTTT CATTGCAGTG ACGGCGACAA CTGGTCCGAG GACAATAAAA AAGCCGTCGA ACTTGGATTG AAACTCTGTG ATGTGTGCAA CCTGTTTGGA TACGGTGAAA TAGTGCCGGG TTATTATTCC ACCGGAAGTA CTATAAAAGA CGAGTTTCAA AAAAGCATTA AAAGAGACAA TTTTTCTGTC ATAACAATAA CCAATAAAGA TGATGTGCTT CCAGGATTGA AGAAACTCCT GGAAAAGGAA GGAGAATAA
|
Protein sequence | MEGLNPLIPQ KSNRGVIDVA IFREYVSGGK DRAAEDRRRH RELVEESIKK NIGNIIAEES IIGQSKDKKI KIPIRSIKEY QFVYGKNGPS VGSGDGTEKR GEKIGEEKTA NGGQGVGQAG NQEGEEIYET EITIEELINY LFDDLNLPDI DKKRIAEQES IRSYKNLGYQ RKGIPPRLAK KRSVIEKIKR KQAYLRNSRE LGDLDESAEE DIAAQETLDG VRKRFPFSED DLRYRRVRED RKKDFNAVVI CIMDVSGSMD QTKKYLARSF YFLLYQFIRL KYANVDVVFI AHTTTAKEVS EDEFFHRGES GGTYISSGYE KALEIIEQRY NPNSWNIYAF HCSDGDNWSE DNKKAVELGL KLCDVCNLFG YGEIVPGYYS TGSTIKDEFQ KSIKRDNFSV ITITNKDDVL PGLKKLLEKE GE
|
| |