Gene Information |
Locus tag | Cthe_2166 |
Symbol | |
ID | 4810879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2576033 |
End bp | 2577169 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640107569 |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_001038561 |
Protein GI | 125974651 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG2208] Serine phosphatase RsbU, regulator of sigma subunit |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| |
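The coordinates and lengths listed above agree with each other under two assumptions that the record itself does not state: that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminal stop codon while Protein Length does not. A minimal Python sketch of that arithmetic follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the gene length
    includes exactly one stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(2576033, 2577169)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the sketch reproduces the listed 1137 bp and 378 aa.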
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGCAT TAAATAACAA GCTTTATTTT ATAGACAGCA CGAAACATAA GATCTTTGTA GAACAAATTA TAAACGGAAT GCATGACTGG GTAAGGGTTA TTGATATTAA CGACAATATA ATCTTTGTCA ACGAATCCAT GGCAAAAGCA CTGGGCAAAA ACGTAATCGG GGAAAAATGT TATAAAGCCA TCGGAAAAAG CGAGCCCTGT GAAAACTGTA CCTCAAGAAA AGCAGTTTTC GAAGGAACAA TTCAAGCCAA GGAAGAAATC ATAGGGGATA GAATTTTCTC CGTCAAAAGT TCCCCCATCA GGGATGAGAA CGGCCAAATT ACCGCTGTTG TTGAAGTTCT CCGTGACATA ACCGAAATGA AAAAAATGCA GAAAAAAATC CTTGAGCACA ACCAAAAACT TCAGAGCGAG CTTAACATGG CAAGAAGACT TCAATGCAGC CTTCTTCCAA AGGAACTGCC CCAGGACAAG ATTGATTTCT CATATGTTTA CAGGCCCTGT GAAGCCATCG GAGGGGACTT TTTGGATATA TTCAAGATCG ATGATGAGCA CATTGGAATA TATATCGCCG ATGTATCCGG GCATGGAGTA CCGGCTTCAA TGCTTACAGT TTTCTTACGC TCTTCAATAA ACAAAAAAAC CCTTTCGCCC GCAGAAGCTT TAAATCAGCT CTACAAAGAG TTTAACCGGG ATTATTATGA CCAGGAGTTG TACATCACAA TATTTTATGC CATCATTGAC ACTAAAAATA AAAATATCAT ATATTCAAAT GCAGGCCACA ACGCCAGTCC CGTATTATTC AACCATGAAA GCCACAGGTT CGACATTCTT AGAATACCCG GAGTTCCCAT CAGTGACTGG GTTGACAATC CGGAATATAC TGAAAAAAGT ATTTCAATTG AAAAAGGCGA CCGATTGTTT ATGTATACCG ACGGTATTGT GGAACTGCGA AACAACAAAG GTGAGCAGTT TGGCGAGGAA AGGCTCCTTA ACATTTTGCT GGGTGAAAAA ATGCCTCCTG CAATGACTCT TGACCGCATC ATAGAAGCTG CCATGGAATT TGCAAATATC AAAAATTTCA ATAAAATAAT AGACGATATT ACAATGGCCT TGCTGGAAAT CTTATAA
|
Protein sequence | MDALNNKLYF IDSTKHKIFV EQIINGMHDW VRVIDINDNI IFVNESMAKA LGKNVIGEKC YKAIGKSEPC ENCTSRKAVF EGTIQAKEEI IGDRIFSVKS SPIRDENGQI TAVVEVLRDI TEMKKMQKKI LEHNQKLQSE LNMARRLQCS LLPKELPQDK IDFSYVYRPC EAIGGDFLDI FKIDDEHIGI YIADVSGHGV PASMLTVFLR SSINKKTLSP AEALNQLYKE FNRDYYDQEL YITIFYAIID TKNKNIIYSN AGHNASPVLF NHESHRFDIL RIPGVPISDW VDNPEYTEKS ISIEKGDRLF MYTDGIVELR NNKGEQFGEE RLLNILLGEK MPPAMTLDRI IEAAMEFANI KNFNKIIDDI TMALLEIL
|
| |
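The sequences above are displayed in space-separated blocks of 10 characters. The sketch below strips that layout and computes GC content, assuming the listed value is the fraction of G and C among all bases (the record does not say how its 39% figure was derived or rounded). The helper names are hypothetical.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# Small illustrative input, not taken from the record.
example = clean_sequence("ATGC GGCC")
print(example, gc_fraction(example))
```

Pasting the Gene sequence value into `clean_sequence` should give a string whose length equals the listed Gene Length, provided the displayed sequence is complete.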