Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0083 |
Symbol | |
ID | 4808778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 116031 |
End bp | 117710 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640105492 |
Product | serine phosphatase |
Protein accession | YP_001036517 |
Protein GI | 125972607 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG2208] Serine phosphatase RsbU, regulator of sigma subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAGTA TTATACGCGC CTTTTTAAAG TATAAGGAAT ATGTATTGCT TGCAATAATA GCTGTTGCTG CCGGTTTAGT ATTTGGAATT GCATATTTTT TGAAGGAACA ATTCAGTGCC GTAATGACTG TTGAGAGTTT TTTGGCATGG CACAATATAC TTGAATTTAC AAGTGTACTG ATCTCCTTTA CTGTTTTTGC GGTCTCTTAT TATACTTACG AACAAACCGG AAATTTGCGA TCCGTTTTTT TGGGCAGTGT TTTTCTTGCA GTAGGTATGG TGGATGCTTT TCATACACTG TCATATAAGG GCATGCCCGC ATTTCTGATT GAAAACAACA GTTCCAACAG AGCTACAATC TTCTGGATTA TTGCAAGATT GTTTACAGCT CTTGGCTTTT TCATATCAAG TTTGATACCG GCTAGTGTCA AGATTCGGAC AAAGCGGATA ATATTTGTAG CCGTTCCCCT GATTACAAGC ATTTCCGCTT TGTATCTGGC CACTTATCAT CCTGAACTGT TTCCGCCGAT GCATATTGAG GGAAAAGGTC TTACTTTTTT TAAAATCTAT TCCGAACACC TGATTATTAT ATTGTTTGCC CTGTCTGTTT TAATGTTTAT CCGTGAGTAT AACAAAACAA AAAACAAAAT GGTACTTCTT CTTTGTGTTT CTCTTGAGAT AACTATTTTC AGTGAAGCTG CGTTTGTCCT GTATTTCAGT GTTTATGACA TATATAATTA TCTTGGTCAC GTGTACAAAT TTATAGCATT CTTCATTATT TTCAGGGCTA TTTTTATTAA CGATATACAA GAGCCGTATC GAAAGCTTTC AAAGGCAAAA GAAAAACTGA GGAACCATGC CGAAAACCTG GACATGATGA TCAGGGAAAG AACAAGAGAG CTTGAAAATC TCAATCAGAA ACTCATGCAA GATTTGAAAT ATGCCCGGGA CATACAAAAA TCGGTTTTTA AGCTGCGCAA TCAGGATTGG GAAAAAGTGC GGTTTGAAGT GAAAAACTAT TCTTCTGAAA TGGTAAGCGG TGATTTTTGC AATGTTTTTA AAATTGACAA CGACAATATA GGGTTTTATA TCGGAGATGT GTCCGGCCAC GGTGTTCCGG CGGCAATGCT TACGATATTT TTGAATCAGA CAGTGAAGAC TTTGCTGGAG ATGGAAACAA ACGAACTTAA CAAAATCAGT CCGGCAATGG TTTTGGAAAA CATATACCGT TCTTTCAACT CAACAAACTT CGACGAAAAT GTATATATTG TCATGATTTA TGCGGTATAC AACAGGCACA CACAGGTTCT TACTTATTCC TCGGCAGGTC TTAATGTTTC GCCGATTCTT ATAAAACCTT CAGGAGAAAT TTTGGAAATA GAAATAAAAG GCTTTCCCAT ATGCAAATTT ATTGAGTTTT ATGACGGAGA ATATCAAAAT CATGCGTTAA AGCTTAATAA AGATGAGAAA ATTTTGTTTT ACACCGACGG GCTTATTGAA GCACAGAATA CGGACAGGAA CTTTTTTGGA GACATGAGAC TGAAAGAAAT TTTACAGGAA AATTATAATA AATCCGCTTC CGAACTGTCA AAGCTGATTT CCGACGGTAT TTTTGGATTT ACCGGGAAAA AAGAAATTAA AGATGATATA ACGTTCCTTA TCATGGAAGT AGTAGAATGA
|
Protein sequence | MRSIIRAFLK YKEYVLLAII AVAAGLVFGI AYFLKEQFSA VMTVESFLAW HNILEFTSVL ISFTVFAVSY YTYEQTGNLR SVFLGSVFLA VGMVDAFHTL SYKGMPAFLI ENNSSNRATI FWIIARLFTA LGFFISSLIP ASVKIRTKRI IFVAVPLITS ISALYLATYH PELFPPMHIE GKGLTFFKIY SEHLIIILFA LSVLMFIREY NKTKNKMVLL LCVSLEITIF SEAAFVLYFS VYDIYNYLGH VYKFIAFFII FRAIFINDIQ EPYRKLSKAK EKLRNHAENL DMMIRERTRE LENLNQKLMQ DLKYARDIQK SVFKLRNQDW EKVRFEVKNY SSEMVSGDFC NVFKIDNDNI GFYIGDVSGH GVPAAMLTIF LNQTVKTLLE METNELNKIS PAMVLENIYR SFNSTNFDEN VYIVMIYAVY NRHTQVLTYS SAGLNVSPIL IKPSGEILEI EIKGFPICKF IEFYDGEYQN HALKLNKDEK ILFYTDGLIE AQNTDRNFFG DMRLKEILQE NYNKSASELS KLISDGIFGF TGKKEIKDDI TFLIMEVVE
|
| |