Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2681 |
Symbol | |
ID | 4808852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3164153 |
End bp | 3166549 |
Gene Length | 2397 bp |
Protein Length | 798 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640108099 |
Product | serine phosphatase |
Protein accession | YP_001039073 |
Protein GI | 125975163 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG2208] Serine phosphatase RsbU, regulator of sigma subunit |
TIGRFAM ID | [TIGR02865] stage II sporulation protein E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGACAC AAATTGTTCC GTATCAAAAA AACAGTATTT TGGAAAGAAT GCATGGAAAA ACGAGGGGAG GAATTTCAGC CGGACTTGCA ATTTCAAAAA ACAATATTAT ATTCTTTTTG CTAAGTTTCC TCCTTGGCAG AGTGTCTCTA TTTGGAGGTC TTATGCCTTT TGGACTTGCT TTTTATGCGG CGGCAATCAC ATCAAATGCA AACAGATTTC TGACAGCTGC GGCAGTAGTA GCCGGAATGG CAACGGGCGG AGCCCGGACG GAGCTTTATA CGGCATTGAC TTTTATGGTT CTTTTCAGTT TGTTCACTAA ATTATTAAAA AATTCAGCGG AAAAAGAAAA TCTCAAACAT GCAGCTTTAG GCTTTGTAAG TGCTGCTGTT CCCCAGATTA TAATGGCGTG GCTGCAGGGA TTTTTAATGT ATGATGTTTT AATGGCACTG CTTAACAGTG TGCTGGTATT TTCAATGATA TACATATTCA GGCGCGGCAT GCCTTTAATA GAAGGAACAA AGAAAAAAGG TGTCATTGGC AATGAAGAAA TGATAAGTGT TGCAATGATG GCGGCCATTT CCCTGTCGGG ATTGGGGGGA ATAAATATTT TTGGCACATC TGTTCTTAAT GCATTGGGTA TTCTGGCGGT AATGCTCTTA AGCTTCAAAG CCGGCCCGGG AGTGGGAGCG GCTGTCGGAG TGATTGTGGG TTTGGTAATC AATTTAAGCG GACCTGCAAC ACCACTGGTT ATAGGTTCCT ATGCTTTCTG CGGTTTGATG GCCGGTTTGT TCAGAAATAT CGGCAAAATG GGAACAGCTG TGGGTTTTAT ACTTGCCAAT GCAGTGCTGA CTTTATATAT AAACGGTTCA ACGGAAGTGA TTATACACAT CAAAGATATA ATTCCCGCCG CACTTATGTT TGTGATTGTT CCGAAAAAAG TGATAGAAGA CGTGTTTGGA GCATTTTCAC AGGAGACGGA AGCTTCAAAA GACAAGCCCG CTTACAGCCG GCGTATAAAG GAACTTACGG TGGAAAGGCT GAATAATTTT GCAAGGGCCT TTGAGGAACT TTCCAAAACT TTCAGCGAGA TTTCCCAAAC CAAAATAGTT GCCGGCAAGC AGGATATTTC TTCTTTGTTT GACAGAGTGG CGGACAAGGT GTGCAAGGAT TGCAGCCTGT GTCTCCACTG CTGGGACAGG AATTTCTACA ATACATATCA GGTGATGTTT AAAATAGTTG AGAATCTGGA GAAAAAGGGA TGGATTGATG AGAGTGACAT ACCGGAGTAT TTCATGGAAA GATGTGAAAG AATAGGGGAG TTTGTAAGGC AGGTAAACAA TGTATATGAA CTGTTTAAAG TGGACATGGT GTGGAAAAAC AGAATAGGAG AGAGCAGAGG GCTTATATCG CAGCAACTGG ACGGGCTGGC AAAGGTTATA TTAAATCTGG CCGTTGAAAT TGACGGTGAG ATTAAATTTA AGAGCGATAT GGAGGATGTA TTGCTTTTCG AGCTTAGGAA TAAAGGAATA AAAGTAAATG ATGTGGTGGT GTGTGAAAAC AAATGGGGTA AATATGAAGT AAACATTTTT CATAATGGAT GCGCGGGAGC AAACAAATGT CTTGATTATA TTGAAGAAGT GGCATCTTCT GTATTGGGAA GGCGAATGGT ACGGGGTAAA AATGAATGTA TACATAATTA CAGAACGGGT ATGTGCAATT TAAAACTGGT TGAGGCGGAA GTTTTCAGCA TAACAACCGG TATGGCCAGA AGTTCAAAGT ATGAAAACCA GGTTTCCGGA GACAGTTATT CCTTTATGAA CACCGGCTCG GGAAAGTATA TTCTGGCTTT GAGCGACGGT ATGGGCACGG GACAGGCGGC GTCCAGCCAA AGCAAGGTTG CCATAAGTCT GCTGGAGCAG TTTATGGAAA CAGGTTTTGA TCAGGATACC GCAATAAAGC TCATAAATTC AATACTTGTA CTAAAATCCG ATGATGACTC CTTTGCCACC ATTGACCTTT CGGCTATAGA TTTGTATGAC GGAAAAATTG AATTTGTGAA AATTGGCGCC GCTCCAACCT TTATAAAGAG ACAAAACAAG GTGGAGGTCG TAAAAACAGT ATCTCTTCCG GCAGGAATAC TCAGTGATAT AGATACGGAG CTCGTTTCCA AAAATGCGGA CAACGGGGAT TTTATAATCA TGGTTACTGA CGGAATTATT GATTCTTTCA AGCTTGAAGA AGGCGGGGAG CAGAATTTGA TAAAATTTAT TGAGGATATT GACAGCATAA ATCCCCAGGG AATTGCGGAT CTTATATTGG CCGAGGCCTG CTCAAAGTGC AAGGACAAGC CTGTCGACGA TATGACGGTG CTGGTGGCAA AGGTATGGAA GCGATAA
|
Protein sequence | MKTQIVPYQK NSILERMHGK TRGGISAGLA ISKNNIIFFL LSFLLGRVSL FGGLMPFGLA FYAAAITSNA NRFLTAAAVV AGMATGGART ELYTALTFMV LFSLFTKLLK NSAEKENLKH AALGFVSAAV PQIIMAWLQG FLMYDVLMAL LNSVLVFSMI YIFRRGMPLI EGTKKKGVIG NEEMISVAMM AAISLSGLGG INIFGTSVLN ALGILAVMLL SFKAGPGVGA AVGVIVGLVI NLSGPATPLV IGSYAFCGLM AGLFRNIGKM GTAVGFILAN AVLTLYINGS TEVIIHIKDI IPAALMFVIV PKKVIEDVFG AFSQETEASK DKPAYSRRIK ELTVERLNNF ARAFEELSKT FSEISQTKIV AGKQDISSLF DRVADKVCKD CSLCLHCWDR NFYNTYQVMF KIVENLEKKG WIDESDIPEY FMERCERIGE FVRQVNNVYE LFKVDMVWKN RIGESRGLIS QQLDGLAKVI LNLAVEIDGE IKFKSDMEDV LLFELRNKGI KVNDVVVCEN KWGKYEVNIF HNGCAGANKC LDYIEEVASS VLGRRMVRGK NECIHNYRTG MCNLKLVEAE VFSITTGMAR SSKYENQVSG DSYSFMNTGS GKYILALSDG MGTGQAASSQ SKVAISLLEQ FMETGFDQDT AIKLINSILV LKSDDDSFAT IDLSAIDLYD GKIEFVKIGA APTFIKRQNK VEVVKTVSLP AGILSDIDTE LVSKNADNGD FIIMVTDGII DSFKLEEGGE QNLIKFIEDI DSINPQGIAD LILAEACSKC KDKPVDDMTV LVAKVWKR
|
| |