Gene Cthe_2681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2681 
Symbol 
ID4808852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3164153 
End bp3166549 
Gene Length2397 bp 
Protein Length798 aa 
Translation table11 
GC content42% 
IMG OID640108099 
Productserine phosphatase 
Protein accessionYP_001039073 
Protein GI125975163 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID[TIGR02865] stage II sporulation protein E 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGACAC AAATTGTTCC GTATCAAAAA AACAGTATTT TGGAAAGAAT GCATGGAAAA 
ACGAGGGGAG GAATTTCAGC CGGACTTGCA ATTTCAAAAA ACAATATTAT ATTCTTTTTG
CTAAGTTTCC TCCTTGGCAG AGTGTCTCTA TTTGGAGGTC TTATGCCTTT TGGACTTGCT
TTTTATGCGG CGGCAATCAC ATCAAATGCA AACAGATTTC TGACAGCTGC GGCAGTAGTA
GCCGGAATGG CAACGGGCGG AGCCCGGACG GAGCTTTATA CGGCATTGAC TTTTATGGTT
CTTTTCAGTT TGTTCACTAA ATTATTAAAA AATTCAGCGG AAAAAGAAAA TCTCAAACAT
GCAGCTTTAG GCTTTGTAAG TGCTGCTGTT CCCCAGATTA TAATGGCGTG GCTGCAGGGA
TTTTTAATGT ATGATGTTTT AATGGCACTG CTTAACAGTG TGCTGGTATT TTCAATGATA
TACATATTCA GGCGCGGCAT GCCTTTAATA GAAGGAACAA AGAAAAAAGG TGTCATTGGC
AATGAAGAAA TGATAAGTGT TGCAATGATG GCGGCCATTT CCCTGTCGGG ATTGGGGGGA
ATAAATATTT TTGGCACATC TGTTCTTAAT GCATTGGGTA TTCTGGCGGT AATGCTCTTA
AGCTTCAAAG CCGGCCCGGG AGTGGGAGCG GCTGTCGGAG TGATTGTGGG TTTGGTAATC
AATTTAAGCG GACCTGCAAC ACCACTGGTT ATAGGTTCCT ATGCTTTCTG CGGTTTGATG
GCCGGTTTGT TCAGAAATAT CGGCAAAATG GGAACAGCTG TGGGTTTTAT ACTTGCCAAT
GCAGTGCTGA CTTTATATAT AAACGGTTCA ACGGAAGTGA TTATACACAT CAAAGATATA
ATTCCCGCCG CACTTATGTT TGTGATTGTT CCGAAAAAAG TGATAGAAGA CGTGTTTGGA
GCATTTTCAC AGGAGACGGA AGCTTCAAAA GACAAGCCCG CTTACAGCCG GCGTATAAAG
GAACTTACGG TGGAAAGGCT GAATAATTTT GCAAGGGCCT TTGAGGAACT TTCCAAAACT
TTCAGCGAGA TTTCCCAAAC CAAAATAGTT GCCGGCAAGC AGGATATTTC TTCTTTGTTT
GACAGAGTGG CGGACAAGGT GTGCAAGGAT TGCAGCCTGT GTCTCCACTG CTGGGACAGG
AATTTCTACA ATACATATCA GGTGATGTTT AAAATAGTTG AGAATCTGGA GAAAAAGGGA
TGGATTGATG AGAGTGACAT ACCGGAGTAT TTCATGGAAA GATGTGAAAG AATAGGGGAG
TTTGTAAGGC AGGTAAACAA TGTATATGAA CTGTTTAAAG TGGACATGGT GTGGAAAAAC
AGAATAGGAG AGAGCAGAGG GCTTATATCG CAGCAACTGG ACGGGCTGGC AAAGGTTATA
TTAAATCTGG CCGTTGAAAT TGACGGTGAG ATTAAATTTA AGAGCGATAT GGAGGATGTA
TTGCTTTTCG AGCTTAGGAA TAAAGGAATA AAAGTAAATG ATGTGGTGGT GTGTGAAAAC
AAATGGGGTA AATATGAAGT AAACATTTTT CATAATGGAT GCGCGGGAGC AAACAAATGT
CTTGATTATA TTGAAGAAGT GGCATCTTCT GTATTGGGAA GGCGAATGGT ACGGGGTAAA
AATGAATGTA TACATAATTA CAGAACGGGT ATGTGCAATT TAAAACTGGT TGAGGCGGAA
GTTTTCAGCA TAACAACCGG TATGGCCAGA AGTTCAAAGT ATGAAAACCA GGTTTCCGGA
GACAGTTATT CCTTTATGAA CACCGGCTCG GGAAAGTATA TTCTGGCTTT GAGCGACGGT
ATGGGCACGG GACAGGCGGC GTCCAGCCAA AGCAAGGTTG CCATAAGTCT GCTGGAGCAG
TTTATGGAAA CAGGTTTTGA TCAGGATACC GCAATAAAGC TCATAAATTC AATACTTGTA
CTAAAATCCG ATGATGACTC CTTTGCCACC ATTGACCTTT CGGCTATAGA TTTGTATGAC
GGAAAAATTG AATTTGTGAA AATTGGCGCC GCTCCAACCT TTATAAAGAG ACAAAACAAG
GTGGAGGTCG TAAAAACAGT ATCTCTTCCG GCAGGAATAC TCAGTGATAT AGATACGGAG
CTCGTTTCCA AAAATGCGGA CAACGGGGAT TTTATAATCA TGGTTACTGA CGGAATTATT
GATTCTTTCA AGCTTGAAGA AGGCGGGGAG CAGAATTTGA TAAAATTTAT TGAGGATATT
GACAGCATAA ATCCCCAGGG AATTGCGGAT CTTATATTGG CCGAGGCCTG CTCAAAGTGC
AAGGACAAGC CTGTCGACGA TATGACGGTG CTGGTGGCAA AGGTATGGAA GCGATAA
 
Protein sequence
MKTQIVPYQK NSILERMHGK TRGGISAGLA ISKNNIIFFL LSFLLGRVSL FGGLMPFGLA 
FYAAAITSNA NRFLTAAAVV AGMATGGART ELYTALTFMV LFSLFTKLLK NSAEKENLKH
AALGFVSAAV PQIIMAWLQG FLMYDVLMAL LNSVLVFSMI YIFRRGMPLI EGTKKKGVIG
NEEMISVAMM AAISLSGLGG INIFGTSVLN ALGILAVMLL SFKAGPGVGA AVGVIVGLVI
NLSGPATPLV IGSYAFCGLM AGLFRNIGKM GTAVGFILAN AVLTLYINGS TEVIIHIKDI
IPAALMFVIV PKKVIEDVFG AFSQETEASK DKPAYSRRIK ELTVERLNNF ARAFEELSKT
FSEISQTKIV AGKQDISSLF DRVADKVCKD CSLCLHCWDR NFYNTYQVMF KIVENLEKKG
WIDESDIPEY FMERCERIGE FVRQVNNVYE LFKVDMVWKN RIGESRGLIS QQLDGLAKVI
LNLAVEIDGE IKFKSDMEDV LLFELRNKGI KVNDVVVCEN KWGKYEVNIF HNGCAGANKC
LDYIEEVASS VLGRRMVRGK NECIHNYRTG MCNLKLVEAE VFSITTGMAR SSKYENQVSG
DSYSFMNTGS GKYILALSDG MGTGQAASSQ SKVAISLLEQ FMETGFDQDT AIKLINSILV
LKSDDDSFAT IDLSAIDLYD GKIEFVKIGA APTFIKRQNK VEVVKTVSLP AGILSDIDTE
LVSKNADNGD FIIMVTDGII DSFKLEEGGE QNLIKFIEDI DSINPQGIAD LILAEACSKC
KDKPVDDMTV LVAKVWKR