Gene Cthe_0083 details

Gene Information

Locus tag: Cthe_0083
Symbol:
ID: 4808778
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 116031
End bp: 117710
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 36%
IMG OID: 640105492
Product: serine phosphatase
Protein accession: YP_001036517
Protein GI: 125972607
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG2208] Serine phosphatase RsbU, regulator of sigma subunit
TIGRFAM ID:
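
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(116031, 117710)
print(length_bp)                  # 1680
print(protein_length(length_bp))  # 559
```

Both values agree with the Gene Length and Protein Length fields of this record.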


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAGTA TTATACGCGC CTTTTTAAAG TATAAGGAAT ATGTATTGCT TGCAATAATA 
GCTGTTGCTG CCGGTTTAGT ATTTGGAATT GCATATTTTT TGAAGGAACA ATTCAGTGCC
GTAATGACTG TTGAGAGTTT TTTGGCATGG CACAATATAC TTGAATTTAC AAGTGTACTG
ATCTCCTTTA CTGTTTTTGC GGTCTCTTAT TATACTTACG AACAAACCGG AAATTTGCGA
TCCGTTTTTT TGGGCAGTGT TTTTCTTGCA GTAGGTATGG TGGATGCTTT TCATACACTG
TCATATAAGG GCATGCCCGC ATTTCTGATT GAAAACAACA GTTCCAACAG AGCTACAATC
TTCTGGATTA TTGCAAGATT GTTTACAGCT CTTGGCTTTT TCATATCAAG TTTGATACCG
GCTAGTGTCA AGATTCGGAC AAAGCGGATA ATATTTGTAG CCGTTCCCCT GATTACAAGC
ATTTCCGCTT TGTATCTGGC CACTTATCAT CCTGAACTGT TTCCGCCGAT GCATATTGAG
GGAAAAGGTC TTACTTTTTT TAAAATCTAT TCCGAACACC TGATTATTAT ATTGTTTGCC
CTGTCTGTTT TAATGTTTAT CCGTGAGTAT AACAAAACAA AAAACAAAAT GGTACTTCTT
CTTTGTGTTT CTCTTGAGAT AACTATTTTC AGTGAAGCTG CGTTTGTCCT GTATTTCAGT
GTTTATGACA TATATAATTA TCTTGGTCAC GTGTACAAAT TTATAGCATT CTTCATTATT
TTCAGGGCTA TTTTTATTAA CGATATACAA GAGCCGTATC GAAAGCTTTC AAAGGCAAAA
GAAAAACTGA GGAACCATGC CGAAAACCTG GACATGATGA TCAGGGAAAG AACAAGAGAG
CTTGAAAATC TCAATCAGAA ACTCATGCAA GATTTGAAAT ATGCCCGGGA CATACAAAAA
TCGGTTTTTA AGCTGCGCAA TCAGGATTGG GAAAAAGTGC GGTTTGAAGT GAAAAACTAT
TCTTCTGAAA TGGTAAGCGG TGATTTTTGC AATGTTTTTA AAATTGACAA CGACAATATA
GGGTTTTATA TCGGAGATGT GTCCGGCCAC GGTGTTCCGG CGGCAATGCT TACGATATTT
TTGAATCAGA CAGTGAAGAC TTTGCTGGAG ATGGAAACAA ACGAACTTAA CAAAATCAGT
CCGGCAATGG TTTTGGAAAA CATATACCGT TCTTTCAACT CAACAAACTT CGACGAAAAT
GTATATATTG TCATGATTTA TGCGGTATAC AACAGGCACA CACAGGTTCT TACTTATTCC
TCGGCAGGTC TTAATGTTTC GCCGATTCTT ATAAAACCTT CAGGAGAAAT TTTGGAAATA
GAAATAAAAG GCTTTCCCAT ATGCAAATTT ATTGAGTTTT ATGACGGAGA ATATCAAAAT
CATGCGTTAA AGCTTAATAA AGATGAGAAA ATTTTGTTTT ACACCGACGG GCTTATTGAA
GCACAGAATA CGGACAGGAA CTTTTTTGGA GACATGAGAC TGAAAGAAAT TTTACAGGAA
AATTATAATA AATCCGCTTC CGAACTGTCA AAGCTGATTT CCGACGGTAT TTTTGGATTT
ACCGGGAAAA AAGAAATTAA AGATGATATA ACGTTCCTTA TCATGGAAGT AGTAGAATGA
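
The gene sequence above is laid out in space-separated blocks of ten bases. To recompute figures such as the GC content from it, the blocks first have to be joined. The sketch below shows one way to do that in Python; `clean_sequence` and `gc_content` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGAGAAGTA TTATACGCGC CTTTTTAAAG TATAAGGAAT ATGTATTGCT TGCAATAATA"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_content(seq), 1))   # GC percentage of this 60-base window only
```

Passing the full 1680 bp sequence to the same functions gives a whole-gene value that can be compared with the GC content field of this record.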
 
Protein sequence
MRSIIRAFLK YKEYVLLAII AVAAGLVFGI AYFLKEQFSA VMTVESFLAW HNILEFTSVL 
ISFTVFAVSY YTYEQTGNLR SVFLGSVFLA VGMVDAFHTL SYKGMPAFLI ENNSSNRATI
FWIIARLFTA LGFFISSLIP ASVKIRTKRI IFVAVPLITS ISALYLATYH PELFPPMHIE
GKGLTFFKIY SEHLIIILFA LSVLMFIREY NKTKNKMVLL LCVSLEITIF SEAAFVLYFS
VYDIYNYLGH VYKFIAFFII FRAIFINDIQ EPYRKLSKAK EKLRNHAENL DMMIRERTRE
LENLNQKLMQ DLKYARDIQK SVFKLRNQDW EKVRFEVKNY SSEMVSGDFC NVFKIDNDNI
GFYIGDVSGH GVPAAMLTIF LNQTVKTLLE METNELNKIS PAMVLENIYR SFNSTNFDEN
VYIVMIYAVY NRHTQVLTYS SAGLNVSPIL IKPSGEILEI EIKGFPICKF IEFYDGEYQN
HALKLNKDEK ILFYTDGLIE AQNTDRNFFG DMRLKEILQE NYNKSASELS KLISDGIFGF
TGKKEIKDDI TFLIMEVVE
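
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The Python sketch below assumes the standard codon assignments, which translation table 11 shares with table 1 for internal codons. It does not handle the alternative start codons that table 11 permits, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first base, then second, then third cycle through TCAG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied as displayed.
print(translate("ATGAGAAGTA TTATACGCGC CTTTTTAAAG"))  # MRSIIRAFLK
```

The output matches the first ten residues of the protein sequence listed above.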