Gene Cthe_2166 details


Gene Information

Locus tag: Cthe_2166
Symbol:
ID: 4810879
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2576033
End bp: 2577169
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 39%
IMG OID: 640107569
Product: putative PAS/PAC sensor protein
Protein accession: YP_001038561
Protein GI: 125974651
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG2208] Serine phosphatase RsbU, regulator of sigma subunit
TIGRFAM ID: [TIGR00229] PAS domain S-box
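
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes 1-based inclusive coordinates and that the gene length includes one terminal stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but contributes no amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(2576033, 2577169)
print(length)                  # 1137
print(protein_length(length))  # 378
```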


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATGCAT TAAATAACAA GCTTTATTTT ATAGACAGCA CGAAACATAA GATCTTTGTA 
GAACAAATTA TAAACGGAAT GCATGACTGG GTAAGGGTTA TTGATATTAA CGACAATATA
ATCTTTGTCA ACGAATCCAT GGCAAAAGCA CTGGGCAAAA ACGTAATCGG GGAAAAATGT
TATAAAGCCA TCGGAAAAAG CGAGCCCTGT GAAAACTGTA CCTCAAGAAA AGCAGTTTTC
GAAGGAACAA TTCAAGCCAA GGAAGAAATC ATAGGGGATA GAATTTTCTC CGTCAAAAGT
TCCCCCATCA GGGATGAGAA CGGCCAAATT ACCGCTGTTG TTGAAGTTCT CCGTGACATA
ACCGAAATGA AAAAAATGCA GAAAAAAATC CTTGAGCACA ACCAAAAACT TCAGAGCGAG
CTTAACATGG CAAGAAGACT TCAATGCAGC CTTCTTCCAA AGGAACTGCC CCAGGACAAG
ATTGATTTCT CATATGTTTA CAGGCCCTGT GAAGCCATCG GAGGGGACTT TTTGGATATA
TTCAAGATCG ATGATGAGCA CATTGGAATA TATATCGCCG ATGTATCCGG GCATGGAGTA
CCGGCTTCAA TGCTTACAGT TTTCTTACGC TCTTCAATAA ACAAAAAAAC CCTTTCGCCC
GCAGAAGCTT TAAATCAGCT CTACAAAGAG TTTAACCGGG ATTATTATGA CCAGGAGTTG
TACATCACAA TATTTTATGC CATCATTGAC ACTAAAAATA AAAATATCAT ATATTCAAAT
GCAGGCCACA ACGCCAGTCC CGTATTATTC AACCATGAAA GCCACAGGTT CGACATTCTT
AGAATACCCG GAGTTCCCAT CAGTGACTGG GTTGACAATC CGGAATATAC TGAAAAAAGT
ATTTCAATTG AAAAAGGCGA CCGATTGTTT ATGTATACCG ACGGTATTGT GGAACTGCGA
AACAACAAAG GTGAGCAGTT TGGCGAGGAA AGGCTCCTTA ACATTTTGCT GGGTGAAAAA
ATGCCTCCTG CAATGACTCT TGACCGCATC ATAGAAGCTG CCATGGAATT TGCAAATATC
AAAAATTTCA ATAAAATAAT AGACGATATT ACAATGGCCT TGCTGGAAAT CTTATAA
 
Protein sequence
MDALNNKLYF IDSTKHKIFV EQIINGMHDW VRVIDINDNI IFVNESMAKA LGKNVIGEKC 
YKAIGKSEPC ENCTSRKAVF EGTIQAKEEI IGDRIFSVKS SPIRDENGQI TAVVEVLRDI
TEMKKMQKKI LEHNQKLQSE LNMARRLQCS LLPKELPQDK IDFSYVYRPC EAIGGDFLDI
FKIDDEHIGI YIADVSGHGV PASMLTVFLR SSINKKTLSP AEALNQLYKE FNRDYYDQEL
YITIFYAIID TKNKNIIYSN AGHNASPVLF NHESHRFDIL RIPGVPISDW VDNPEYTEKS
ISIEKGDRLF MYTDGIVELR NNKGEQFGEE RLLNILLGEK MPPAMTLDRI IEAAMEFANI
KNFNKIIDDI TMALLEIL
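
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the coding sequence. It is an illustration rather than the tool that produced this record. The codon table holds the standard amino acid assignments, which translation table 11 shares; table 11 also permits alternative start codons, which this sketch does not handle (the gene here begins with ATG). All names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(sequence: str) -> str:
    """Translate a coding sequence from its first base, stopping at a stop codon."""
    seq = clean_sequence(sequence)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGATGCAT TAAATAACAA GCTTTATTTT"
print(translate(first_blocks))  # MDALNNKLYF
```

Passing the full gene sequence to `gc_percent` and `translate` allows a comparison with the GC content and protein sequence reported in this record.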