Gene Cthe_1826 details


Gene Information

Locus tag: Cthe_1826
Symbol:
ID: 4809810
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2163000
End bp: 2164091
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 42%
IMG OID: 640107240
Product: response regulator receiver sensor signal transduction histidine kinase
Protein accession: YP_001038240
Protein GI: 125974330
COG category: [K] Transcription
    [T] Signal transduction mechanisms
COG ID: [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain
    [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase)
TIGRFAM ID:
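
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming a complete CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(2163000, 2164091)
print(length, protein_length(length))  # 1092 363
```

Both results agree with the Gene Length and Protein Length fields of the record.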


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAGCA TGCTTCTTCC AAAGTATTTT GACATACTGA TAGTTGACGA TATCCCGGAA 
CATATAGATG TAGCCGTTCA GGTACTTAGG GACAGCAATT TCAAAATCCG GGTTGCAACG
GACGGAAATA CCGCGCTGAA GCTTATATAT CAGCAGAAAC CGGATCTTAT TCTTTTAGAC
ATTTACATGC CCGAAATGGA CGGATTTGAA CTTTGCCGGC TTATAAAAAA TACACCCGAT
TTAAAAAATA TCCCCGTTAT ATTTCTCACA TCTTTCAGCG ACGAAGAAAG TATAAGAAAA
GGTTTTGAGT CGGGAGGGCA GGATTATGTG GTTAAGCCGT TTAACGCTTC GGAACTGCTT
TCAAGAGTCA AAACACATTT GATGCTGAAA TGCCAGGCAG AATCTTTGAA AGAAGCCAAT
AAGGAACTGG ACAGTTTCTG TTATACAGTC GCCCATGACC TTAAATCCCC GCTGCTTTCC
TTAAATAAAC TGGTTGAGCT TTTGGTTTCC GATCATTTAA ATCAATTGGA CTCGGCCGGA
AAAGAACTTG TCTATAATAT ACGGGAAAAA TCCTCGGAAA TAATACATAC CGTCGATCGT
TTGCTGGAAT TTTCTAAAAT GTGCGAAATG CAGGTCAATT TTGAAATCAT CGATCTCAAC
GAATTGTTTA CCGAGGTCTG CAATGAGCTC AAAAGCCTGG AGCCGCAGCG GGATATACGT
ATTCATATCC AGCCTCTGCC TAAGGTTTAT GGTGACCGTC TTCTGATGAG GCTCTTAATT
TCAAACATCC TTTCCAACGC TTTTAAATAT ACACGCAACA GGCAGACGGC AATTATTGAG
ATACAATCCT CGGAAGACGG CAATGAATAT GTTTTTTTCG TCAAAGACAA CGGTGCCGGC
TTTGACATGA AGTATTCGTC AAGGCTGTTC GGAGTATTTC AGAGGCTTCA CAGCAAAGAT
GAATTCGAAG GTTCGGGAGT CGGCCTCGCC ATATGCCAAA GGATTCTCAA GAGGCATAAC
GGCAGGGCGT GGATGACAGG TGAAATAGAC AAAGGTGCTA CTTTCTTCTT CACCCTCCCT
AAATTCGAAT GA
 
Protein sequence
MESMLLPKYF DILIVDDIPE HIDVAVQVLR DSNFKIRVAT DGNTALKLIY QQKPDLILLD 
IYMPEMDGFE LCRLIKNTPD LKNIPVIFLT SFSDEESIRK GFESGGQDYV VKPFNASELL
SRVKTHLMLK CQAESLKEAN KELDSFCYTV AHDLKSPLLS LNKLVELLVS DHLNQLDSAG
KELVYNIREK SSEIIHTVDR LLEFSKMCEM QVNFEIIDLN ELFTEVCNEL KSLEPQRDIR
IHIQPLPKVY GDRLLMRLLI SNILSNAFKY TRNRQTAIIE IQSSEDGNEY VFFVKDNGAG
FDMKYSSRLF GVFQRLHSKD EFEGSGVGLA ICQRILKRHN GRAWMTGEID KGATFFFTLP
KFE
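
The protein sequence is the translation of the gene sequence under translation table 11. A minimal sketch of that translation in plain Python follows. The helper names `translate` and `gc_fraction` are hypothetical, the spaces in the listings are assumed to be display-only grouping, and the codon table uses the standard amino-acid assignments, which table 11 shares with table 1. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace used for display grouping and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGAAAGCA TGCTTCTTCC AAAGTATTTT"))  # MESMLLPKYF
```

Passing the full gene sequence to `translate` should reproduce the protein sequence, and `gc_fraction` on the same input can be compared with the reported GC content.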