Gene Cthe_0800 details


Gene Information

Locus tag: Cthe_0800
Symbol:
ID: 4810418
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 966908
End bp: 968119
Gene length: 1212 bp
Protein length: 403 aa
Translation table: 11
GC content: 34%
IMG OID: 640106217
Product: periplasmic sensor signal transduction histidine kinase
Protein accession: YP_001037228
Protein GI: 125973318
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
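
The coordinates and lengths in this record are mutually consistent, and the short Python sketch below checks that arithmetic. It assumes that the coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 966908
end_bp = 968119
gene_length_bp = 1212
protein_length_aa = 403

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)
```

Under these assumptions, the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.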


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTAAAA AGCTTAAAAT AAAATTTGTC ATGACCAATA TTGTCAGCAT AACAACCATT 
CTGGTTATTA TTTTTTTTGG TATATATTTG TCAGTAAAGG CGTTTTTGAA ACTTCAGGCC
GATATAATAC TGTACACCAT TGCAAATGAA GAAAAACTAA ATTCAAATTT TGATTCCGGA
TTTGTAAGAT TCTTTTCTAT AAAAATAGAC ACATCAGGAA AAATTATCGG GTATCTGATG
AATATCAACA TTTCCAGTGA AGAAATGGAA ACACTCAAAG AAAAAGTAAT AGAAAAAGGA
GAAACAAGAG GAAAAATTTC AAATGACAAG TTCAAATTTA AATTTTTGAA AATTCCCAAG
GAATATGGAT ATATAATTGT ATTTCTTGAT TACACTGTAG AAGAAAAAAT GTACAAACCA
CTCATTATCA TAAGTATCTA TATTGTTCTA TTGTCCATAG TACTGGTTTT TACAGTAAGT
TTTTTCCTTG CAAACAGATC CATAAAACCA ATAAAAACCT CCTGGGAAAA GCAGACTGCT
TTTATTGCTG ACGCATCCCA TGAACTCAGG ACACCTCTCG CAGTAATAAA TTCCAACCTG
GAAATAGTGA TGGAAAACGA AAATGAAACT GTCGGAAGTC AAATGAAGTG GCTTGGTAAC
ATCCAAAGCG AATTGGAGCG CATGAAAAAA CTTGTTGACG ATTTATTGTT TCTGGCAAGA
GCGGATGCTG AAGATGAAAT GCCTAAGGAA TATTTTGATT TAAGCAGGCT TGTACACAAA
ATTTATGACG AGTTTACACC CCTTTGCCAA AAGAAAAGCT TGGAATTTTT ATTGGACGCT
AAAGACAATA TTGTGTTTTA CGGCAACGAA TTTCGCATAA AACAGCTCAT AACAATATTA
TTGGACAATG CAATAAAGTT CACGGGTGAA GGAGGAAAAA TCATACTTAA GTTAAAAGTG
CATGCAAACA GTATTCAATT GTCTGTCAGC GATACAGGAG AAGGCATTGC CAAAGAACAT
ATTGACAAAA TTTTTGACAG ATTTTACAGG GTGGACAAAT CCCGTTCACG AAACCACGGA
GGCTCGGGAT TGGGCTTGGC CATTGCCAAA TGCATAGTAA ATGAACATAA AGGCACCATC
GATGTTTTCA GTGAAGTGTC CAGAGGAACG GAATTTACAG TATCTTTGCC ATATAAAGCA
TCCCAGTGTT AA
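
The GC content listed in the gene information (34%) can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as printed here, in whitespace-separated blocks of 10 bases. `gc_content` is a hypothetical helper written for this illustration, not part of any database tool.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence; paste the full sequence to get the
# whole-gene value.
first_line = "ATGTTTAAAA AGCTTAAAAT AAAATTTGTC ATGACCAATA TTGTCAGCAT AACAACCATT"
print(f"{gc_content(first_line):.1f}% GC in the first 60 bases")
```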
 
Protein sequence
MFKKLKIKFV MTNIVSITTI LVIIFFGIYL SVKAFLKLQA DIILYTIANE EKLNSNFDSG 
FVRFFSIKID TSGKIIGYLM NINISSEEME TLKEKVIEKG ETRGKISNDK FKFKFLKIPK
EYGYIIVFLD YTVEEKMYKP LIIISIYIVL LSIVLVFTVS FFLANRSIKP IKTSWEKQTA
FIADASHELR TPLAVINSNL EIVMENENET VGSQMKWLGN IQSELERMKK LVDDLLFLAR
ADAEDEMPKE YFDLSRLVHK IYDEFTPLCQ KKSLEFLLDA KDNIVFYGNE FRIKQLITIL
LDNAIKFTGE GGKIILKLKV HANSIQLSVS DTGEGIAKEH IDKIFDRFYR VDKSRSRNHG
GSGLGLAIAK CIVNEHKGTI DVFSEVSRGT EFTVSLPYKA SQC
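
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand. The codon-to-amino-acid assignments of translation table 11 are the same as in the standard code (the two tables differ in which codons may act as start codons), so the standard assignments are used. `translate` and `CODON_TABLE` are names chosen for this illustration, and translation simply stops at the first stop codon.

```python
from itertools import product

# Amino acids for all 64 codons, enumerated with bases in T, C, A, G order.
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the
# standard code, differing only in the permitted start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence.
print(translate("ATGTTTAAAA AGCTTAAAAT AAAATTTGTC"))  # MFKKLKIKFV
print(translate("TCCCAGTGTT AA"))                     # SQC
```

The two fragments reproduce the first ten and last three residues of the protein sequence. The final TAA is the stop codon that accounts for the difference between 404 codons and 403 amino acids.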