Gene Cthe_2944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2944 
Symbol 
ID4810227 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3459183 
End bp3460997 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content40% 
IMG OID640108367 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_001039335 
Protein GI125975425 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTTTTG GACGCAAAAA GGGAATGCCC GAAGTGATAA AGGGACTTGA AGAGGAAAAA 
ATCAGCACGT CATGGAAATT AAACCTTGAA TTTGCGCTAA TAGTTATGTT TTCCATATTT
GTGGTAGGTG CAGCCTCGTT TGGTATGATT TCGAACTTTA TATTAAATCA TGCAAAGGTC
AGTTCCAGCG AGCTTATCAA GCAAACATCC AAAAACATTG AAGCTATTCT GACAGGTTTT
GATGAACTGG CGATGACTGT GTCAAGGGAT AACACACTGG CAGAGTATAT AAGTGTGCAC
GACAGTATAG AGGATGTTAA TCTTAAGGCT CAAAATGAAA GAAAAATCAA GGAAATCCTG
AATAATTACG CAAAAAAAAG AAAAGACATA ACCAATATTG CAGTGGTATC AAACGGGGGA
ACTTACATAA CACCAGATGA GACAAAACCG GGCATTGATA AAAATATAGA TGACATTTAT
GCGGTAAAAG CGTTTAAAGA AAGCCACAGG CAGTCATTAT GGCTCAACAC GTATACATTG
GACACTTCAC CCTCCGAAAA TGTACAGGTC TTTTCAATAA TAAAGGGGAT ATATTCTTTA
AGCAGTCTGA AAAGCCAGGG AATCTTGATT ATAAACATTA CGGAAGACTA TCTTTTCAGA
CTTATATCGG ATATTAAGCC CATTGACGAC GGAAGAATCT ATATAATCGG CAGTGACGGG
AATTACGTTT TAAACCCCTA TGACAGAAGC AAGAACGGTA AAAAGGCGGA TCTTGAGTTT
GTGGAGGACA TGCTGCGCAA GGGCGAGAAT GTGGACATAA AGGAAATAAA TGGTGAGGAG
TATCTTGTGA CTTACAATAC CATTCAGGAG ATAAAAGGTA CCGGGCTGGG ATGGATGATA
GTCGAGATTA CTCCGGTTTC GGTGATCAGA ACCAGTGTTA CCGAAGCGGG AATGCGCCTG
TTTTTCATAG GTTTTGGGTG TGTTGTCCTG GGATTGATTC TTGTAGGGAT GGCAACCGCT
TTTTACAACC GATATCTCAA TAAAAGCTAT TGGGAAAGGC ATTCCGTTGC ATTGGAAAGG
GAGAGACTTG CTTCCTTGGG ACAACTGATA GGGGGAATTG CACACAATTT CAAAACTCCA
ATTATGTCAA TAGCCGGAGG ACTGGAGGCA TTAAAAGATC TTGTGGATGA GTACGACATT
TCCATCGGAG ATCCGCAGGT AACCGGTGAG GATCATCATG AAATTGCTGC TGAAATGAGA
GATTGGATAA GCAAAATAAA GCCTTACTGC GGGTATATGT CGGAGATTAT ATCCACGGTA
AAAGGGCAGG CCGACAATAT GAATGGATCA GAGAATTCAA GCTTTACGGT GGGAGAACTT
TTAAAAAGAG TTGAAATTTT AATGAGCCAC GAGCTTAAAA AATTTTCCTG CGAGCTGAGA
TTGGATATAA AAGTGGATGA GGATACAACT ATAAAAGGAG AAATAAACAA CCTTGTACAG
GTATTGAACA ACCTTATATC CAATTCTATC GAGTCTTACA ACGGAAAGGA AGGAAAAATA
GACCTGTCAG TAAGCAAAAA TGGCCAGGAA TTGGAAATAG TTGTAAAAGA CTATGGATGC
GGCATACCGG AAAATGTAAA GAGGAAACTT CTGAAAGAAA TGATAACGAC CAAGGGAAAA
AACGGAACAG GACTTGGCCT TTATATGTCT CACTCCACGA TTAAGGGCAA ATTTGGCGGA
ACAATGAAAG TCAAGAGTGA GGAAGGAAAA GGGACGGAAA TATGCATTTT GATTCCTTTT
GCTGCCAAAA CTTAA
 
Protein sequence
MLFGRKKGMP EVIKGLEEEK ISTSWKLNLE FALIVMFSIF VVGAASFGMI SNFILNHAKV 
SSSELIKQTS KNIEAILTGF DELAMTVSRD NTLAEYISVH DSIEDVNLKA QNERKIKEIL
NNYAKKRKDI TNIAVVSNGG TYITPDETKP GIDKNIDDIY AVKAFKESHR QSLWLNTYTL
DTSPSENVQV FSIIKGIYSL SSLKSQGILI INITEDYLFR LISDIKPIDD GRIYIIGSDG
NYVLNPYDRS KNGKKADLEF VEDMLRKGEN VDIKEINGEE YLVTYNTIQE IKGTGLGWMI
VEITPVSVIR TSVTEAGMRL FFIGFGCVVL GLILVGMATA FYNRYLNKSY WERHSVALER
ERLASLGQLI GGIAHNFKTP IMSIAGGLEA LKDLVDEYDI SIGDPQVTGE DHHEIAAEMR
DWISKIKPYC GYMSEIISTV KGQADNMNGS ENSSFTVGEL LKRVEILMSH ELKKFSCELR
LDIKVDEDTT IKGEINNLVQ VLNNLISNSI ESYNGKEGKI DLSVSKNGQE LEIVVKDYGC
GIPENVKRKL LKEMITTKGK NGTGLGLYMS HSTIKGKFGG TMKVKSEEGK GTEICILIPF
AAKT