Gene Cthe_1268 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1268 
Symbol 
ID4809773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1541610 
End bp1542788 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content39% 
IMG OID640106691 
Producthistidine kinase 
Protein accessionYP_001037693 
Protein GI125973783 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCCGGTT ATAAAAAAGT AGACACAGCT AATTTGGATA AAATAATTAA GAAAACAATT 
GAAGCCATAA ATAACAGTAA AGCCGAATTG TTTGACATAG CCGAAAACGC GAGAAATGAA
TGCGTAAGGC TTGAAAAAGA ACTGGAGGAG TTAAAGCGCA GGACATCCGA AATTATCAAA
AGTGTGGAAA CCCTTGAGGT TGCACTTTAT GAAAGCAAGA AACGGCTAAT GCATGTGAGC
AGAAATTACG ATAAATATTC TGAAGAGGAA TTAAGGGAAG CTTATGAAAA TGCAGACAAT
ATCAGGGTTG AGCTTGCCAT AAAACGGGAG CGTGAGCAAT ACTATATCAA AAGAAGAAAT
GAATTGGAAA TGAGGCTTAA AGAGGCTTAT AAAACCGTTG AAAAGGCGGA CAACCTTATC
TCCCAGATTG GAATTTCCTT AAGCTATCTT ACCGGAGATC TTGAGAATGT CAGTTTGCAG
ATTGAAGATA TGAAACAAAG GCGGCTTTTG GGGATTCGGA TAATAAAAGC CCAGGAAGAG
GAGCGACAGA GGGTTGCAAG GGAAATTCAC GACGGTCCTG CCCAATCGAT GTCCAATATT
GTTTTAAAAG CGGAAATATG CGAAAGATTG GTTGACTCTG ACCCGGAAAA GGCAAAAGAT
GAGCTTAGAA CTTTAAAATC CGTTGTCAGA GACACTCTTC GGGATGTAAG GAAAATAATA
TATGACTTAA GACCAATGTC ATTGGACGAC TTGGGTTTGA TACCAACCCT TCAAAGGTAT
ATAGAGACTT GTCGGGAAGA ATCCGGAATA AAAATAACGT TTAAGACAAG AGGTACATGT
GAGCAATTGA AACCTGTGGT TTCTTTGACC GTTTTCCGAC TTGTCCAGGA AGCAGTCAAT
AATATTAAAA AGCATGCCCG TGCCGATAAA GTAACTATAA ATCTCGAATT TTTGGAAAAA
GAATTAAAGC TCTATATAGC AGACAATGGA GTAGGTTTTG ACTTTGATTC TTTAAAATCA
AACGAAGAGG ATATAAACAA AGGCTTCGGT CTTATAAGCA TGAGAGAAAG GGTTGAGCTT
TTGGACGGCA AATTTGAGAT TGATTCTGCC GTTGGCAAAG GAACCAGACT TAATATAACT
GTACCTTTAT TACCGGAAGA GGGGGTCTCA AATGGATAA
 
Protein sequence
MAGYKKVDTA NLDKIIKKTI EAINNSKAEL FDIAENARNE CVRLEKELEE LKRRTSEIIK 
SVETLEVALY ESKKRLMHVS RNYDKYSEEE LREAYENADN IRVELAIKRE REQYYIKRRN
ELEMRLKEAY KTVEKADNLI SQIGISLSYL TGDLENVSLQ IEDMKQRRLL GIRIIKAQEE
ERQRVAREIH DGPAQSMSNI VLKAEICERL VDSDPEKAKD ELRTLKSVVR DTLRDVRKII
YDLRPMSLDD LGLIPTLQRY IETCREESGI KITFKTRGTC EQLKPVVSLT VFRLVQEAVN
NIKKHARADK VTINLEFLEK ELKLYIADNG VGFDFDSLKS NEEDINKGFG LISMRERVEL
LDGKFEIDSA VGKGTRLNIT VPLLPEEGVS NG