Gene Cthe_1846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1846 
Symbol 
ID4809392 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2189981 
End bp2191372 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content43% 
IMG OID640107260 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_001038260 
Protein GI125974350 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000887504 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTTA GTTTAAGGAT AAAGTTGTCC GTATCTTATC TTCTGGTTGC CGTGACAAGC 
GTAGTTCTGG CAATGATGCT GACCAATATG TTTCTTGACA GCCACTTTAG AAAATATGTC
AACCAAAACC AGGAGAAAAG AAACCGCGAA GTTGTCACAG CGGTAAGCCA GCAGTATCTG
GGAGACGGAA AATGGAACAC GGAAATGGTG GAAGCCATAG GTGTCAGCGC GCTTGAAAAC
GGTCTGATAA TAAAGGTAAA GGATATCAGC GGCAGAGTAA TCTGGGATGC GACGGCCCAT
AACAACGGAA TGTGCCAAAG AATTATTGAA CATATGGCAA GAAACGTAAG CAGACGTTAT
CCTGATGCCG GAGGATCTTA CACAGAGATA CCTTATGAGG TAACTTACAA CCAGAAGAGT
GTCGGTCTGG TGGAGATTGG CTCCTATGGG CCGTATTATT TGAGTGACAA TGACCTTGCA
TTTATAAACA CTTTGAACAA AATTTTGATT GCAGTAGGTG TTTTTTCAGT AATTTTCTCG
CTGGTTTTGG GAAGTATAAT GGCAAAAAAG CTGAGCCAGC CCATAGCCAG GGTTATAAGT
AGCGCGCAGT CGATAGCCCG GGGATATTTT TCCGACAGAA TAACGGAAAA ATCCACTACG
GAGGAAATAT GCCAGTTGAC TTCAACAATC AATAATCTTG CCCAAACCCT GGAAAATCAG
GAGGCTCTGC GCAAAAAAAT GAGTGCCGAC ATGGCCCACG AACTTCGGAC TCCCCTTGCA
ACACTGCAAA GCCACATGGA GGCCATGATA GACGGTATAT GGGAAACGGA CGCGGACAGG
CTCAAGAGCT GTTATGAGGA AATAATCAGG ATAAACAAGA TGGTGGGAGA CCTTGAAAAG
CTGGCAAAAT ACGAAAGTGA AGGTTTTGTA TTGGACAAAA CCGTATTTGA CATGTCGGCA
CTGATACGCA GAATCATATG CAATTTTGAG CCCGAATTTA AAAATAACGG AATTGAAATT
GCCTTTGAAG GTGGGACTGA GGAAATTTTT GCCGATAAGG ATAAAATGAG TCAGGTAATT
ATAAACCTTC TGTCCAATGC CTTAAAATAC ACGCAAGAAG GAGGCAAGGT TGAAATAAGT
GCCAAAAGCA AGGGAGATGT GCTTGAGATC AGGGTGAAGG ATAACGGGCA GGGCATACCT
GAGGAGGACT TGCCTTTCGT GTTTGAAAGA CTTTACAGGG CGGACAAGTC GCGTAACAGG
AAGACAGGAG GCGCAGGAAT AGGACTTACT ATTGCAAAAA CAATTGTTGA GGCTCATAAT
GGACATATAG AGGTTTACAG TAAAATAAAT GAGGGAACGG AATTTCTTAT AACATTGCCA
AAAGGTATTT AA
 
Protein sequence
MKFSLRIKLS VSYLLVAVTS VVLAMMLTNM FLDSHFRKYV NQNQEKRNRE VVTAVSQQYL 
GDGKWNTEMV EAIGVSALEN GLIIKVKDIS GRVIWDATAH NNGMCQRIIE HMARNVSRRY
PDAGGSYTEI PYEVTYNQKS VGLVEIGSYG PYYLSDNDLA FINTLNKILI AVGVFSVIFS
LVLGSIMAKK LSQPIARVIS SAQSIARGYF SDRITEKSTT EEICQLTSTI NNLAQTLENQ
EALRKKMSAD MAHELRTPLA TLQSHMEAMI DGIWETDADR LKSCYEEIIR INKMVGDLEK
LAKYESEGFV LDKTVFDMSA LIRRIICNFE PEFKNNGIEI AFEGGTEEIF ADKDKMSQVI
INLLSNALKY TQEGGKVEIS AKSKGDVLEI RVKDNGQGIP EEDLPFVFER LYRADKSRNR
KTGGAGIGLT IAKTIVEAHN GHIEVYSKIN EGTEFLITLP KGI