Gene Cthe_2332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2332 
Symbol 
ID4809260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2778314 
End bp2780116 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content41% 
IMG OID640107739 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_001038727 
Protein GI125974817 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTTGA AGCGTTTGCA GTTTCTGACA AGTCTTCAGT GGCGGCTTGT AACTATTTTC 
ATACTTTTGG CATTGGTGTT GGCGGTGTCG GCAAGTGTCT CGCTAAATTA TTTTGTGGAA
TTGTTTTTTT ATGACACCTT TAAAGCGGGA ATAGAGAATG GTTTTGAGTA TTGGGGCATA
GATGATGAGG ACCAGCCGAC AAAAGAAGAA ATTGTTTCGT TTCTTACGGC TAACAACAAA
GCAAATGCAA TGTCCCTGTT TTTTATAAAC AATTTCAGAA CATTTACCAT AATTGACAAG
AACACCAATG AAATAATATA CACCGATGTA AAGTCTCCTT ACAGGGAAAC ACTGAGGGAG
GATATTATAC AGTCAAGAAA CTATCTTGCC GCACTGGCGG GAGGTAAAGG AGACAAGGGA
AAATTAATCA GAATCGGGGA CAAGTCCTTT TTTGACTATG CAAGACGAAT TGGCAATACG
GATTATATTT TATACTTCAG GTATGACCGG GAAGAGTGGG CCGGAGCAAT AGAAGCTTTT
AATGATATAA TAAAGCTGAG TTTTCTGATT GCTGTTATAT TGTCATTGAT TTTTGGATAT
GCACTGTCGA AAACGATAAC CGTCCCGATT GTCAATCTCA TGCACAAGGC CAGGGAAATG
GCGGCGGGGC ATTTCGGACA GGTAATGGAG GTTAAATCCG ACGATGAAAT AGGAAAGCTT
ACAAAAGCTT TCAACTTCAT GAGCAGGGAG CTTAGGAAAA CTTTAAACCA GATATCGAGG
GAAAAAAGCA AGATTGAGAC TATTCTTAAC TATATGACCG ACGGTGTGAT TGCGTTTAAC
ATAAATGGTG AGGTAATTCA CATAAATCCT GTGGCAAAAG CAATATTGGG AATTGAAAAG
TGCGATTTTG ATTTCAACGA GTTTTCGAAA AGATATAATT TAGATGTTTC GATAGAAGAG
GTCAAATACC TTGAAATATA CAACGCAAAG GAAGTAAGTA CCAATATAGG CGAAAAATAC
GTCAAAATAT ATTTTGCTCT TTTTACTGAC GAGGAAAACA ATGCTGACGG CGTAATTGCC
GTGCTTCACG ATATAACCGA ACAGCAAAAG CTTGAAAACA TGCGCAAGGA GTTTGTTGCA
AATGTGTCCC ATGAACTAAG GACACCTCTG ACTTCTATCA AGAGCTATGC CGAGACACTT
CTGGACGGAG CTTTGGAGGA CAGGGAACTT GCCGGGAAAT TCTTAAGTGT TATTAATTCG
GAAGCCGACA GAATGACAAG GCTTGTAAAA GATCTGCTCC AGCTTTCAAG GCTCGACAAC
AATCAAATGA AGTGGGATAT GCAGAAAATA TCCTTTGAAG ACTTGGTAAG AAACTGCGTG
GAAAAAGTTA AATTTGAATC TGAAGAGAAA AATCAGACGC TGGAGTGTTT TACCATAGGG
GAGGAACTGG AGATTGTTGC CGACAAAGAC CGCATGGAGC AAGTGGTTTT GAATATTCTT
ACCAATGCCA TAAAGTATAC TCCTGAAGGA GGCAAAATTA CAGTTTATAT TGGAAGAATG
TACAGCGAGG TTTATGTAAA AGTGGTCGAC TCAGGTATAG GAATTCCAAG GGAGGATCTT
GGCAGAATAT TTGAAAGATT CTACAGGACG GACAAGGCCC GTTCAAGAGA AATGGGCGGT
ACAGGTCTTG GATTGGCAAT TGCAAAAGAA ATCGTGGAAG CTCATAAAGG CTCGATATCT
GTCGCAAGTG AGCCTGGAAA AGGTACGGAA GTTACCGTGA AGCTGCCGGC GGCGTGTAAC
TGA
 
Protein sequence
MILKRLQFLT SLQWRLVTIF ILLALVLAVS ASVSLNYFVE LFFYDTFKAG IENGFEYWGI 
DDEDQPTKEE IVSFLTANNK ANAMSLFFIN NFRTFTIIDK NTNEIIYTDV KSPYRETLRE
DIIQSRNYLA ALAGGKGDKG KLIRIGDKSF FDYARRIGNT DYILYFRYDR EEWAGAIEAF
NDIIKLSFLI AVILSLIFGY ALSKTITVPI VNLMHKAREM AAGHFGQVME VKSDDEIGKL
TKAFNFMSRE LRKTLNQISR EKSKIETILN YMTDGVIAFN INGEVIHINP VAKAILGIEK
CDFDFNEFSK RYNLDVSIEE VKYLEIYNAK EVSTNIGEKY VKIYFALFTD EENNADGVIA
VLHDITEQQK LENMRKEFVA NVSHELRTPL TSIKSYAETL LDGALEDREL AGKFLSVINS
EADRMTRLVK DLLQLSRLDN NQMKWDMQKI SFEDLVRNCV EKVKFESEEK NQTLECFTIG
EELEIVADKD RMEQVVLNIL TNAIKYTPEG GKITVYIGRM YSEVYVKVVD SGIGIPREDL
GRIFERFYRT DKARSREMGG TGLGLAIAKE IVEAHKGSIS VASEPGKGTE VTVKLPAACN