Gene Cthe_1393 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1393 
Symbol 
ID4809054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1702079 
End bp1703848 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content37% 
IMG OID640106817 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_001037818 
Protein GI125973908 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000411549 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA AAATTTTTAA ATATTATTTG CTTTTGATTC TTATAATCCT ATCTGTGACA 
GTCATATTTA TCCCAAAGGT CTCAAGAAAG TTCTACACTC AAGAGGTGGA AAACAAACTT
GAGGGAATAG CTTTTTCAAT TGAGTACTAT TTGTTGAATG AAGCAAAGAA CGGCGAAATT
GATTTTGACT TTATCGCAAA GGATTATGCC GCAAAGTACA ACCAAAATTC CACCTTTCAG
GGTGAGAGTC TAAGAATTAC CTTTATCAGT TATGACGGAA AAGTGTTGGG TGATTCGGAT
GCAAATTTTA ATCAAATGGA AAATCACTTG AGCAGAAAAG AGATTCAGGA TGCACTCAAA
GGGAATGTCG GTAAAGACAT CCGCAGCAGT AAGACTTTGA AGTTGGATTT GCTTTATATG
GCAATTCCTG TGGAGGAATT GAATGTAATT GCCAGAGTTT CTGTTCCCCT TGTTCAAATA
AAAAAAATTA ACAGATTGAT TTGGCTCTAT TCAATTTTAA TTTTTATTAT GGCACTTATA
ATAACGGTAA TTGTGTCTTT GAGGATAGCA GGACTTGTAA TTCGTCCGTT GAATGATATT
ATTTCGGTAT CAAAGGAAAT AACCAACGGC AACTATTCCA GGAGAATCAA GTTAAAATCA
AAAGATGAGC TGGGACAGCT TGCCGTCCAT TTTAACAAAA TGGCTTCAAA GCTTGAAAGA
ACCATATCTG ATTTGAATAC AAAGAAAATT GAGCTTGAAT CTATTGTGGA GAGTATAACA
AACGGAATTG TTGCAGTGGA CGGCAATAAC AAGGTTATAC TGATAAATCC TGCTGCTTTC
ACTGTTTTTA ATTTGGATGC CGATGCCGAA ATTTTGGGAG ATGATATTGA AAACCATATT
AAAAACAGTC AGATAAATTC TCTTTTAAAA GATGCAATAC AGAAAAATAA GCCGTTGGAG
GCTGAAGTTG CAATTGACGG CCGGGTGCTT TTGGTAAACG CTTCACCTAT AAGACCCAAA
GACAGCGATA TTGATAATTC GGGAGGAATT GTATTTATTC AGGACATAAC AAAGGTAAGA
AAGCTGGAGC AGATTCGAAC TGAATTTGTT TCCAATGTCA CCCATGAACT AAAAACACCG
ATAACACCGA TCCGAGGATT CATTGAGACA TTGAAAAACG GTGCTATGAA TAATCCTGTT
GTGGCGGAAA GATTTTTGGA AATCATTGAT ATCGAAGCTG AGCGTCTTCA CGAATTGATT
AACGACATTT TGCTGCTGTC GGAAATTGAG ACAAAGTTAA AAGATACCAA CTTGGAAATC
TTCGATTTAA AATCCATGGT GGATGATGTT TTTAAAGTTA TGCAAAACAT TGCAAAGGAG
AAGAAGATCA GCCTGAACAA CAATGTGCGG GATGAAGTGT TGATGAAAGC AAACATAAAC
CGCATGAAAC AGTTGATAAT GAATTTAGTG GACAACGGAA TAAAATATAA TGTTCAAAAC
GGCTCGGTAT CGGTTGACGG GTACAGGGAG GACGGAAAGG TTGTCATTTC CGTAAAAGAT
ACGGGAATAG GAATTCCTTC GGCCCATATA CCGAGAATTT TTGAGCGCTT TTACAGGGTG
GACAAGGGAA GGTCAAGGGG AATGGGAGGT ACCGGTCTGG GTCTTTCAAT AGTAAAGCAC
ATTGTAAACC TTTATAACGG AGAAATAAAG GTAAATTCTG TTGTGGGGGA AGGCACCGAA
TTTATTGTAA AAATTCCGTG CCAACCGTAA
 
Protein sequence
MKKKIFKYYL LLILIILSVT VIFIPKVSRK FYTQEVENKL EGIAFSIEYY LLNEAKNGEI 
DFDFIAKDYA AKYNQNSTFQ GESLRITFIS YDGKVLGDSD ANFNQMENHL SRKEIQDALK
GNVGKDIRSS KTLKLDLLYM AIPVEELNVI ARVSVPLVQI KKINRLIWLY SILIFIMALI
ITVIVSLRIA GLVIRPLNDI ISVSKEITNG NYSRRIKLKS KDELGQLAVH FNKMASKLER
TISDLNTKKI ELESIVESIT NGIVAVDGNN KVILINPAAF TVFNLDADAE ILGDDIENHI
KNSQINSLLK DAIQKNKPLE AEVAIDGRVL LVNASPIRPK DSDIDNSGGI VFIQDITKVR
KLEQIRTEFV SNVTHELKTP ITPIRGFIET LKNGAMNNPV VAERFLEIID IEAERLHELI
NDILLLSEIE TKLKDTNLEI FDLKSMVDDV FKVMQNIAKE KKISLNNNVR DEVLMKANIN
RMKQLIMNLV DNGIKYNVQN GSVSVDGYRE DGKVVISVKD TGIGIPSAHI PRIFERFYRV
DKGRSRGMGG TGLGLSIVKH IVNLYNGEIK VNSVVGEGTE FIVKIPCQP