Gene Cthe_2945 details


Gene Information

Locus tag: Cthe_2945
Symbol:
ID: 4810228
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3461018
End bp: 3462868
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 38%
IMG OID: 640108368
Product: histidine kinase
Protein accession: YP_001039336
Protein GI: 125975426
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
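
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The Python sketch below shows that relationship under two assumptions not stated in the record: that the coordinates are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 3461018
end_bp = 3462868

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")    # 1851 bp
print(protein_length, "aa")  # 616 aa
```

Both results agree with the Gene Length and Protein Length values listed in the record.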


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.546794
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAAAAG AGATGAAAGC GCCGGAAAAA ATAAAAAGTG TTTTTAAAAG ATTTAATATA 
AAAACAATAA ACAAGCAAAT AAGGCTCAAT ATCGTTTTTG GTTTGATTGT ATTAATTTCC
ACCATGTTGT TGGGATATCT GTCCTTTGAC ATGATATCGG ATGCCATGAT AAGGCATGCC
GGTGCGGACA ACCTGGAGCT TGTAAAACAA ATAACCAAAA ATATTGAGAC AGTAATGACC
GGATTTGACG ATATTTCAAA CGAAATTCTT ACCAATGAAA ACTTTGACAG GCTTGTAAAA
ATGCATGTTA CATTGGATGA TGAACATAAA AAGGCAGCAA ACAGAAGAAG CATAGAGAGT
ATACTAAACG GTTATACCAA CACGAGAACG GACATAGCGG ACATAGCAGT GGTTACAAAT
ACCGGAGAAT ACATTACCTC GGGAGAAACA AGACCTTTGG TTACGGACAA TGCGCTTTCA
TACTATGTAG TAAAAAGATT CAAGCAAAGC GGAAGAGACT CATTGTGGCT TGACACGTAT
CAGACTGAGG TTGCATCCAC GGGAACACAT ACAGGGAACC AGCTGGTCAT ATCCAACATA
AAAAGCATTA AAGGAGAAAA CAATGAAGAA ATTGGCATGC TTATCCTTAA TGTAAAAGAA
TCCTACATAT ACAGTCTTAT ATCGGAAATA AAGCTTCCCG ATGAAGGGCA GCTGTATATT
GTCGGAAAAG ACGGCAATTA TGTAATGAAT CCATTTAACA GGCTTCAAAA TGGGAAAGTG
GATTATGTAA AATATGAGTT GTATATTGAA GAAATATTAA AGAAAAAAAA CGGAACATTT
ATAAAAAAAA TAGATGGAAG GGATTACTTG CTGGCCTTCC AGACGATTGA CAGCATAAAC
GGTATTGAAC TGGGATGGAC GGTATTCGGG ATGACACCGG TTGATATCAT AACGTCGGGT
ATTGAAAGTA CCCAAGATAT TTTGTATGAG ATTGGGTTGA TATGCGTTAT CGCAGGATTT
GTGATTTCTC TGCTGATTAC AAGGCTTTAC AATGCTCATC TGGAAAAGAG ATATGAAAGA
AAGCACTCCA TTATTATGGA AAGGGAGAGG CTTGCATCTT TGGGACAGCT TATGGGCGGA
ATAGCACAGA GTTTTAAAGC TCCAATTATG TCAATATCGG ATGGACTTGA TGAATTAAAC
AGTCTTGTGG ATGAATATGA AAAATCCATA GAAAACGAAA ATGTGTCGGA TGAGACAAGG
CATGAGATTG CTTCCAGGAT GAGAGAGTGC CTGGACAAGA TAAAACCGCA TTGTTCGTAT
ATTTCCGATG AAATATCTGC CGTAAAGGGG CAGGCTGTCA ATTTCAACGA TTCGACAGAC
GGAATTTTTA CTGTTGATGA ATTGATTAAA AATGTAAAAC TGCTCATGAG CCATGAGATT
AAATTCTGGA ATTGTGAAAT GAACGTGGAA CTTAAAGTAA GCGGAGACAC CTCGATAAGA
GGAGAAATAA ACAATATGAC TCAGGTAATG AATAATATAA TTACCAATGC CATTGAAGCC
TATAACGGCA AAGGAGGAAA AATTGATTTA ATATTCAGCA AAAAAGGACA TAATTTGGAG
ATAACCGTAA GAGATTATGG ATGCGGAATC CCCGAAAGCG TAAAAAGCAA ACTGTTTAAA
GAAATGGTGA CGACCAAGGG TTCAAAAGGT ACGGGTATAG GCGTGTATAT GGCCTATTCC
ACCATAAAAG GAAAATTCGG AGGAACCATG ACCATTGACA GCAAGGAAGG GAAGGGAACC
TCCGTAAATA TCACCATACC CTTAAAGGAT AAAGATTTTA CTCCACCGTA A
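
The GC content reported under Gene Information can be recomputed from the gene sequence. What follows is a minimal Python sketch, assuming the sequence is pasted as a string with the 10-base spacing used in this record. `gc_percent` is a hypothetical helper name, not part of any database tool, and how the record rounds its percentage is not known.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence; paste all rows to check the whole gene.
first_row = "ATGTCAAAAG AGATGAAAGC GCCGGAAAAA ATAAAAAGTG TTTTTAAAAG ATTTAATATA"
print(f"{gc_percent(first_row):.1f}% GC in the first 60 bases")
```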
 
Protein sequence
MSKEMKAPEK IKSVFKRFNI KTINKQIRLN IVFGLIVLIS TMLLGYLSFD MISDAMIRHA 
GADNLELVKQ ITKNIETVMT GFDDISNEIL TNENFDRLVK MHVTLDDEHK KAANRRSIES
ILNGYTNTRT DIADIAVVTN TGEYITSGET RPLVTDNALS YYVVKRFKQS GRDSLWLDTY
QTEVASTGTH TGNQLVISNI KSIKGENNEE IGMLILNVKE SYIYSLISEI KLPDEGQLYI
VGKDGNYVMN PFNRLQNGKV DYVKYELYIE EILKKKNGTF IKKIDGRDYL LAFQTIDSIN
GIELGWTVFG MTPVDIITSG IESTQDILYE IGLICVIAGF VISLLITRLY NAHLEKRYER
KHSIIMERER LASLGQLMGG IAQSFKAPIM SISDGLDELN SLVDEYEKSI ENENVSDETR
HEIASRMREC LDKIKPHCSY ISDEISAVKG QAVNFNDSTD GIFTVDELIK NVKLLMSHEI
KFWNCEMNVE LKVSGDTSIR GEINNMTQVM NNIITNAIEA YNGKGGKIDL IFSKKGHNLE
ITVRDYGCGI PESVKSKLFK EMVTTKGSKG TGIGVYMAYS TIKGKFGGTM TIDSKEGKGT
SVNITIPLKD KDFTPP
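
The protein sequence is the translation of the gene sequence. The Python sketch below illustrates this on the first row of the gene sequence. It builds the standard codon assignments from the compact NCBI-style amino acid string; translation table 11 uses the same codon-to-amino-acid assignments as the standard table and differs only in which codons may act as start codons. `translate` is a hypothetical helper, and stopping at the first stop codon is an assumption of this sketch.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G,
# with the first base varying slowest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from this record.
first_row = "ATGTCAAAAG AGATGAAAGC GCCGGAAAAA ATAAAAAGTG TTTTTAAAAG ATTTAATATA"
print(translate(first_row))  # MSKEMKAPEKIKSVFKRFNI
```

The output matches the first 20 residues of the protein sequence listed in this record. Pasting all rows of the gene sequence into `translate` extends the same check to the full protein.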