Gene Cthe_2545 details

Gene Information

Locus tag: Cthe_2545
Symbol:
ID: 4809301
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3013237
End bp: 3014322
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 44%
IMG OID: 640107961
Product: signal transduction histidine kinase regulating citrate/malate metabolism
Protein accession: YP_001038940
Protein GI: 125975030
COG category: [T] Signal transduction mechanisms
COG ID: [COG3290] Signal transduction histidine kinase regulating citrate/malate metabolism
TIGRFAM ID:
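
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical, not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3013237
end_bp = 3014322
protein_length_aa = 361

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not part of the protein.
expected_protein_aa = gene_length_bp // 3 - 1

print(gene_length_bp)                            # 1086
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions, the reported gene length (1086 bp) and protein length (361 aa) agree with the coordinates.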


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.290644
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAGAAA TCTTGTTTGG TACGGCGACA GCGGTCAGTT TCGCCGGAAT GGAACGCACA 
CGTAAAAATT ATATTGCGGT CGGCTGCCTA ACTACGGTTT TATTTTTTTT GCAGGTCATA
TGCTTAAATG CATGGGATAT TGATGTGACA TTTAAATTGT ATCCGCTTCT GTCCCACTTA
CCCATTACTG TTTTCATAGT GGCATATTTA AAGCGTCCAT GGCTGATTTC ACTAACCAGT
GTGCTTGCAT CCTTTCTGTG CTGTCAGCCT CCCCGTTGGA TCGGTACCGC CCTTGGCGAA
GTTTTTGACA GTGTTTCCAT AAATCACGTC AGCTATATTG CCGCCGCGTT TTTAACATAC
TGTTTCCTTC GAAAATATGC AGTGACATCA GTTCGGCATC TGATAGAACG TTCTGTCAGT
TCCTGCCTGC TTTTGGCGCC ATGCCGGCTT TTTACTATCT GTTCGAATAT GTTGGATTCC
CAGTTTAAGC AGGCACAAAA AGAATTTGAA TCACTACGGC AGATGCAAAA AACTACCGCA
TCCTACCGAC ACGATATGCG TCATCATTTT GCTCTTCTTC AGAGTATGGC ATCCAAAGGA
AACATGGAAG ATATCAAAGA ATATCTGCAG ACCGTGCAGT CTGACTTGGA CGCCATTACT
CCTGTACGTT TTTGTGAAAA CGAAACTGTA AATTTAATTT TGTCCTCTTT TGCCGCCAAA
GCGAAACAGT CGGAAGTCAG GCTGACCATA GATGCAAAGC TACCGGACTC TTACCCTTTC
AGCGACACCG AGCTTTGCTC CCTCTTGTCA AACACCCTGG AAAATGCCAT ACATGCATCA
AAGCAAATAA CCGACATCAG TAAACGCATT ATACATCTGC GCATGTTTTC CAGGAACAAC
AAGCTGTGCA TTGACATTCG CAACAGCTAT CAAAAAGAGC CGGTTTTTCA TCATGGTCTC
CCGGTGTCAA AAGAGCAGGG ACATGGCTTT GGCACAAAAA GCATGGCTCA TATCGTGGAA
AAGTACGGCG GTGTATTTCA ATTTTCAGTC AAGGATGGTT GGTTTATATT TCAAGCCACA
ACATGA
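
A GC content figure like the one reported for this gene is usually the share of G and C bases in the nucleotide sequence. The sketch below shows that calculation under the assumption that the percentage is (G + C) / total bases. How the source database rounds or computes its value is not stated in this record. The function name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # The sequence on this page is printed in space-separated blocks of ten,
    # so all whitespace is removed before counting.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence above, copied with its spacing.
first_line = "ATGATAGAAA TCTTGTTTGG TACGGCGACA GCGGTCAGTT TCGCCGGAAT GGAACGCACA"
print(round(gc_content(first_line), 1))
```

Passing the full gene sequence instead of the first line gives the whole-gene value to compare with the reported GC content.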
 
Protein sequence
MIEILFGTAT AVSFAGMERT RKNYIAVGCL TTVLFFLQVI CLNAWDIDVT FKLYPLLSHL 
PITVFIVAYL KRPWLISLTS VLASFLCCQP PRWIGTALGE VFDSVSINHV SYIAAAFLTY
CFLRKYAVTS VRHLIERSVS SCLLLAPCRL FTICSNMLDS QFKQAQKEFE SLRQMQKTTA
SYRHDMRHHF ALLQSMASKG NMEDIKEYLQ TVQSDLDAIT PVRFCENETV NLILSSFAAK
AKQSEVRLTI DAKLPDSYPF SDTELCSLLS NTLENAIHAS KQITDISKRI IHLRMFSRNN
KLCIDIRNSY QKEPVFHHGL PVSKEQGHGF GTKSMAHIVE KYGGVFQFSV KDGWFIFQAT
T
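
The protein sequence can be checked against the gene sequence by translating the nucleotides codon by codon. The sketch below is a minimal, dependency-free translator. It assumes the standard codon assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons). This gene starts with ATG, so no special start handling is attempted. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    # Remove the block spacing used on this page and normalize case.
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGATAGAAA TCTTGTTTGG TACGGCGACA"))  # MIEILFGTAT
```

The printed residues match the first ten residues of the protein sequence above. Translating the final six bases (ACATGA) gives T followed by a stop codon, consistent with the last residue shown.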