Gene Cthe_3062 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3062 
Symbol 
ID4809933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3596342 
End bp3597724 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content38% 
IMG OID640108483 
Productsignal transduction histidine kinase regulating citrate/malate metabolism 
Protein accessionYP_001039451 
Protein GI125975541 
COG category[T] Signal transduction mechanisms 
COG ID[COG3290] Signal transduction histidine kinase regulating citrate/malate metabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA AGGTTATGCT AAAGCAAAAG CTTTTAAGGG TATCTATTAT TTTAATTGCA 
TTATTTACAG CCGCGATTTT TTTCGTCACA AACATGAAGG TTACAAATTT AGTGGAGAAA
AGCATAGCAG CAAAATTGGA CAACATTTCA AGCCTTGGGC TTGATATTAT TGAAACCAAA
TACAGCGGAG ATTGGAATGT GAAAGGTGGA AGGCTGTATA AAGGAGAGAA TTTGATTAAC
AATGATTCCC TGATTTTAGA CGCAATTAAG GAGAAAACCG GAGCAATCGC GACTGTATTT
CTTGGAGATG AAAAAATTGC CACAAGCGAG CTTGACAGCG ACGGTATAAG GCCAATCGGG
GGCAAGGCGT CAAGTGAAGT GGTTGAAAGT GTTTTGCAAA AAGGTGTTGT ATATACAGGA
ACAGAGAATA TTCTTGGCGA AACATATGCC GTAAAATATG TGCCTTTAAA AGACCGCAGC
GGTGAAGTTT TAGGAATGTG GTTTGTCGGT ATGCCAAAAA GCAATATTGG CAGCAAGGAC
AGCCAGATAC TTATCATGAG GATTTCAATT GTTGTTATTT CCGTATTGTG CGGTATATTG
GGCTGCGCAT TGTTAATGCT TTATGTAAAG AAGTTTTTAA ACGATATAGA TACCCACAAA
GTTTCATTCC TTGAATCCAA TTCAAGCGGT AATAAAACTC AGCGCAAAGT TGTAATGTTA
TCATTATTCC TTATAGGCAC GTTTTTCCTG ATTTGGTTTA CTGTACAGGG CTTCACAATC
GGTAATGTGG TCAACAACCT TGAAGACAGT AATATAAAAG ACAGGCTGAA TGCGTGTTCA
CAACTGGGAG AAATGTTAAT TAATGAAATG CATAAAGGGG AGTGGACAAT TAAGTTTAAC
AAACTTTATA AGGGTTCCCT TCTTTTGAAT GATGACACTT CAATAGCGGA AAAAATTAGT
TCCGATACAG GATTGCTTTC AACTGTTTTT ATGATGGACA CAAGAATTGC AACAAACATT
TCAAAGGATG ACGGCACAAA ACCCATAGGA GCCAAGGCTG CGAGTGAAAT TGTAGAGACT
GTCTTAAAGC AGGGCAAGGA ATATATCGGA GAAATCACTG TTGCCGACAA GAAATGTATA
GAAAAGTATG TTCCGATAAA AGACAGCACC GGCCAGACAA TCGGTATGTG GTCCATAGGA
ATCGAGAGGA AGGTAACTGC AAGACAAATA AGGGATTTAA GAAAGGCTAT ATCTCAAATA
AGCTTGTTGG CTATATTGGT GTCATTAGGA GCGTTTTTAT TCCTGTCGGT AAGATTCGTA
TCGGATATAA GAAATTATAA TGTATGTCTG AGTACAAAAG TCAGTGAAGA AAGTTCTTAT
TGA
 
Protein sequence
MKKKVMLKQK LLRVSIILIA LFTAAIFFVT NMKVTNLVEK SIAAKLDNIS SLGLDIIETK 
YSGDWNVKGG RLYKGENLIN NDSLILDAIK EKTGAIATVF LGDEKIATSE LDSDGIRPIG
GKASSEVVES VLQKGVVYTG TENILGETYA VKYVPLKDRS GEVLGMWFVG MPKSNIGSKD
SQILIMRISI VVISVLCGIL GCALLMLYVK KFLNDIDTHK VSFLESNSSG NKTQRKVVML
SLFLIGTFFL IWFTVQGFTI GNVVNNLEDS NIKDRLNACS QLGEMLINEM HKGEWTIKFN
KLYKGSLLLN DDTSIAEKIS SDTGLLSTVF MMDTRIATNI SKDDGTKPIG AKAASEIVET
VLKQGKEYIG EITVADKKCI EKYVPIKDST GQTIGMWSIG IERKVTARQI RDLRKAISQI
SLLAILVSLG AFLFLSVRFV SDIRNYNVCL STKVSEESSY