Gene Cthe_1599 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1599 
Symbol 
ID4809590 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1930261 
End bp1931931 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content39% 
IMG OID640107017 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_001038018 
Protein GI125974108 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.928385 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA GTATTTATAA AAGCATGCTG GTATTGGCCC TTACAACTAT TTTACTTACC 
TCGTTTCTTA TAACCGGAGT GATGTACCGG GCATTCTATC TCAGGATGCA GCAAGAAATC
AGGAATGAAG CCATTTTTAT ATCATCCGCT TACAATCTAA TCGGACAAGA GTTTTTTTCT
AGCATTGCAG ATCAAGAAAG CTCTAGCAGG ATTACATGGG TAGCTAGTGA CGGTACTGTT
TTGTTTGATA ATATGGCTGA TGCAGAAAAG ATGGAAAATC ACCTCAACAG GCCGGAAATT
GCCGATGCGT TGAAAAACGG ATTTGGTGAA GCAGTTCACC TTTCCAAAAC TCTAGGAACT
CAGACCTTTT ATTGGGCTGT CCGACTTAAT GACGGCACAG TCCTAAGAGT ATCAGCCACG
ACTAATAGTG TTTTTAAATC TGTTTTAGGT TTTTTTCCGT ATGTCGCTTT AATCACATTG
ATGGTTATCT TGCTGACCAT GATCATTGCC AACCTGTTGA CGAAAAAAAT CTTTCTTCCT
TTGAACAATT TAAATTTGGA GGATCCGTTG TCCAATGATG TCTATGACGA ATTGTCTCCA
TTGCTAATTC GCATGGCTAA ACAAAATGAT CAGATTAAAA GTCAGTTCAA AAAACTGAAA
GAGCAGAAGG AAGAGTTTAA TGCAATTACG GAGAATATTA GAGAGGGAAT CATAGTTTTA
AACAACAAAG GCTTGATATT ATTCATAAAC AAAAGTGCCG CAGACATATT CAATGTCAGT
ACTCAGGATA TAATTAACAA GCATATATTA ACACTTGATC GAAGCATAAC TCTTCAAAAG
GCAATAGAGA CAGCTATGGG AGGGCATTTA TTTGAGGATA TATTTGCCAT AGGTGAAAAT
TCTTTTAATT TACTGGCTAG CCCTGTTAAG GATGAAGTGG TTGTCAAAGG AGTTATTCTG
TTTATATTGG ATGTAACAGA AAAACAATCC GCCGAAAAAA TGCGCCGTGA GTTTGCCGCT
AATGTGTCGC ATGAACTTAA AACGCCTCTC ACCTCTATTT TAGGTTACGC AGAGCTTATG
AAAAGCGGCA TGGTAAAACC TGAAGACATT TCCGAGTTTT CCGACCGCAT ATACAACGAA
GCAAGGCATC TCATAGACTT GATAGAAGAT GTGATACGGA TCTCCAGACT GGATGAAAAA
AATGTTCAGC TCCCCTTTGA AGAGATTGAT TTATTGGAAT TGGCAAAAGA AACAGTCGGC
AGATTATCTT CCCTTGCACA GCAGAAACGG ATAAAGCTAT CAGTCGGCGG TGATAACGCG
ATCATTTTTG GTGTTAGACA AATTCTGGAG GAGATGATCT ATAATCTTTG TGATAACGCA
ATCAAATACA ATTATGAAAA CGGCAAGGTT GATGTAAATG TAAAAACTTT CTCCGACCAG
GTTGTACTAA CCGTAGCCGA TAATGGCTTT GGCATTCCGA GGGAGCATCA AAGCCGCGTG
TTTGAACGCT TTTATAGAAT CGACAAGTCC CATTCAAGGG AAACCGGTGG AACTGGTCTG
GGCCTTTCTA TTGTAAAGCA CAGTGCCGAA TTCCATAATG CAAAGATTCG ATTGATGAGC
AAGCCTGGAA AGGGTACAAC GATTACAGTT ATATTTAGTC GTGAACAATA G
 
Protein sequence
MKKSIYKSML VLALTTILLT SFLITGVMYR AFYLRMQQEI RNEAIFISSA YNLIGQEFFS 
SIADQESSSR ITWVASDGTV LFDNMADAEK MENHLNRPEI ADALKNGFGE AVHLSKTLGT
QTFYWAVRLN DGTVLRVSAT TNSVFKSVLG FFPYVALITL MVILLTMIIA NLLTKKIFLP
LNNLNLEDPL SNDVYDELSP LLIRMAKQND QIKSQFKKLK EQKEEFNAIT ENIREGIIVL
NNKGLILFIN KSAADIFNVS TQDIINKHIL TLDRSITLQK AIETAMGGHL FEDIFAIGEN
SFNLLASPVK DEVVVKGVIL FILDVTEKQS AEKMRREFAA NVSHELKTPL TSILGYAELM
KSGMVKPEDI SEFSDRIYNE ARHLIDLIED VIRISRLDEK NVQLPFEEID LLELAKETVG
RLSSLAQQKR IKLSVGGDNA IIFGVRQILE EMIYNLCDNA IKYNYENGKV DVNVKTFSDQ
VVLTVADNGF GIPREHQSRV FERFYRIDKS HSRETGGTGL GLSIVKHSAE FHNAKIRLMS
KPGKGTTITV IFSREQ