Gene Cthe_2814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2814 
Symbol 
ID4809651 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3321895 
End bp3323934 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content38% 
IMG OID640108234 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_001039206 
Protein GI125975296 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000129021 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGATACAA AATGGAAAAA TATTAAGTAT TCGAACATTA CAAAAGTGAT TGTGATTTTT 
CTGGCATGGC TGAGTTTTGT GTGTGCTTCT GAAAGCGTTT TTTTCGTGGT GACAAACGGA
AACACTGTTG AATATGCCAG CTATTACGAT TATCCTGAAT TTATATCCGG TTTTTATACT
TGTCTTGAGA GCGTTGCAGA TTATTATATT TGGGTAAAAG ATGTTGACGG CCCGGATGAT
ATTATGGTTT TGGAAGACGA AAGCATTGCC GACAAGATGT TTGCTTATTA CAATGCCAAA
CCGACAATTT CCGGGTTGGT GAACTTTGCT TATTATATTA AGAATAACAC TACAGGTGAA
ACTTTTACAA ATATAAAATA TGAGGAACCG GTTGAGTTGT TAAAGAAGCA ACCTACTTAT
ACGTATATAG ACAGGTCGCA TGTGCAAAGC ATGAAGCCGT ATAAGACAAG TCATGTAATT
GAGGTAATAA GGAAAACGGA AATGGCATAT GAAATTCATG CAGCAATTAT AGAGCCGTTA
AAGCCGGGGG ATAAATTTTA CGATGATTTT GTCCGTTTTA ACAGAGTGAA GCATCTCTGG
GATTTAATGG TAGTTGTGTT TGTAGTAAGC TCCATACTGT TTATAGGCAG CCTGGTGTTC
CTATTTAATG TTACATGGGA AAGGATAAAA TATCAGAACC GGAATTTGGC TTTTATAGAC
AAAATGCATA TGGATATATA CACATTGGGC GTATTAATTT TAACCGCAAT AGGAATGTCT
ATTTTTTGGA ATGTGTCATG GGATACATTG AATAAACCTT CTTCCACCTA TACATTGGGA
AATATCTTCA TTGGTGTCAC TGTACTGAGT TTGATCTTTA TAATCTGTCT GTCCTGTCTC
TTGTCTTTTG TTCGCCGGGC AGCGAAAGGA AAGTTGCGGA ATAGTTTTCT TTTTGTCATG
ATTTTAAGAA AAATTGGGGA TTTCATTAAA CAACTTTTCA GCGGGAAAAT TTTTAAAGGA
TGGATACTGT TTTTCCTTTT CATCTATGTG GCTATAGACT GCATACTATT TACAATGTTT
GTTGACCAAT TTGTCAATTA TGGCTTCTGT AAAACAGTTG CAACTTTGGT TTTCCTGCTG
GTATTTATAA ACGCATTGGT TTTTGTTTTT ACTGCCAAGG CATTAAGGTC TCTTGCGGCA
ATTATCGAGG GTACTGAGCA GATATCCCGG GGTAACCTCG ATTATGAAAT GAATATTGGC
AAAATGTCGC CTGTGTTTGT CTCTTTTGCC CAAAATATTT CCAATATCCG CGGCGGGCTG
AAAAAAGCGG TGGAAGAAGC AATAAAGGGA GAACGCATGA AGACGGAGCT GATAACCAAC
GTGTCCCATG ATTTAAAAAC ACCGCTCACT TCTATAATAA ATTATGTGGA CCTGCTCAAA
AGAGAAGAGA TGGGAAGTCA AAAGGCAAAG GAATACATCG GTATTCTTGA AGAAAAGTCC
GCAAGGCTTA AAGTACTTAT TGAGGATTTG GTGGAGGCAA GCAAGGCGTC CAGCGGGAAT
CTGGCAGTCA ATTTTGAAAA AGTGGATCTT CATGAGCTGG TGTTGCAAGC ACAGGGAGAA
TATCAGGATA AAATGGAAAA ATCAGGACTT GATATACGAA TCAGTGCTGA AGATAATAAT
ATTTTTGTCC GGGCCGATGG AAGGCATATG TGGAGGATAA TAGAAAACCT TATGTCGAAT
GTTTTGAAAT ACTCGCTTCA AGGTTCCAGG GTTTATATCG ATATAACCAG AAATCAAACT
GACGGAGTTC TTGTTATAAA AAACATATCT GCTACTCCGT TAAATATACC GGTGGAACGA
TTGACGGAGA GATTTGTAAG GGGCGACGAG GCAAGGACGA CGGAAGGCTC GGGACTTGGT
CTGTCCATTG CCCAAAGCTT GACTACTTTG CAGAAAGGTA AATTTGACAT AGAAATAGAC
GGGGATTTGT TCAAAGTAAT TGTACAAATG CCGTTATGGG AAGCTTCTTT TATTTCATAA
 
Protein sequence
MDTKWKNIKY SNITKVIVIF LAWLSFVCAS ESVFFVVTNG NTVEYASYYD YPEFISGFYT 
CLESVADYYI WVKDVDGPDD IMVLEDESIA DKMFAYYNAK PTISGLVNFA YYIKNNTTGE
TFTNIKYEEP VELLKKQPTY TYIDRSHVQS MKPYKTSHVI EVIRKTEMAY EIHAAIIEPL
KPGDKFYDDF VRFNRVKHLW DLMVVVFVVS SILFIGSLVF LFNVTWERIK YQNRNLAFID
KMHMDIYTLG VLILTAIGMS IFWNVSWDTL NKPSSTYTLG NIFIGVTVLS LIFIICLSCL
LSFVRRAAKG KLRNSFLFVM ILRKIGDFIK QLFSGKIFKG WILFFLFIYV AIDCILFTMF
VDQFVNYGFC KTVATLVFLL VFINALVFVF TAKALRSLAA IIEGTEQISR GNLDYEMNIG
KMSPVFVSFA QNISNIRGGL KKAVEEAIKG ERMKTELITN VSHDLKTPLT SIINYVDLLK
REEMGSQKAK EYIGILEEKS ARLKVLIEDL VEASKASSGN LAVNFEKVDL HELVLQAQGE
YQDKMEKSGL DIRISAEDNN IFVRADGRHM WRIIENLMSN VLKYSLQGSR VYIDITRNQT
DGVLVIKNIS ATPLNIPVER LTERFVRGDE ARTTEGSGLG LSIAQSLTTL QKGKFDIEID
GDLFKVIVQM PLWEASFIS