Gene Cthe_0881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0881 
Symbol 
ID4810499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1057225 
End bp1058355 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content37% 
IMG OID640106297 
Productdiguanylate cyclase 
Protein accessionYP_001037308 
Protein GI125973398 
COG category[T] Signal transduction mechanisms 
COG ID[COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000228326 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGATA TAATAAATGG GCTTTTAAAG AAAAAAAAGA CTGGTAACTT TCGCCAACAG 
AGAAATTTTG TTTACTTAAG ATGGGTAGCT CTGTTATTGG TGATACCTCT TTTTTTTCTG
GTTTCTGATT GCCCGCGGAC AGAGAAAAGT TTCTGGATTA CCTTTTCAGT GGCAATTCTG
TATAATGGAT TTCAAACTTT GCTCATTTTA TTCAAGCGTC CGAACGGTTG GGTAAAGAAA
TTCATAAGCA TTGCCTTTTA TTTTGATATT ATGTTTATTT GTGCTTTTTC CTACATATTA
AACGGTATTG AGTCGGATAT ATACATTCTC ATATTTTTTG TAATTTCATA TTACGGCATT
GGCAAAGATG TTTCGAGTAC CATAAATATC AGTATTTTTA GCATAATACT TTACACGGTT
TCGTCAATTG CGGTAAAAGC GGATAATATT GGGGAATTGA ACTTTTTAAA ACTCATAATC
AGGGATTTTT TCATTCTTTT GGTGGCATAC GGTGTGTCAA TGGTTATCCT GGAAGTAAAA
AAATATGACG AAATGCACCA AAGAGAGTTT AAGCTCGCCA GGACGGATAA GCTTACAGGG
CTTGCCAACA GGCATATGCT GGATCAGAAA CTCGAGGAGG AAGCTCTTTA CTGTGAGTAT
TCAAAAAAGC CTTTAAACGT TCTTATGTTT GATATTGACG ATTTTAAAAA ATTTAACGAC
ACTTACGGTC ACATTTGGGG TGATAAGCTT CTTTCGCTTT TTGGCGACAT AATAAAGCAA
AGCATTCGAA AGACTGATAT GGCTTTTCGG TATGGTGGGG AAGAGTTTAT GGTTCTCATA
AGGGACCTTG ACCTTGAGAA AGCCAAATGC GTGGCAGACA GAATAAGGTG TCAGCTTGAA
AAACAGAATC TGTTTTCCGA TGAAGGCCAT AACATGGGGA AGGCGACTGT AAGCTGCGGG
ATTGCACAAT TTCCGACGCA TTCCGATGAT ATAAAAAAGG TTGTCGATTA TGCCGACCGG
GCATTGTATT ATGCAAAGAA AATAGGAAAA AATATTGTTG TAAGCTATGA TGAAATAGGC
AAACTTACAG AAACGGTACA GATTAACGCA GATACTTATT TAAGCAAGTG A
 
Protein sequence
MNDIINGLLK KKKTGNFRQQ RNFVYLRWVA LLLVIPLFFL VSDCPRTEKS FWITFSVAIL 
YNGFQTLLIL FKRPNGWVKK FISIAFYFDI MFICAFSYIL NGIESDIYIL IFFVISYYGI
GKDVSSTINI SIFSIILYTV SSIAVKADNI GELNFLKLII RDFFILLVAY GVSMVILEVK
KYDEMHQREF KLARTDKLTG LANRHMLDQK LEEEALYCEY SKKPLNVLMF DIDDFKKFND
TYGHIWGDKL LSLFGDIIKQ SIRKTDMAFR YGGEEFMVLI RDLDLEKAKC VADRIRCQLE
KQNLFSDEGH NMGKATVSCG IAQFPTHSDD IKKVVDYADR ALYYAKKIGK NIVVSYDEIG
KLTETVQINA DTYLSK