Gene Cthe_2189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2189 
Symbol 
ID4810905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2607494 
End bp2608741 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content41% 
IMG OID640107595 
Productdiguanylate cyclase with GAF sensor 
Protein accessionYP_001038584 
Protein GI125974674 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCAAAGA AGACTTCAAA TAAAAATAAA AGTAAAAAGA AGCCCGGTTT TTTTACTTTA 
TTGCTTTTTT CCCTAATTGC CGGTGCAGCT TTGGTCTGGC CTACTCACAT ATTGTTTTCC
TATTACGCTG AAAATCCTCT TCCCAATGTT CCTTTAATCT TGTTTGCTTC AATACACAGC
GGAATTTTGT ATGTTATAAT ATTTCAGGTC TCAGGCATAT TGATAAGAAA TTATTCCGTG
GAGCGTGCAA ATGAAAAGGA ACTTAAAAAC TTTAAAGATT TTACGGACAC AATTCACAGG
GCAGTAACGG AAATCGAAGC GTATCAAACC CTCTACGAGT TTATACAAAA AATTCGTGTG
AGCAGCCGCA TAACACTGTT CTACCGAAGG GAAGTATCTT CAAGTGAAAT AGTATGGGAG
AGACTTACAA AGGAAAGATT CCCTCTTTGC ACCATGGAAC CCAGGAACTG TCCTGTGGTC
AAGTATGGGC GGGAATGTCT GGTAAAGAAC ATTGCCACAG ACATACCATG TGCCAACCAG
CTTCCGGAGC ATAAATCAGG AAGCTACATT TGTCTGCCGA TAACCGAAGG AAATATAACA
TTGGGAATTT TACAGCTTTA TTCAAAATCA AAAAATTACT TTGATGAAGC TCTAATATCG
AAAGTCAAAT CTTACATAGA AGTCGCCAAG CCGGTTATCA GCAGTAAAAG GACATTGCAC
CAGCTTAGAA AGAATGCCAC CACTGACAAA CTTACCAAGC TTTACAACAG GAGATTTCTG
GAACAGTATC TTGACAATCA GCTTGAAATT ACCGGCTTCT CAAATCAAAA ATTAAGCGTC
ATCATGATGG ACATTGACAA CTTCAAACGG ATAAACGATA CTTACGGACA TAATGCCGGC
GATGCCGTCC TGGTTACATT TGCACGAGTT ATTTTGCGTT GTTTAAGGCA AACCGACCTT
GTTGCCCGTT ATGGAGGAGA GGAATTTATA GCAATACTTC CGTCCTCGGA CACCAAAAAC
GCCTATGACA TAGCAGAACG TATCAGGGAA TCCATAGCTG CGGAACCCAT GCCCAAAATA
AATGATGTCC AGCTCCCCAA TATAAGCTGC AGCTTCGGAG TCTCAACCTA CCCCACCCAT
GCAAAAAACA AAAATGACCT GATAAAAGCT GCGGATATAG CCTTGTACAA AGCAAAACAG
GCAGGCAAAA ACCGAGTCAT CACCTACTCT GAGGGTATGG AAATGTAG
 
Protein sequence
MSKKTSNKNK SKKKPGFFTL LLFSLIAGAA LVWPTHILFS YYAENPLPNV PLILFASIHS 
GILYVIIFQV SGILIRNYSV ERANEKELKN FKDFTDTIHR AVTEIEAYQT LYEFIQKIRV
SSRITLFYRR EVSSSEIVWE RLTKERFPLC TMEPRNCPVV KYGRECLVKN IATDIPCANQ
LPEHKSGSYI CLPITEGNIT LGILQLYSKS KNYFDEALIS KVKSYIEVAK PVISSKRTLH
QLRKNATTDK LTKLYNRRFL EQYLDNQLEI TGFSNQKLSV IMMDIDNFKR INDTYGHNAG
DAVLVTFARV ILRCLRQTDL VARYGGEEFI AILPSSDTKN AYDIAERIRE SIAAEPMPKI
NDVQLPNISC SFGVSTYPTH AKNKNDLIKA ADIALYKAKQ AGKNRVITYS EGMEM