Gene Cthe_0408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0408 
Symbol 
ID4808411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp509847 
End bp512435 
Gene Length2589 bp 
Protein Length862 aa 
Translation table11 
GC content39% 
IMG OID640105822 
Productdiguanylate cyclase/phosphodiesterase with PAS/PAC sensor(s) 
Protein accessionYP_001036839 
Protein GI125972929 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain
[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTCA AGTGTTTTGG AATAAATCTT TGTGTTTTTT CACTACTTCT GATATTTTGT 
CAGGTTGTTT ATGCTAAAAC TGACACGATT ATTTATGAAA CAGAGTTATA CGGTCCTCCG
TTTAAGTTCA TAGAGGACGG TGAAATATCG GGGTTTGAAA TCGAACTGAA CCAGTACATA
TTTTCAGGCA GTGAATACAG GTTTGATTAT AGGTTCAACA CCTGGGAAAA AGTTTATGAA
AAACTAAAAA ACGGAGAAAT TGATACCTGT GGTCTTCTTG TTGTAAATGA AGAAAGGAAA
AAAGACATAT TGTTTTCCGA TACGGTTATG AATATTTATA TTTCCATTTA TTCCAAGGAA
AAAAATAAGA ATATTGGAAT AAAAGATCTT GAGAAATATC GTGTGGGTGT GGGAAAAGAG
CAATACAGTG AGCATATTTT GAAAGACAGT GTGGGCATTA GCAATTACAC AACCTTTGTG
GATGTGGAAG AAGCGATAGA TGCTTTAAAT GAGGGAAAAA TTGATGTTAT TTTTGAAAAT
CAGGACGTCG TAAATCATTA TTTGATTAAA AAAGGTTTAA CCGGCAAAAT AATTCCCCAC
AAAACGGAGC TTTTTCCCGT CAAAGTTGCT TACGGTGTGA GTAAAAGAAA TCCCGAACTT
GTCAAATATA TTAATGAACG ACTGGATAGT GTGAAAAAAA ACGGTATATA TGAACAGCTT
TATAGAAAGC ATTTCCTGCG GCCTTCAGAT TTCTACAGAA GAAAACAGAG AATCAGAAAT
TTTGCCGCAA TGCTTGCACT GCTTGTCCTT TTGGCACTTT TGCAAGTTTA TATAAGGCAC
CTTAAGAAAA AGATATCAAA AGCATACCGT GAACTTCGCA AACAGCATGA ATGGCTGAGA
ATAACGCTCT CAAGCATTGG AGAGGCGGTA ATTACAACGG ATGAAAACGG AACCGTTACT
TTCAGCAATT ATGAAATTCA AAAGATGTTG GGCCTTTCTG AAGAAGAAAT ACTAGGCAAG
AAATTGGACA AGCTGCTTTC GGGATTGGTG GATAAAAGGG AAAAGGTTTA TAAAATTCCT
ATTGAAGAAG TTGTAAATCG GGGTAGCATG ATAAAACTTG AGACTGATTT GAGTCTTGTA
ACTCCGGGTG GGAGAAGACT TGTGGAGGGC ACTGTCGCAC CAATAAGAAA TGATTCGGAT
GTGATCATTG GGACAGTTGT TGCATTAAAG GACATTACGG AAATAAAGAA GAAAGATGAA
ATCCTGTACA ACATGGAGTA TTATGACCCG CTGACAGGGC TTCCCAACAG AAGTCTTTTT
TCCGACCGCC TTAAAATGGC TCTTGCCCAG TCGAAACGCA ATAATGAGAT GTGTGCGCTG
ATTATATTGG ATCTTGATAA TTTTAAAGCA ATTAATGATA CACTGGGGCA CTCCGTCGGA
GATATGCTTT TAAAGCAGGT GGCTGAAAAA ATAAAGGGTT ATCTGAGGGA AGTTGATACC
GTTGCAAGAA TAGGAGGAGA TGAGTTCATA ATTATTCAGC CTCAAATAAA AGATATAAAC
GATGCTACCA GAGCAGCGGA CAGAATATTG AAAAAATTTC AGCAACCGTG GATCTTGGAA
GGCAAGGAAT ATTATATAAC TGCCAGTATG GGCATCGGCA TTTACCCCAA TGGCGGAGAG
GATCCGCAAA CTATTTTTAA AAATGCGGAT ACGGCATTAT ACAGAGCCAA AGAGCTGGGA
AGGAATAATT ATCAGTTATA TACCGAGTCG ATGAACCAAA AGGTCCTTCA AAGGCTGGAT
ATTGAAAATA GCTTAAGGAG AGCAATTGAG AAGGAAGAAT TTGTACTGTT TTATCAACCA
CAGATCGATA TCAAAACCGG TAAGATTGTC GGTTTTGAAG CGCTTTTGAG ATGGTATCAC
CCTGATTATG GGCTTATGCC TCCCATGGAA TTTATACCCG TTGCAGAGGA TTCAGGGCTT
ATAGTGGTTA TTGGGGAATG GGTTCTTGAA ACTGCGTGCA GGCAGAACAA AAAATGGATT
GAGTGTGGAT TGGAGCCGCA TTTGATTTCG GTAAACTTGT CTGCAAGACA ATTTCAACGT
TCAAACATTG TTGAAGTGAT TGACAGAATT CGCAGTAGCA CCGGTTTGGC ACCGGAGCTT
TTGGAGCTGG AAATAACGGA GAGCACTGCG ATGCAAGACT TGAGTTTTAC AATAGATGTT
TTGAATCAGT TGAGGAAAAA GGGAATAAGG GTGTCCCTTG ATGATTTTGG AACCGGTTAT
TCATCACTGA ATTATTTAAG ACAGCTTCCT ATAGATACTC TCAAAATAGA TAAAAGTTTT
GTTCAGGACA TAAGGGCCAA CTCAAAAGAA GAGGCTATTG CTAAAACCGT TATCAGCCTT
GCTCACAAGC TTGACCTTAC TGTTGTGGCG GAAGGTGTTG AGACAAAAGA ACAACTTTTA
TTCCTTAAAA AGGAGAAGTG TGACAAGGCT CAAGGATATC TTTTCAGCAA ACCGCTGCCG
GCAGAGGAAA TTGAAAAAAT GTTAAGAGAT AAAAAATGTT TTGTCATCGG CGAGGAAGTT
GACAACTGA
 
Protein sequence
MKVKCFGINL CVFSLLLIFC QVVYAKTDTI IYETELYGPP FKFIEDGEIS GFEIELNQYI 
FSGSEYRFDY RFNTWEKVYE KLKNGEIDTC GLLVVNEERK KDILFSDTVM NIYISIYSKE
KNKNIGIKDL EKYRVGVGKE QYSEHILKDS VGISNYTTFV DVEEAIDALN EGKIDVIFEN
QDVVNHYLIK KGLTGKIIPH KTELFPVKVA YGVSKRNPEL VKYINERLDS VKKNGIYEQL
YRKHFLRPSD FYRRKQRIRN FAAMLALLVL LALLQVYIRH LKKKISKAYR ELRKQHEWLR
ITLSSIGEAV ITTDENGTVT FSNYEIQKML GLSEEEILGK KLDKLLSGLV DKREKVYKIP
IEEVVNRGSM IKLETDLSLV TPGGRRLVEG TVAPIRNDSD VIIGTVVALK DITEIKKKDE
ILYNMEYYDP LTGLPNRSLF SDRLKMALAQ SKRNNEMCAL IILDLDNFKA INDTLGHSVG
DMLLKQVAEK IKGYLREVDT VARIGGDEFI IIQPQIKDIN DATRAADRIL KKFQQPWILE
GKEYYITASM GIGIYPNGGE DPQTIFKNAD TALYRAKELG RNNYQLYTES MNQKVLQRLD
IENSLRRAIE KEEFVLFYQP QIDIKTGKIV GFEALLRWYH PDYGLMPPME FIPVAEDSGL
IVVIGEWVLE TACRQNKKWI ECGLEPHLIS VNLSARQFQR SNIVEVIDRI RSSTGLAPEL
LELEITESTA MQDLSFTIDV LNQLRKKGIR VSLDDFGTGY SSLNYLRQLP IDTLKIDKSF
VQDIRANSKE EAIAKTVISL AHKLDLTVVA EGVETKEQLL FLKKEKCDKA QGYLFSKPLP
AEEIEKMLRD KKCFVIGEEV DN