Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0408 |
Symbol | |
ID | 4808411 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 509847 |
End bp | 512435 |
Gene Length | 2589 bp |
Protein Length | 862 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640105822 |
Product | diguanylate cyclase/phosphodiesterase with PAS/PAC sensor(s) |
Protein accession | YP_001036839 |
Protein GI | 125972929 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTCA AGTGTTTTGG AATAAATCTT TGTGTTTTTT CACTACTTCT GATATTTTGT CAGGTTGTTT ATGCTAAAAC TGACACGATT ATTTATGAAA CAGAGTTATA CGGTCCTCCG TTTAAGTTCA TAGAGGACGG TGAAATATCG GGGTTTGAAA TCGAACTGAA CCAGTACATA TTTTCAGGCA GTGAATACAG GTTTGATTAT AGGTTCAACA CCTGGGAAAA AGTTTATGAA AAACTAAAAA ACGGAGAAAT TGATACCTGT GGTCTTCTTG TTGTAAATGA AGAAAGGAAA AAAGACATAT TGTTTTCCGA TACGGTTATG AATATTTATA TTTCCATTTA TTCCAAGGAA AAAAATAAGA ATATTGGAAT AAAAGATCTT GAGAAATATC GTGTGGGTGT GGGAAAAGAG CAATACAGTG AGCATATTTT GAAAGACAGT GTGGGCATTA GCAATTACAC AACCTTTGTG GATGTGGAAG AAGCGATAGA TGCTTTAAAT GAGGGAAAAA TTGATGTTAT TTTTGAAAAT CAGGACGTCG TAAATCATTA TTTGATTAAA AAAGGTTTAA CCGGCAAAAT AATTCCCCAC AAAACGGAGC TTTTTCCCGT CAAAGTTGCT TACGGTGTGA GTAAAAGAAA TCCCGAACTT GTCAAATATA TTAATGAACG ACTGGATAGT GTGAAAAAAA ACGGTATATA TGAACAGCTT TATAGAAAGC ATTTCCTGCG GCCTTCAGAT TTCTACAGAA GAAAACAGAG AATCAGAAAT TTTGCCGCAA TGCTTGCACT GCTTGTCCTT TTGGCACTTT TGCAAGTTTA TATAAGGCAC CTTAAGAAAA AGATATCAAA AGCATACCGT GAACTTCGCA AACAGCATGA ATGGCTGAGA ATAACGCTCT CAAGCATTGG AGAGGCGGTA ATTACAACGG ATGAAAACGG AACCGTTACT TTCAGCAATT ATGAAATTCA AAAGATGTTG GGCCTTTCTG AAGAAGAAAT ACTAGGCAAG AAATTGGACA AGCTGCTTTC GGGATTGGTG GATAAAAGGG AAAAGGTTTA TAAAATTCCT ATTGAAGAAG TTGTAAATCG GGGTAGCATG ATAAAACTTG AGACTGATTT GAGTCTTGTA ACTCCGGGTG GGAGAAGACT TGTGGAGGGC ACTGTCGCAC CAATAAGAAA TGATTCGGAT GTGATCATTG GGACAGTTGT TGCATTAAAG GACATTACGG AAATAAAGAA GAAAGATGAA ATCCTGTACA ACATGGAGTA TTATGACCCG CTGACAGGGC TTCCCAACAG AAGTCTTTTT TCCGACCGCC TTAAAATGGC TCTTGCCCAG TCGAAACGCA ATAATGAGAT GTGTGCGCTG ATTATATTGG ATCTTGATAA TTTTAAAGCA ATTAATGATA CACTGGGGCA CTCCGTCGGA GATATGCTTT TAAAGCAGGT GGCTGAAAAA ATAAAGGGTT ATCTGAGGGA AGTTGATACC GTTGCAAGAA TAGGAGGAGA TGAGTTCATA ATTATTCAGC CTCAAATAAA AGATATAAAC GATGCTACCA GAGCAGCGGA CAGAATATTG AAAAAATTTC AGCAACCGTG GATCTTGGAA GGCAAGGAAT ATTATATAAC TGCCAGTATG GGCATCGGCA TTTACCCCAA TGGCGGAGAG GATCCGCAAA CTATTTTTAA AAATGCGGAT ACGGCATTAT ACAGAGCCAA AGAGCTGGGA AGGAATAATT ATCAGTTATA TACCGAGTCG ATGAACCAAA AGGTCCTTCA AAGGCTGGAT ATTGAAAATA GCTTAAGGAG AGCAATTGAG AAGGAAGAAT TTGTACTGTT TTATCAACCA CAGATCGATA TCAAAACCGG TAAGATTGTC GGTTTTGAAG CGCTTTTGAG ATGGTATCAC CCTGATTATG GGCTTATGCC TCCCATGGAA TTTATACCCG TTGCAGAGGA TTCAGGGCTT ATAGTGGTTA TTGGGGAATG GGTTCTTGAA ACTGCGTGCA GGCAGAACAA AAAATGGATT GAGTGTGGAT TGGAGCCGCA TTTGATTTCG GTAAACTTGT CTGCAAGACA ATTTCAACGT TCAAACATTG TTGAAGTGAT TGACAGAATT CGCAGTAGCA CCGGTTTGGC ACCGGAGCTT TTGGAGCTGG AAATAACGGA GAGCACTGCG ATGCAAGACT TGAGTTTTAC AATAGATGTT TTGAATCAGT TGAGGAAAAA GGGAATAAGG GTGTCCCTTG ATGATTTTGG AACCGGTTAT TCATCACTGA ATTATTTAAG ACAGCTTCCT ATAGATACTC TCAAAATAGA TAAAAGTTTT GTTCAGGACA TAAGGGCCAA CTCAAAAGAA GAGGCTATTG CTAAAACCGT TATCAGCCTT GCTCACAAGC TTGACCTTAC TGTTGTGGCG GAAGGTGTTG AGACAAAAGA ACAACTTTTA TTCCTTAAAA AGGAGAAGTG TGACAAGGCT CAAGGATATC TTTTCAGCAA ACCGCTGCCG GCAGAGGAAA TTGAAAAAAT GTTAAGAGAT AAAAAATGTT TTGTCATCGG CGAGGAAGTT GACAACTGA
|
Protein sequence | MKVKCFGINL CVFSLLLIFC QVVYAKTDTI IYETELYGPP FKFIEDGEIS GFEIELNQYI FSGSEYRFDY RFNTWEKVYE KLKNGEIDTC GLLVVNEERK KDILFSDTVM NIYISIYSKE KNKNIGIKDL EKYRVGVGKE QYSEHILKDS VGISNYTTFV DVEEAIDALN EGKIDVIFEN QDVVNHYLIK KGLTGKIIPH KTELFPVKVA YGVSKRNPEL VKYINERLDS VKKNGIYEQL YRKHFLRPSD FYRRKQRIRN FAAMLALLVL LALLQVYIRH LKKKISKAYR ELRKQHEWLR ITLSSIGEAV ITTDENGTVT FSNYEIQKML GLSEEEILGK KLDKLLSGLV DKREKVYKIP IEEVVNRGSM IKLETDLSLV TPGGRRLVEG TVAPIRNDSD VIIGTVVALK DITEIKKKDE ILYNMEYYDP LTGLPNRSLF SDRLKMALAQ SKRNNEMCAL IILDLDNFKA INDTLGHSVG DMLLKQVAEK IKGYLREVDT VARIGGDEFI IIQPQIKDIN DATRAADRIL KKFQQPWILE GKEYYITASM GIGIYPNGGE DPQTIFKNAD TALYRAKELG RNNYQLYTES MNQKVLQRLD IENSLRRAIE KEEFVLFYQP QIDIKTGKIV GFEALLRWYH PDYGLMPPME FIPVAEDSGL IVVIGEWVLE TACRQNKKWI ECGLEPHLIS VNLSARQFQR SNIVEVIDRI RSSTGLAPEL LELEITESTA MQDLSFTIDV LNQLRKKGIR VSLDDFGTGY SSLNYLRQLP IDTLKIDKSF VQDIRANSKE EAIAKTVISL AHKLDLTVVA EGVETKEQLL FLKKEKCDKA QGYLFSKPLP AEEIEKMLRD KKCFVIGEEV DN
|
| |