Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0142 |
Symbol | |
ID | 4808700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 180470 |
End bp | 182107 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640105553 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_001036576 |
Protein GI | 125972666 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2206] HD-GYP domain |
TIGRFAM ID | [TIGR00254] diguanylate cyclase (GGDEF) domain [TIGR00277] uncharacterized domain HDIG |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000694732 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTAAAG GAATTGACAA CGCAAACAGT TGCTGGAATG ATTTTCAGCA TGACAAGAAA GAGGGTTGGT TGTCATGGCT GGGAATATCG AATAAAGAGT CTTCCAAATT GTTCTTTATC ATTGCCTCAC TTCATATAAT TATTGTAATG GTTCAGGCGT CAGGTAAATT GCCTGTGGGA TTTCGGGCAG TAACAGGAAT ATTTGAGATC GGAATTTCAA TTTTATTATC CTATCGCTTT GGCTATATTG GTATGTCTTT GTCTCTGATT ACTAATGGTT TGGCGGCGAT GCGTCTTTTT GTCATAGCCA GACAGCTGGA TATGGTGGTG GCAAGCGAAG CCGGTCATTT AACAGGTGAT TTGAGAATAA TAACGGACAG TGCTCCGGGG CTTCTTCTGA ATTTATCGGC GGCAAGGGTT GCTGTAATGA TAGTGTCAAT AATTGTAGCA TACTCCTATG AACAGGAACG CAAATACATA AACAGACTGG AGTGGCTGGC CTGTGTTGAC GGAGTTACCG GAGTGTACAA CCATAGATAC TTCCAGACAA GACTTGGGGA AGAAATTGAG AAAGCAAATT TAAGAAATGG TTCTTTGGCC TTGGTAATGA TTGATGTGGA TAATTTTAAA AAATATAACG ACACCCATGG TCATATAGCA GGAGACAGGC TTCTTACGAA GACTGCCGAA ATATTTAAGG CAAGTGCAAG ACAAGAGGAT ATTGTCTGTA GATACGGAGG AGATGAATTT GTCATATTAA TGCCCGATGC CGATTCTAAA AGCATTATTT CCATGATTCA AAAAATAAGA AAAGAATTTT CAGACTTTTT GGACACCGAA GAGTTTAGAA TACATAGAAA TGAGATCAGC CTGTCCGTTG GATATTCTAT ATATCCGGAG CTTGCACGAA ACAAAGACGA TTTGATTATG CAAGCCGACA GTGCCCTTTA TCAGGCGAAA AACATGGGAA GGAACAACGT GAGAATCTAC AGGGATGTTT TTGAGGATAT AAAGACATTT TTCAACTCAA ACGAACAGCA GCTGCTGGGA GGACTGAGAG CCCTTTTAGG TACGGTATCG GCGAAAGACA AGTATACCCT GGGACATTCG GAACGTGTCA TGGAATATGC CGTAAGGATT GGAAAGGCCA TGGGGCTTAG CAGCGAAAGG CTGCGTCTTC TTAAAATAGC CGCTCTGCTT CATGATATAG GCAAAGTGGA AATTCCCGAA TCCGTGTTGA ACAAAACCGA GCCCCTGACC CCTGCAGAGA TGAAAAATCT GCGGAGGCAT CCGATGTATA GTGTTGATAT ATTGGAACCC TTGTCCAGTA TTGACATGCT GATTGATTCC ATAAAATATC ATCATGAAAG GTATGACGGC AAAGGATACC CTACCGGGAA GAAGGGTAAG GAAATACCGC TTGAAGCCCG GATTTTATCT GTGGCGGATG CCTTTGACGC TATGTTGTCC GACCGTCCCT ATAGAAAAGG AATGAAAATA AATGAAGTAC TGGCTGAGTT GAAAAACAAT TCCGGTACAC AGTTTGATCC TGAAACAGTG GAAGCCTTTC TCAGCACTTT TGATAATTCT GACTGTGACA GTCATAGTAT TAGTCATAGC ATTGATGAAG CAATTTAA
|
Protein sequence | MIKGIDNANS CWNDFQHDKK EGWLSWLGIS NKESSKLFFI IASLHIIIVM VQASGKLPVG FRAVTGIFEI GISILLSYRF GYIGMSLSLI TNGLAAMRLF VIARQLDMVV ASEAGHLTGD LRIITDSAPG LLLNLSAARV AVMIVSIIVA YSYEQERKYI NRLEWLACVD GVTGVYNHRY FQTRLGEEIE KANLRNGSLA LVMIDVDNFK KYNDTHGHIA GDRLLTKTAE IFKASARQED IVCRYGGDEF VILMPDADSK SIISMIQKIR KEFSDFLDTE EFRIHRNEIS LSVGYSIYPE LARNKDDLIM QADSALYQAK NMGRNNVRIY RDVFEDIKTF FNSNEQQLLG GLRALLGTVS AKDKYTLGHS ERVMEYAVRI GKAMGLSSER LRLLKIAALL HDIGKVEIPE SVLNKTEPLT PAEMKNLRRH PMYSVDILEP LSSIDMLIDS IKYHHERYDG KGYPTGKKGK EIPLEARILS VADAFDAMLS DRPYRKGMKI NEVLAELKNN SGTQFDPETV EAFLSTFDNS DCDSHSISHS IDEAI
|
| |