Gene Cthe_0142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0142 
Symbol 
ID4808700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp180470 
End bp182107 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content42% 
IMG OID640105553 
Productmetal dependent phosphohydrolase 
Protein accessionYP_001036576 
Protein GI125972666 
COG category[T] Signal transduction mechanisms 
COG ID[COG2206] HD-GYP domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain
[TIGR00277] uncharacterized domain HDIG 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000694732 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTAAAG GAATTGACAA CGCAAACAGT TGCTGGAATG ATTTTCAGCA TGACAAGAAA 
GAGGGTTGGT TGTCATGGCT GGGAATATCG AATAAAGAGT CTTCCAAATT GTTCTTTATC
ATTGCCTCAC TTCATATAAT TATTGTAATG GTTCAGGCGT CAGGTAAATT GCCTGTGGGA
TTTCGGGCAG TAACAGGAAT ATTTGAGATC GGAATTTCAA TTTTATTATC CTATCGCTTT
GGCTATATTG GTATGTCTTT GTCTCTGATT ACTAATGGTT TGGCGGCGAT GCGTCTTTTT
GTCATAGCCA GACAGCTGGA TATGGTGGTG GCAAGCGAAG CCGGTCATTT AACAGGTGAT
TTGAGAATAA TAACGGACAG TGCTCCGGGG CTTCTTCTGA ATTTATCGGC GGCAAGGGTT
GCTGTAATGA TAGTGTCAAT AATTGTAGCA TACTCCTATG AACAGGAACG CAAATACATA
AACAGACTGG AGTGGCTGGC CTGTGTTGAC GGAGTTACCG GAGTGTACAA CCATAGATAC
TTCCAGACAA GACTTGGGGA AGAAATTGAG AAAGCAAATT TAAGAAATGG TTCTTTGGCC
TTGGTAATGA TTGATGTGGA TAATTTTAAA AAATATAACG ACACCCATGG TCATATAGCA
GGAGACAGGC TTCTTACGAA GACTGCCGAA ATATTTAAGG CAAGTGCAAG ACAAGAGGAT
ATTGTCTGTA GATACGGAGG AGATGAATTT GTCATATTAA TGCCCGATGC CGATTCTAAA
AGCATTATTT CCATGATTCA AAAAATAAGA AAAGAATTTT CAGACTTTTT GGACACCGAA
GAGTTTAGAA TACATAGAAA TGAGATCAGC CTGTCCGTTG GATATTCTAT ATATCCGGAG
CTTGCACGAA ACAAAGACGA TTTGATTATG CAAGCCGACA GTGCCCTTTA TCAGGCGAAA
AACATGGGAA GGAACAACGT GAGAATCTAC AGGGATGTTT TTGAGGATAT AAAGACATTT
TTCAACTCAA ACGAACAGCA GCTGCTGGGA GGACTGAGAG CCCTTTTAGG TACGGTATCG
GCGAAAGACA AGTATACCCT GGGACATTCG GAACGTGTCA TGGAATATGC CGTAAGGATT
GGAAAGGCCA TGGGGCTTAG CAGCGAAAGG CTGCGTCTTC TTAAAATAGC CGCTCTGCTT
CATGATATAG GCAAAGTGGA AATTCCCGAA TCCGTGTTGA ACAAAACCGA GCCCCTGACC
CCTGCAGAGA TGAAAAATCT GCGGAGGCAT CCGATGTATA GTGTTGATAT ATTGGAACCC
TTGTCCAGTA TTGACATGCT GATTGATTCC ATAAAATATC ATCATGAAAG GTATGACGGC
AAAGGATACC CTACCGGGAA GAAGGGTAAG GAAATACCGC TTGAAGCCCG GATTTTATCT
GTGGCGGATG CCTTTGACGC TATGTTGTCC GACCGTCCCT ATAGAAAAGG AATGAAAATA
AATGAAGTAC TGGCTGAGTT GAAAAACAAT TCCGGTACAC AGTTTGATCC TGAAACAGTG
GAAGCCTTTC TCAGCACTTT TGATAATTCT GACTGTGACA GTCATAGTAT TAGTCATAGC
ATTGATGAAG CAATTTAA
 
Protein sequence
MIKGIDNANS CWNDFQHDKK EGWLSWLGIS NKESSKLFFI IASLHIIIVM VQASGKLPVG 
FRAVTGIFEI GISILLSYRF GYIGMSLSLI TNGLAAMRLF VIARQLDMVV ASEAGHLTGD
LRIITDSAPG LLLNLSAARV AVMIVSIIVA YSYEQERKYI NRLEWLACVD GVTGVYNHRY
FQTRLGEEIE KANLRNGSLA LVMIDVDNFK KYNDTHGHIA GDRLLTKTAE IFKASARQED
IVCRYGGDEF VILMPDADSK SIISMIQKIR KEFSDFLDTE EFRIHRNEIS LSVGYSIYPE
LARNKDDLIM QADSALYQAK NMGRNNVRIY RDVFEDIKTF FNSNEQQLLG GLRALLGTVS
AKDKYTLGHS ERVMEYAVRI GKAMGLSSER LRLLKIAALL HDIGKVEIPE SVLNKTEPLT
PAEMKNLRRH PMYSVDILEP LSSIDMLIDS IKYHHERYDG KGYPTGKKGK EIPLEARILS
VADAFDAMLS DRPYRKGMKI NEVLAELKNN SGTQFDPETV EAFLSTFDNS DCDSHSISHS
IDEAI