Gene Cthe_0129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0129 
Symbol 
ID4808687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp157782 
End bp159299 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content40% 
IMG OID640105540 
Productmetal dependent phosphohydrolase 
Protein accessionYP_001036563 
Protein GI125972653 
COG category[T] Signal transduction mechanisms 
COG ID[COG2206] HD-GYP domain
[COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAC ATGTTTCATT GCTCAGGAAA AGATATGTTG ATTTGCGGGA TACATATCTT 
GGCAGTGAGC TGAAAAAAGT TGAGTTGCTT TTTTATATTT CGGTTTCGAT AGCATTGGGG
CTTATACAAT ATAAGCTGCT AAATTATGAA AATACATACA ACATTCGAGC TTATATCAGC
CATATACAAT CATGCATTGT GATTCTTATG GCCTTTCGAT TTGGCTATGT GGGACTGGCC
ACGGCAGTGG TATTGGTTTT GGCTGAAACA ATATTCATAA TTGAAGAATA TTTGGTCAGT
TTCGATAAAT ATCTTTTACT TGGGCTTACG CTGAAATTCT TTACAATATT TGTAACCAGT
TTTATTGCAG TACTTACCAA CAGGCAGCAG ATTCAGAAGA AAAGGCTTGA ACGGATGGCA
ATTACGGATG AATTGACCGG GGCATACAAT CAGAGATTTT TTCACATGGT GCTTGAAAGT
GAGCTTGAGA AGGCAAAAAA TAATAACGGT TCTGTGAGCC TTATAATGAT TGACATAGAT
AACTTTAAAA TGTACAACGA TATTTACGGA CGTGACTTTG GTGACAATAT ATTGAGGACA
ACTGCAACAA TCCTTTCGGA AATTTTGGAC GAAGGCAGCT ATTTGTGCCG ATACGGCGGA
GATGAATTTG CTGTTATTAC TACAAATACC CGGCTCGATA ATTTAGAGGA TATGGCAAAC
AATCTTCGCC GGGAGTTTGA AAGACTTAAA CAAAAATATT ACAAACATAA ATTATACGAA
AAGGTAACAC TGTCCATTGG TTTGTCGGAA TACCCCAACA TGTCGCGGGA CAAAAATGAA
CTCATTTACC AGGCCGATAC AGCCTTGTAT CATGCAAAGA ACCTGGGAAA AGACAAGGTA
CATCTCTATC AGGACGCGTT AATGCAAATA CGCAAAAATA TCAGTTCCGA CCACCAGCAG
CTTATAGGAA TATTCAAGGG ATTGTTGAGT ACCATATCGG CAAAGGATAA ATATACTCAT
GGACATTGCG AGCGGGTGGC GGCTTATGCG GTGCTGATTG CGGAGGCAAT GGGACTGAGT
GCAAAAGAAA TCAGCACAAT TCAGTGTGCC GCTTTGTTGC ATGACATTGG AAAGATAGAA
ATGCCCAGGC ACATATTAAA TAAAAAAGAA GAACTGACTG AAGAGGAAAT AAAATATTTA
AGACAGCACC CTATATATAG TGAAAACATA CTTGAGCCTT TGGCGGACAT GGACAAGCTT
ACCGATTATG TAAGGCACCA TCATGAAAGA TATGATGGCA AGGGTTATCC GGACGGCCTA
AAAGGTAAGG AAATAAGCCT CGGTGCCAGA ATATTGTGTG TTGCAGACTC TTTTGATGCC
ATGGTGTCCG ACCGCCCGTA CAGTAAAAGC ATGTCAAAGG AAGATGCTTT TAAAGAACTT
GAGAAAAATG CGGGAACCCA GTTTGATCCG GAAATTGTAG AGATTTTCAT AAAAGCAATG
AAATCGTATG CCGCATAA
 
Protein sequence
MNKHVSLLRK RYVDLRDTYL GSELKKVELL FYISVSIALG LIQYKLLNYE NTYNIRAYIS 
HIQSCIVILM AFRFGYVGLA TAVVLVLAET IFIIEEYLVS FDKYLLLGLT LKFFTIFVTS
FIAVLTNRQQ IQKKRLERMA ITDELTGAYN QRFFHMVLES ELEKAKNNNG SVSLIMIDID
NFKMYNDIYG RDFGDNILRT TATILSEILD EGSYLCRYGG DEFAVITTNT RLDNLEDMAN
NLRREFERLK QKYYKHKLYE KVTLSIGLSE YPNMSRDKNE LIYQADTALY HAKNLGKDKV
HLYQDALMQI RKNISSDHQQ LIGIFKGLLS TISAKDKYTH GHCERVAAYA VLIAEAMGLS
AKEISTIQCA ALLHDIGKIE MPRHILNKKE ELTEEEIKYL RQHPIYSENI LEPLADMDKL
TDYVRHHHER YDGKGYPDGL KGKEISLGAR ILCVADSFDA MVSDRPYSKS MSKEDAFKEL
EKNAGTQFDP EIVEIFIKAM KSYAA