Gene Cthe_0273 details

Gene Information

Locus tag: Cthe_0273
Symbol:
ID: 4808556
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 337173
End bp: 338777
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 38%
IMG OID: 640105685
Product: metal dependent phosphohydrolase
Protein accession: YP_001036705
Protein GI: 125972795
COG category: [T] Signal transduction mechanisms
COG ID: [COG2199] FOG: GGDEF domain; [COG2206] HD-GYP domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain; [TIGR00277] uncharacterized domain HDIG
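
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon. Neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields.
START_BP = 337173
END_BP = 338777
GENE_LENGTH_BP = 1605
PROTEIN_LENGTH_AA = 534


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
coords_consistent = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_consistent = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the span 337173 to 338777 covers 1605 bp, which is 535 codons: 534 residues plus one stop codon.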


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.576727
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCAATATG ATGAAAAGAG AAGCAAAACT ATAGCCTTTT TTAATAAACA TTTTTTCAAT 
GTTTCATTTG TATTTTTGGT TGGTATATTG GTAATATACA TAATCTATGA AACAGGTCAT
TTTTTTCCAA CATTCCTGTA TATGAGTGCC TTTGCGGCAA TGATGATTAC GGCAAAGTTT
TTCATCAGGC ACAAGGCACT AAAACACTTG CTTGCAGTAT TGTCATATGT GCTGTTTATT
TTTGTTGTGG ATTCGCATTC TTTGCCTTAT CCGTACTATA CCGTAGTCAA CAAATGGAGC
GTTCTTACCA TTTGCTATGT GCTTCTTCTG GAAAATATAC TGTTTCCTCT CATTGCCGGG
CTGTATCCTG CACTTAGAAT ATTCATGCTT ATGCCCGAAG CCAAAGCAGG GTTGTATCCC
ATTGGCAAAG TAATTGGAAC AGCCAACGGC CAAATTTGCT TTACAATTAC ACTTGTCCTC
ATATACCAGT TATTCGGAAA GGTTATTATT GAACGCAACA AATACAAAAA GATGAGTATT
ACAGATTCAC TTACAGGTGT GGCAACCTTT GCCCACACAA TTGAAACTGC CAAAAAAATG
ATCCAAAACG GTAATATTTC AATTCTGATT ACCGATATGG ACCGCTTTAA GCAAATTAAC
GACACTTTCG GTCACGTGGC GGGAAATAAA GTGCTCATAA AAGTTTCGGA GTTTCTCAAA
GAAGAAACCG AAGGTCTTGA AAGAATAATC GGAAGGCTTG GTGGCGACGA GTTTATTATT
GTGGTAAAAA ATGATGGAAA AAACGAAAGA GTAAAAAATT TGGGAGAACA TCTTTCAAAA
GCAATAAGAG AGAAGAAGTT CGTAATTGAC GAGGAACTGG ATCCGATAAA TCTGTCTTTT
TCCGTGGGGC AGGCCAATTC GTCGCCTTCC GACACAGAAA ATGACATAGA AAAGCTTTTG
TATAAAGCAG ATATAAATAT GTATTATAAC AAGTGCAAAA ACCATAGGCT GGACATCTTT
ACAAATAACA AAAAACCTCT CCTTCCAAAG GAAGGATTTG AACTTTTAAA TGTCCTGGCG
GAAAAGGATA TGTACACTTA CGTTCATTCC AGGTACACTG CTCAGTACGC TGCTGCGCTT
GCAAAAGAAG CCGGTCTTCC GGATGAACAG GTTGAACGCA TTTATGCCGC CGGATGGCTC
CATGACATAG GTAAAATTCT TATATCCAGC GACATAATAA GAAAAAGCAC TACTTTAACT
CCCGAAGAAT ATGAGCTTAT CAAGGGACAT GTAAATTATG GACTTAACAT AATTAATAAT
TTATCTCTCC CTGTCGAAAT TATAAATTGC ATAGCATACC ATCACGAAAA CTGGGACGGC
ACAGGATATC CTCACGGACT CGCAGGAGAA AGCATACCTT TTGAAGCAAG AATTCTGCAA
TTGGCAGATT CCTATTCCGC AATGATAACA AGAAGAGTAT ACAGAAAAAC TCTAAGTCCT
GAGGATGCAC TCAATGAAAT TATCTCCGGA TGCGGAAAAC AATTTGATCC CAATCTCGTA
AAAATATTTG TAAAACTGAT ACAAAGCAAA TTTAAAGCAG CTTAG
 
Protein sequence
MQYDEKRSKT IAFFNKHFFN VSFVFLVGIL VIYIIYETGH FFPTFLYMSA FAAMMITAKF 
FIRHKALKHL LAVLSYVLFI FVVDSHSLPY PYYTVVNKWS VLTICYVLLL ENILFPLIAG
LYPALRIFML MPEAKAGLYP IGKVIGTANG QICFTITLVL IYQLFGKVII ERNKYKKMSI
TDSLTGVATF AHTIETAKKM IQNGNISILI TDMDRFKQIN DTFGHVAGNK VLIKVSEFLK
EETEGLERII GRLGGDEFII VVKNDGKNER VKNLGEHLSK AIREKKFVID EELDPINLSF
SVGQANSSPS DTENDIEKLL YKADINMYYN KCKNHRLDIF TNNKKPLLPK EGFELLNVLA
EKDMYTYVHS RYTAQYAAAL AKEAGLPDEQ VERIYAAGWL HDIGKILISS DIIRKSTTLT
PEEYELIKGH VNYGLNIINN LSLPVEIINC IAYHHENWDG TGYPHGLAGE SIPFEARILQ
LADSYSAMIT RRVYRKTLSP EDALNEIISG CGKQFDPNLV KIFVKLIQSK FKAA
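
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a minimal sketch, not the database's own pipeline: the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. It uses the amino acid assignments of NCBI translation table 11, which match the standard code. Table 11's alternative start codons are not handled, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)
```

For example, `translate("ATGCAATATG ATGAAAAGAG AAGCAAAACT")` gives `MQYDEKRSKT`, the first ten residues of the protein sequence above, and `translate("TTTAAAGCAG CTTAG")` gives `FKAA`, its last four. Passing the full gene sequence to `gc_fraction` gives a value to compare with the GC content field.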