Gene Cthe_1285 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1285 
Symbol 
ID4809537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1561401 
End bp1562798 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content37% 
IMG OID640106708 
Productmetal dependent phosphohydrolase 
Protein accessionYP_001037710 
Protein GI125973800 
COG category[T] Signal transduction mechanisms 
COG ID[COG2206] HD-GYP domain 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAACA AATCTGTGTA CATATTGATA GTTAAGTTTT TTTTCCTGTG TGCCGTACTC 
TTTTTCTCGG CAGTTGGATT TGCCCGGACT TCAGACTTCA GATTTGCCAT TATCATGGGA
TTTGCTTTAA TACTTGCATT AATTGATTAC ATGTCAAAGG TTGTTTCTAA TAAAAAGAAT
TATCAGGAAC TATTAATAAA CAACGAAAGA ATAAAAAAAA TGGAAGCTGA ACTTGACTCT
CTTCATGAAG TGTCGAAGGT CATTACGTCT ACTTTTGATG TAAAAGTAAT AATGGAACAT
ACATATAATG AACTGGTAAG AATTACCAGA TGTGAATGGT ATTATGTTTG TTTTGTTGAC
AAAGAATCTT TTGAGCCGGC GTTTAACTAT GAGTTTGGCA ATCAAGTATT GAAAGAAGCG
GGGAATATCG ATTATGAACA GTATGTGAGA AATCTTGCAA AGAGTAATCC TTTTTCGGAG
CCGCAAATTG AAACGCTTTT AAGGGAAGAA AAACGTGAAA TTATGGCAAT TTCGTTAAAT
GTTTCGGATG AATTGATAGG CGCGATTTTC ATTGGCAGTT CAAAAACAGG AGCTTTTTCG
GAAGTAAATT TAAGTTTTTT AAAGAGTCTT GCGGGTTATG TTGCCATAGG TATCAGAAAT
GCTGAAATGT TTAATAATAT ATACAGCCAG AAACAAGAAA TAGAAGCCTT GTATGAACAA
GCTGCGGCAA GCAATGAAGA GCTCAACAGT TATATCAAGG AATTGGACAA AACAAAAGAG
GAGCTTAATC AGAAAAATAT CGAACTGATG ACACTGTTTG ACAACATTCA GTATGGCTAT
CTTCAGACGG TTATGTGTCT TGCCAATTCC ATCGAGGCAA AGGATCCTTA CACAAGAGGG
CATTGCCAGA GGGTTATGGA AATATCCTGT GAACTTGCAA GGGCAATGAA ACTGTCTGAG
GAAGAAATCA AGGATTTAAG GTATGCGGCC ATACTGCATG ATATAGGGAA AATAGGTATT
TCGGCGAGTA TATTAAACAA GCAGGGAAAA TTGACGGATG AAGAATTTGA AGAAATAAAG
AAGCATCCGA TTATTGCATA TAATATTCTT AAAGATGTAG AATTTTTAAA GAATGGGTTG
AACGGTATAC TGCAGCATCA TGAACGATAT GACGGGAAAG GATATCCTTA CGGACTTAAA
GGAGAAGAAA TTTGCATTTT TGCAAGGATA ATGTGTGTGG CGGACGCTTT CGACGCCATG
ACAAGTGACA GACCGTACAG AAAGGGCATG GATATGAAAT CAGCCCTGGA GGAGATAAAA
AGATGCAGGG GAACGCAATT TGATCCCGAA ATTGCTGATT TGCTTCTGAT GATGGCAGAT
GAAGGTAGAA CTTATTGA
 
Protein sequence
MQNKSVYILI VKFFFLCAVL FFSAVGFART SDFRFAIIMG FALILALIDY MSKVVSNKKN 
YQELLINNER IKKMEAELDS LHEVSKVITS TFDVKVIMEH TYNELVRITR CEWYYVCFVD
KESFEPAFNY EFGNQVLKEA GNIDYEQYVR NLAKSNPFSE PQIETLLREE KREIMAISLN
VSDELIGAIF IGSSKTGAFS EVNLSFLKSL AGYVAIGIRN AEMFNNIYSQ KQEIEALYEQ
AAASNEELNS YIKELDKTKE ELNQKNIELM TLFDNIQYGY LQTVMCLANS IEAKDPYTRG
HCQRVMEISC ELARAMKLSE EEIKDLRYAA ILHDIGKIGI SASILNKQGK LTDEEFEEIK
KHPIIAYNIL KDVEFLKNGL NGILQHHERY DGKGYPYGLK GEEICIFARI MCVADAFDAM
TSDRPYRKGM DMKSALEEIK RCRGTQFDPE IADLLLMMAD EGRTY