Gene Cthe_1071 details


Gene Information

Locus tag: Cthe_1071
Symbol:
ID: 4811369
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1278892
End bp: 1279893
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 41%
IMG OID: 640106493
Product: PhoH-like protein
Protein accession: YP_001037496
Protein GI: 125973586
COG category: [T] Signal transduction mechanisms
COG ID: [COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase
TIGRFAM ID:
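
The coordinate and length fields above are consistent with one another, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(1278892, 1279893)
print(gene_len, protein_len)  # 1002 333
```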


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGAAAGTC TTGTGGAAGT TTCTTTGGAA TTTGACAGGA TTGAACATGC TATGAACCTT 
TTTGGGAACT TTGACGAAAA TATTAATATA ATTGAGGACG CTTTTAATGT AAAAATTATT
TCAAGGGACA ACGAAATAAG AGTTGTGGGC TACAGTGACG CAGTATACAA GGCCCAGACG
GTTCTTCAAA GGCTTATTGC CATGGCGGCG CAAGGTGATA TCATCTCAAA GCAGAATGTG
AGTTATTTCG TTCAGTTGGC TGAAGAAAAC CAGTTGGATA AGATAAAAGG TTTCACTGCG
GATTTTGTCT GCCTTACCGC AAGAGGCAGG CAGATAAAGG CGAAGACCCA TGGACAGAAG
GTTTATGTGG ATGCAATAAA AGAAAATGAC ATAGTATTCG GCATAGGACC GGCAGGCACG
GGAAAGACAT TTCTTGCCGT GGCCATGGCG GTTAATGCTT TCAGAAACAA GAAAGTAAAC
AGGATAGTTC TTACAAGACC TGCGGTTGAA GCAGGTGAAA AACTGGGATT TTTGCCGGGC
GATTTGCAAA ACAAGGTGGA TCCGTATTTA CGTCCTTTGT ATGATGCTCT TTATGAAATG
ATGGGAGCCG AAACATATCA TAAATATCTG GAAAAAGGCA TGATAGAAGT TGCGCCCCTT
GCATACATGA GAGGAAGAAC TTTGGACGAT TCATTCATTA TACTTGATGA AGCCCAAAAT
ACCACTCCGG AGCAGATGAA AATGTTTCTT ACGCGAATAG GGTTTGGTTC AAAAGCCGTT
ATTACCGGTG ATATTACCCA GATAGACCTT CCGGGGGAAA AAAAGTCAGG GCTTGTTGAG
GTCATGAAAG TGTTAAAGGA CGTAAAGGGT ATTTCTTTTG TCCATTTGTC GGACATGGAC
GTGGTAAGAC ATGAATTGGT TCAAAGAATT ATCCAGGCAT ATGAAAGATA TGATAGGGAA
AAGAAGGAAA AGGGCAAAAA GGAAAGCAAG GAAACCAATT AA
 
Protein sequence
MESLVEVSLE FDRIEHAMNL FGNFDENINI IEDAFNVKII SRDNEIRVVG YSDAVYKAQT 
VLQRLIAMAA QGDIISKQNV SYFVQLAEEN QLDKIKGFTA DFVCLTARGR QIKAKTHGQK
VYVDAIKEND IVFGIGPAGT GKTFLAVAMA VNAFRNKKVN RIVLTRPAVE AGEKLGFLPG
DLQNKVDPYL RPLYDALYEM MGAETYHKYL EKGMIEVAPL AYMRGRTLDD SFIILDEAQN
TTPEQMKMFL TRIGFGSKAV ITGDITQIDL PGEKKSGLVE VMKVLKDVKG ISFVHLSDMD
VVRHELVQRI IQAYERYDRE KKEKGKKESK ETN
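
The gene sequence begins with TTG while the protein sequence begins with M. This matches translation table 11, in which TTG is one of the alternative start codons and an initiating codon is read as methionine. The following is a minimal sketch of that translation, not the pipeline that produced this record. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical; the codon table is the standard one, which table 11 shares for all non-initiator codons.

```python
from itertools import product

# Standard codon table in NCBI "TCAG" order; table 11 uses the same amino acids.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiator codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the display and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS, reading an allowed first codon as M and stopping at '*'."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
print(translate("TTGGAAAGTC TTGTGGAAGT TTCTTTGGAA"))  # MESLVEVSLE
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows the listed protein sequence and GC content to be checked against the record.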