Gene Tmz1t_0866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0866 
Symbol 
ID7084724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp958825 
End bp959844 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content68% 
IMG OID643697889 
ProductPhoH family protein 
Protein accessionYP_002354529 
Protein GI217969295 
COG category[T] Signal transduction mechanisms 
COG ID[COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAAAA TCCTGGAAGT CTTCTTCGAG CCGGTGGATA ACGCCCGGCT GGCCAAGCTG 
TGCGGCGTAC TCGACGAGAA CCTGCGCCAG ATCGAGAACG CCTTCGACAT CACCGTGAGC
CGGCGTGCCG AACACTTCAC GCTGCAGGGC CACCCCGCGC AGGTGCTGCG CGGCGAGATG
GCGCTCAAGC ATTTTTACGC GCTCGCCGAC AAGGATCTGT CGCGCGACGA GGTCCAGCTC
GGCCTCATCG AGATCGCCAA CAAGGGCGAG GCGGCGCAGC CCGCGCCGGT GCTGATGACG
CGCCGTACGG AGCTCCACGG CCGCACGCCG CGCCAGGTCG ACTACCTGCG CAACATCCAG
GACTTCGACA TCACCTTCGG CATCGGCCCG GCCGGCACGG GCAAGACCTA TCTCGCGGTG
GCGAGCGCGG TCGACGCCTT CGAGCGCGAC CTCGTCGAGC GCATCATCCT CACCCGCCCG
GCGGTCGAGG CCGGCGAGCG CCTGGGCTTC CTGCCCGGCG ACCTGGCGCA GAAGGTCGAC
CCCTACCTGC GCCCGCTCTA CGACGCGCTC TACGACCTGA TGGGCTTCGA CCGCGTCGGC
AAGCTCTTCG AGCGCGGCAG CATCGAGATC GCGCCGCTCG CCTTCATGCG CGGGCGCACG
CTCAATAATG CCTTCATCAT CCTCGACGAG GCGCAGAACA CGACCCCCGA GCAGATGAAG
ATGTTCCTCA CCCGCATCGG CTTCGGCGCC AAGGCGGTGG TCACCGGCGA CCTCACCCAG
ATCGACCTGG CACGCGGCCA GCGCAGCGGT CTCAAGGAGG CGCGCGCGGT GCTGGCGGGG
GTGCGCGGCA TCGCGTTCAC CGAATTCAGC AAGGAAGACG TGGTGCGTCA TCCGCTTGTC
GCGCGCATCG TCGAAGCTTA CGACCTCGAG GCTGCGCGCC TCGAGCGCGA GAAGGCCGCG
GCCCGCGCCG CGCGCCAGCA GCCGCACCCG CAGGAGCAGG AAGCCGAAGA TGGCGAATAA
 
Protein sequence
MAKILEVFFE PVDNARLAKL CGVLDENLRQ IENAFDITVS RRAEHFTLQG HPAQVLRGEM 
ALKHFYALAD KDLSRDEVQL GLIEIANKGE AAQPAPVLMT RRTELHGRTP RQVDYLRNIQ
DFDITFGIGP AGTGKTYLAV ASAVDAFERD LVERIILTRP AVEAGERLGF LPGDLAQKVD
PYLRPLYDAL YDLMGFDRVG KLFERGSIEI APLAFMRGRT LNNAFIILDE AQNTTPEQMK
MFLTRIGFGA KAVVTGDLTQ IDLARGQRSG LKEARAVLAG VRGIAFTEFS KEDVVRHPLV
ARIVEAYDLE AARLEREKAA ARAARQQPHP QEQEAEDGE