Gene Moth_0593 details


Gene Information

Locus tag: Moth_0593
Symbol:
ID: 3830978
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 617517
End bp: 618518
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 55%
IMG OID: 637828534
Product: PhoH-like protein
Protein accession: YP_429466
Protein GI: 83589457
COG category: [T] Signal transduction mechanisms
COG ID: [COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase
TIGRFAM ID:
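
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record for Moth_0593.
length = gene_length_bp(617517, 618518)
print(length, protein_length_aa(length))  # prints "1002 333"
```

Under these assumptions the result agrees with the listed gene length (1002 bp) and protein length (333 aa).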


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.318782
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAAAC CATTGGCCGA TATTTATGAA GTAAAACTCA CCACCGGCAA TAACGGTGAG 
GCGGCCAACA TCTTTGGCCA CCAGGATGAA AACCTGAAAT TTATTGAGAG CCACACGCCG
GCCCGGATTA TCGCCCGGGG TAACGAAATA ACCTTAAGCG GCGACCGGCG GGAAGTCCAG
GTGCTGGAAA AGCTTTTCCG GCAACTAATA AAACTCGCCC GGGCGGGAAC AACCATCAAT
ACAGCAACCA TCAACTACAC CTGGAACCTG GTCCGCAGGC AGGACGGCAG CCAGGATCAG
CCGGACCTGG CCCAGGCTCT GGGTGAAGTG ATTTATGTTA CCCCCCGGGG TAAGCAGATC
CGGCCCAAGA CCCTGGGACA ATTGCGTTAT ATTCAGGCCA TGCGCCGTTA TGATATCGTC
TTTGGTATCG GCCCGGCCGG TACCGGTAAA ACCTACCTGG CAGTAGTTAT GGCCGTCAAT
GCCCTGAGGG CGCGCAGCGT AGAAAGGATC ATCCTGGCCC GACCGGCAGT AGAAGCGGGA
GAGAAGCTGG GCTTCCTCCC CGGCGACCTG CAGGAAAAGG TCAATCCCTA CCTGCGCCCC
CTTTATGACG GCCTTTATGA CGTTTTAGGA CTGGAAACGG CACAAAAGTA TATGGAAAAA
AATATTATAG AAGTAGCGCC CCTGGCCTAT ATGCGGGGAC GGACCCTGGA CGACGCCTTT
ATCATCCTGG ATGAGGCCCA GAATACTACT TCCGAACAAA TGAAAATGTT CCTGACCAGG
ATCGGCTTCG GCTCCAGGGC GGTAATCACC GGCGATATCA CCCAGGTGGA TCTGCCCCGG
GAGACAACCT CCGGCCTGGT GGAAGTCCAG AGGATTTTAA AGGGCATTGA AGGCATTGCC
ATCGAGTATT TAACGGAAGC CGATGTGGTT CGGCATCCCC TGGTCCAGGA GATCATCAAG
GCCTACGAGA GGAGTGACCA GATGTGCCAT GGCAGCGGTT AG
 
Protein sequence
MIKPLADIYE VKLTTGNNGE AANIFGHQDE NLKFIESHTP ARIIARGNEI TLSGDRREVQ 
VLEKLFRQLI KLARAGTTIN TATINYTWNL VRRQDGSQDQ PDLAQALGEV IYVTPRGKQI
RPKTLGQLRY IQAMRRYDIV FGIGPAGTGK TYLAVVMAVN ALRARSVERI ILARPAVEAG
EKLGFLPGDL QEKVNPYLRP LYDGLYDVLG LETAQKYMEK NIIEVAPLAY MRGRTLDDAF
IILDEAQNTT SEQMKMFLTR IGFGSRAVIT GDITQVDLPR ETTSGLVEVQ RILKGIEGIA
IEYLTEADVV RHPLVQEIIK AYERSDQMCH GSG
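
The gene and protein sequences above are printed in space-separated blocks of ten, so they need cleaning before any comparison. The sketch below strips the whitespace, computes GC content, and translates the DNA codon by codon. It is an illustration only: the helper names are hypothetical, and it assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares. The alternative start codons that table 11 permits are not handled, which does not matter here because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the block layout."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence for Moth_0593.
print(translate("ATGATCAAAC CATTGGCCGA TATTTATGAA"))  # prints "MIKPLADIYE"
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence, or passing it to `gc_percent` and comparing with the listed GC content, would be a quick consistency check on the record.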