Gene P9303_19541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_19541 
SymbolphoH 
ID4778183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1718617 
End bp1719588 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content57% 
IMG OID640087464 
ProductPhoH-like phosphate starvation-inducible protein 
Protein accessionYP_001017961 
Protein GI124023654 
COG category[T] Signal transduction mechanisms 
COG ID[COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.47675 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGGAG TCACCTCAGA GGGTCGCTTC GTTCTGGATC TTCCCGATAC TGACGCTGCG 
CTGGCTCTTG CAGGAAACGC TGAACAGACC TTGCATCACC TTCAGGCCTT AACCGGGGCT
TCTTTGGTGA TTAGGGGTCT TCAGCTTGTG ATCGGCGGCC GCCCAGCTCA ATTGGAACGT
GCCGCAGCAG TGGTTGAGTT GATCAGACCT CTGTGGCAAG AAGGTCAGGC TGTTTCAGCT
GTTGATTTAC AAGCGGCGCT CACGGCTCTT GATACTGGCC GTAGGGATGC TCATGCTGAA
TTGGCTGATC AGGTGTTGGC CCGCAGTCAG CGAGGCAACC TGCTGCGGCC GAGGACATTG
CGCCAGAAGG CTTATGTCGA GGCGATGGAG CGCCACGATC TCACCTTCGC TCTTGGGCCT
GCAGGAACTG GTAAAACTTT TTTGGCCACA GTGTTGGCAG TGCGCATGCT CAGTGAGCGA
AAGGTTGAGC GCCTGGTGTT AACTCGGCCA GCGGTTGAGG CCGGTGAAAG ATTGGGCTTT
CTGCCTGGAG ACCTACAGCA GAAAGTGGAT CCTTATCTGC GTCCTCTTTA TGACGCTCTT
CACGCCCTAC TAGGAGCTGA GAAAACCACC ACATTGCTGG AGAAGGGGGT AATTGAAGTG
GCCCCCCTCG CTTATATGCG AGGACGCACC TTGGAAGAGG CTTTTGTGAT CCTCGATGAG
GCTCAGAACA CAACGCCGGC TCAGATGCGC ATGGTGCTCA CTCGGCTTGG GGAGCGTTCG
CGCATGGTTG TCACGGGTGA CACCACCCAG GTGGATTTGC CACCGGGCCA GCTCAGCGGA
CTTGTGGATG CTGCTGAGGT GCTCGCTGAT CTCAACGGTG TCGCTGTCTG TCGCCTCACC
TCTGCAGATG TGGTGCGTCA TCCGCTTGTG CAACGGGTTG TGGATGCTTA TGCGCGCCGA
GATCAACGAT AG
 
Protein sequence
MAGVTSEGRF VLDLPDTDAA LALAGNAEQT LHHLQALTGA SLVIRGLQLV IGGRPAQLER 
AAAVVELIRP LWQEGQAVSA VDLQAALTAL DTGRRDAHAE LADQVLARSQ RGNLLRPRTL
RQKAYVEAME RHDLTFALGP AGTGKTFLAT VLAVRMLSER KVERLVLTRP AVEAGERLGF
LPGDLQQKVD PYLRPLYDAL HALLGAEKTT TLLEKGVIEV APLAYMRGRT LEEAFVILDE
AQNTTPAQMR MVLTRLGERS RMVVTGDTTQ VDLPPGQLSG LVDAAEVLAD LNGVAVCRLT
SADVVRHPLV QRVVDAYARR DQR