Gene A9601_14831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_14831 
SymbolphoH 
ID4718204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1263387 
End bp1264343 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content36% 
IMG OID640079204 
ProductPhoH-like phosphate starvation-inducible protein 
Protein accessionYP_001009873 
Protein GI123969015 
COG category[T] Signal transduction mechanisms 
COG ID[COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.384424 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAAG TTTCCAAAAC TGGTCACTTC ACAATAGATC TGCCAAGCTC TGATGCTGCT 
ACAGCATTAT CTGGACCTGG TAATTCTTTC TTAAAAAAAT TTGAGTCTCT TACAGGAGTT
TCTTTAACTA TAAGGGGCTT ACAACTTGAG ATGAACGGTG TCATATCTAA AATTGAGAGA
GCCTCAGCAT TAGTAGAACT AACAAGACCA ATTTGGGAAC AAGGGTTAGA AGTCCCAGAG
GTAGATCTTA AAGCGGCTTT AAGTTCTTTA AATATGGGCG AATCGTCTTC ACATGCTGAA
CTAGGAAAAA AAATTCTTGC GCGTTCCAAA GAAGGAAGAT ATTTAAGACC AAGAACTATA
AGACAAAAAG AATATGTTGA ATCAATTGAA AGCTTTGATC TTACCTTTGC GATCGGTCCA
GCTGGAACTG GTAAGACATT TTTAGCAACT GTTTGCGCGG CAAGACTATT AAACGAGAAA
AAAATTGAAA AAATTATTTT AACCAGACCA GCTGTAGAAG CTGGTGAAAG TTTGGGATTC
CTACCTGGTG ATTTGCAACA AAAAGTAGAT CCATATTTAA GACCCTTATT TGATTCTTTA
CATAGTATTT TCGGGATTGA CAGAACAAAT TCGTTAATTG ATAAGGGAAT TATTGAAGTT
GCTCCTTTGG CATTTATGAG AGGCAGAACC TTAGATAACT CCATAGTTAT CCTAGATGAA
GCGCAAAATA CTACTTGCTC TCAAATGAGA ATGTTTTTGA CCAGATTAGG AGAGAGATCC
AAAATGGTTG TAAATGGAGA TATTACACAA ATTGATTTAA AAAAAGATCA GGAAAGCGGC
CTCATCGAAG CATCGAGAAT TTTCTCAAAA ACTCAAGATA TAAAATTTTG TTATTTAACT
GTTGAAGATG TGGTTCGTCA TCCTTTAGTT CAGAAAATTA TTGAGGCTTA TCAATAA
 
Protein sequence
MKEVSKTGHF TIDLPSSDAA TALSGPGNSF LKKFESLTGV SLTIRGLQLE MNGVISKIER 
ASALVELTRP IWEQGLEVPE VDLKAALSSL NMGESSSHAE LGKKILARSK EGRYLRPRTI
RQKEYVESIE SFDLTFAIGP AGTGKTFLAT VCAARLLNEK KIEKIILTRP AVEAGESLGF
LPGDLQQKVD PYLRPLFDSL HSIFGIDRTN SLIDKGIIEV APLAFMRGRT LDNSIVILDE
AQNTTCSQMR MFLTRLGERS KMVVNGDITQ IDLKKDQESG LIEASRIFSK TQDIKFCYLT
VEDVVRHPLV QKIIEAYQ