Gene NATL1_17041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_17041 
SymbolphoH 
ID4780886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1387996 
End bp1388973 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content39% 
IMG OID640084988 
ProductPhoH-like phosphate starvation-inducible protein 
Protein accessionYP_001015524 
Protein GI124026409 
COG category[T] Signal transduction mechanisms 
COG ID[COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAG CAACCACTGA AGGTCGCTTT TGTATAGATC TGCCTGATTC CGATGCTGCT 
ACTGCTTTAG CGGGAACTGG TCAGTCAACA CTTCATAGAT TAGAAACCCT AACAGGTGCT
GCTTTTGCCT TAAGGGGGTT GCAACTCGAA ATAAAAGGAA ATTCTTACCA ATTAGAAAAA
GCTGCAGCAA TTGTTGAATT AGTTAGACCA ATTTGGGAAG AAGGGCAAAT TGTCTCGCCC
GTTGATTTAC ATGCCGCGGC TAAAGCATTG GACAATGGTA AAAAAAATGA TCATGCCAAA
TCCACAAATA AAGTTTTAGC GAGAAGTCAA AGAGGAAATC TTTTAAGACC AAGAACAATT
AGACAAAAAT TATATGTAGA AGCTATGGAA AAAAGTGATC TTACTTTTGC TTTAGGCCCA
GCAGGAACAG GAAAAACTTT CTTGGCAACT GTATTAGCTG TGCGAATGCT AACTGAGAGG
AAAATCGAGA AAATTATTTT GACAAGACCT GCAGTTGAAG CTGGCGAAAG ATTGGGATTT
TTACCTGGGG ACTTACAGCA AAAGGTTGAT CCTTATCTAA GGCCTTTATA TGATTCTCTC
CACTCTTTAC TTGGACAAGA AAAAACTAAT TTGCTTATAG AAAAAAACGT GATTGAAGTT
GCGCCTTTGG CTTACATGCG AGGAAGGACT TTAGAAGAAT CATTTGTCAT ACTTGATGAG
GCACAAAATA CGACACCAGC ACAAATGAGG ATGGTTCTTA CCAGATTAGG TGAGAGGTCA
AGGATGGTAG TAACTGGCGA TATAACTCAG GTTGACTTGC CATATGGACA AATGAGCGGA
CTTATAGAAG CAGCAGACTT ACTAGAAAAG GTTGATGGAA TTTCAGTTTG CAGACTTACT
TCAGCAGATG TAGTAAGACA TCCACTTGTT CAAAGCGTTG TTGATGCTTA TGCAGAACTA
GATAAAAAAA GACGATAG
 
Protein sequence
MSEATTEGRF CIDLPDSDAA TALAGTGQST LHRLETLTGA AFALRGLQLE IKGNSYQLEK 
AAAIVELVRP IWEEGQIVSP VDLHAAAKAL DNGKKNDHAK STNKVLARSQ RGNLLRPRTI
RQKLYVEAME KSDLTFALGP AGTGKTFLAT VLAVRMLTER KIEKIILTRP AVEAGERLGF
LPGDLQQKVD PYLRPLYDSL HSLLGQEKTN LLIEKNVIEV APLAYMRGRT LEESFVILDE
AQNTTPAQMR MVLTRLGERS RMVVTGDITQ VDLPYGQMSG LIEAADLLEK VDGISVCRLT
SADVVRHPLV QSVVDAYAEL DKKRR