Gene P9211_04731 details

Gene Information

Locus tag: P9211_04731
Symbol: nhaP
ID: 5730308
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 443387
End bp: 444595
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 45%
IMG OID: 641284830
Product: CPA1 family Na+/H+ antiporter
Protein accession: YP_001550358
Protein GI: 159903014
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0025] NhaP-type Na+/H+ and K+/H+ antiporters
TIGRFAM ID:
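
The start, end, gene length, and protein length fields above are related by simple arithmetic. A minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the last codon is a stop codon that is not counted in the protein length (the function names are illustrative and not part of the record):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(443387, 444595)
print(length_bp)                  # 1209
print(protein_length(length_bp))  # 402
```

Both values agree with the Gene Length and Protein Length fields listed in the record.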


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.281963
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.831051
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCCTG AGAGGCTAGG TCTACTTTGG GGTATCACAG TTTTTGCTGG TGCTGGGGCA 
AGGGTTTTGT CTGTGATATC TGGATTGCCA GGTGTTGTGC TATTGCTTTT GTCTGGCTTG
TTGATTGGAA GATCAGGGCT TGGACTAGTT GAGCCACTCG ATTTAGGCCA AGGTCTTGAG
ACAATTGTTG GTTTGTTGGT CAGCCTTGTC CTTTTTGATG GGGGGCTGAA TCTTCGTTTG
CCCGGGGGGA CTATTAAGGC AACTGTTTTG CGTATCTCAC TTATAAGGAT TTTTATATCT
TTTGCGGCTG TGCTGGTAAC TGCTCATCTA TTAGCCGGTT TGTCTTGGTC AGTATCGTCC
GTATATAGCG CAATTGTTCT TGCAACTGGG CCAACAGTTG TAACTCCACT TGTTCAGCAG
ATTCGTCTTG CCTCTCCTCT GGGTGATGTC CTTGAAGCAG AAGGTTTGGT GCTTGAACCC
ATAGGGGCTG TTTTAGCTTT GCTTCTCCTT GAACTCCTTG TGGGCGATTT ACATGGTCTT
AGGGAAGTCG CTTTCGGTTT GTTAGCAAGG TTAGGAGGAG GCGTGATTAT GGGCTTGTTT
GCAGGGTGGA TTCTTGCCGA AGCTCTTCAA AGAATTAAAA CGGATGCTTC TTTAGGCATA
AGGTTGCAAT TGACTCTGGG CATAGTTTTT TTACTTTATG GTTTTTGTGA ATGGTTATTA
CCTGAGTCAG GCTTGCCTGC TTCAGTCGCA GCAGGTTTCA TAGTTGGTAG AAGGCAATCT
ATTGAAGTAG ATCAGTTAGA TCAATTAATT AGAGAGTTAG CCCAATTAGC AATCACAATG
TTATTCCCAT TACTTGCTGC AGATGTGTCA TGGAGAGAAC TAAGTCCTTT GGGCTGGGGT
GGAATCAGCT GTGTTTTGGT CTTAATGTTA ATTATTCGAC CTGCAGCGGT AAGCCTTGCA
ACTATTGGAC TGCCCTTGGA TTTTAGGCAA AGATTGTTTT TGGGTTGGTT AGCTCCTAGA
GGGATTGTCA CTGCAGCAGT GGCTTCCCTT TTCTCTATTC GATTAGAACA AGCAGGAGTT
TTGGGGGCAG GACGTCTTCA GGGATTAGTT TTCTTAACGA TTTTGATGAC CGTTGGCATT
CAAGGTCTTA CAGCTAAGCC ATTGGCTAAA GGATTGGGTT TGTTGGCAAA ACAAGAGCAA
ACTCCTTAA
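
The GC content field in the record can in principle be recomputed from the gene sequence above. A minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full sequence length and that the whitespace between the 10-base groups is only formatting (`gc_percent` is an illustrative name, and the rounding used by the source database is not known):

```python
def gc_percent(seq: str) -> float:
    """Percentage of G/C bases in a sequence, ignoring whitespace and case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First 10-base group of the gene sequence, used here only as a small example.
print(gc_percent("ATGACTCCTG"))  # 50.0
```

Passing the full gene sequence as one string would give the value to compare against the reported GC content.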
 
Protein sequence
MTPERLGLLW GITVFAGAGA RVLSVISGLP GVVLLLLSGL LIGRSGLGLV EPLDLGQGLE 
TIVGLLVSLV LFDGGLNLRL PGGTIKATVL RISLIRIFIS FAAVLVTAHL LAGLSWSVSS
VYSAIVLATG PTVVTPLVQQ IRLASPLGDV LEAEGLVLEP IGAVLALLLL ELLVGDLHGL
REVAFGLLAR LGGGVIMGLF AGWILAEALQ RIKTDASLGI RLQLTLGIVF LLYGFCEWLL
PESGLPASVA AGFIVGRRQS IEVDQLDQLI RELAQLAITM LFPLLAADVS WRELSPLGWG
GISCVLVLML IIRPAAVSLA TIGLPLDFRQ RLFLGWLAPR GIVTAAVASL FSIRLEQAGV
LGAGRLQGLV FLTILMTVGI QGLTAKPLAK GLGLLAKQEQ TP
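
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal sketch of that translation, assuming the standard codon-to-amino-acid assignments (translation table 11 shares these with the standard code and differs in its permitted start codons, which does not matter here because the gene begins with ATG); the names `CODON_TABLE` and `translate` are illustrative:

```python
from itertools import product

# Codons enumerated in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA to protein, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces between 10-base groups
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence and the final 9 bases.
print(translate("ATGACTCCTG AGAGGCTAGG TCTACTTTGG"))  # MTPERLGLLW
print(translate("ACTCCTTAA"))                          # TP
```

The two outputs match the first ten and last two residues of the protein sequence in the record.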