Gene P9211_04581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_04581 
Symbolppx 
ID5731118 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp431083 
End bp432702 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content41% 
IMG OID641284815 
Productputative exopolyphosphatase 
Protein accessionYP_001550343 
Protein GI159902999 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGGAG ACAAGACTCT CTATGAGTCG GTAAAAACTA ATCTTGGCAA TGAAAAAACG 
CGTTGCATTG GGGCAATTGA TATCGGGACA AATTCAACGC ATTTGTTAGT GGCATCAGTG
GCAACTGATT TGCATACTTT TAGCATTGAC CTTGCGGAAA AATCATCCAC AAGGCTGGGA
GAGAGGGACC CTGATTCTGG CAATTTGACG GCTGTTGCTA TGGAAAGGGT TTTTGATGCT
TTGAAGAGAT TTAAGGATCT TGCCTTTAGC CATCAAGTCG AAGATTTAAT TATTGTTGCC
ACTAGTGCTG TGAGAGAGGC TCCTAATGGA AGAGAATTTG TAAATCGACT TCAAAATGAG
CTTGACCTAA ATGTTGAGTT GGTTAGTGGT GCAGAAGAAG CAAGGTTAAT TTATTTAGGT
GTTCTCTCTG GAATGCCATT TGGAGACCGC CCTCATCTCT TGGTTGATAT TGGAGGTGGG
TCAACAGAAA TGGTTCTCGC AGATGGACGA GATGCCAAGG CACTTACTAG CACTCGTGTT
GGAGCAGTTC GTCTTCAAAG AGACTTTATT AATAATGAAC CCATACCACC TGAAAGAAAA
GAGTTTTTGC AGACTTTTAT TCAGGGTTCT TTGGAGCCAG CTGCAACCAA AATCGCAAGA
AGAATGAAGT ATGGGGAGAC TCCTGTAATG GTTGCGACAA GTGGTACTGC AATGGCTATT
GGAGCAATAG CTTCTGAAGA TGCCAATCCT TCTGTATTGA AATTACATGG ATTTAAATTG
ACCAAGGATT CTTTGGATAA AATCATTTCA AGACTCTTAG TACTAAATCC TGAGGAAAGG
AAAAAGTTAC CTTCAATAAG TGATCGGCGT GCTGAAATAA TTGTTCCAGG GGCTTTGATT
CTTCAGACAA TAATGGAGAT GATGAAAGTG GAGGAGGTTG TTCTCAGCGA AAGGGCATTG
CGAGAAGGTT TGGTAGTTGA TTGGATGTTT AGGAAAGGAT TGTTAGAAGA TCGTTTTAGT
TTGCAAGGAA GCATTCGGCA GCGTACTGTG CTCCATCAAG CTCAGAGATT TGCGGTTAAT
AGTTCGCGGG CTCAGAGAGT TTCCAGCCAT GCTCTAGCTC TATATGACGA TTCTCGTGGT
GTTTTGCATC GGGACAAAGG TGAAGGTAGA GATTTGCTTT GGGCTGCAGC AATGCTTCAT
GCTTGTGGTC AACATATAAA TCTAAGTGCT TATCACAAAC ACTCTTGGTA CCTAATACGT
CATGGAGAAT TACTGGGCTA TTCACATTCT GAGCATTTAA TGATCGCAGC TATAGCTCGA
TATCATCGAA AGAGTCTCCC TAAAAAACGT CATGATGCTT GGCAAGCCTT GGGTAGCAAA
GAACAACGGA AGATTGTTTC TGAAATGGCA CTTTTACTTC GTTTGTCAGT TGCTGTAGAT
AGACGCCCTC AACCAGTAGT TGCTTCTATA GATGTTGCTT CCGAAGAGAA TAAAGTTACT
ATTAAACTTA TTCCAGAACA ATCAACTCAA AGCTTAAGTT TAGAACAGTG GAGCTTAAGC
AATTGCATCC CATTAGTAAA GAGTTTAACG GGAGTAGAAT TAAAGATTTT ATTGGATTAA
 
Protein sequence
MPGDKTLYES VKTNLGNEKT RCIGAIDIGT NSTHLLVASV ATDLHTFSID LAEKSSTRLG 
ERDPDSGNLT AVAMERVFDA LKRFKDLAFS HQVEDLIIVA TSAVREAPNG REFVNRLQNE
LDLNVELVSG AEEARLIYLG VLSGMPFGDR PHLLVDIGGG STEMVLADGR DAKALTSTRV
GAVRLQRDFI NNEPIPPERK EFLQTFIQGS LEPAATKIAR RMKYGETPVM VATSGTAMAI
GAIASEDANP SVLKLHGFKL TKDSLDKIIS RLLVLNPEER KKLPSISDRR AEIIVPGALI
LQTIMEMMKV EEVVLSERAL REGLVVDWMF RKGLLEDRFS LQGSIRQRTV LHQAQRFAVN
SSRAQRVSSH ALALYDDSRG VLHRDKGEGR DLLWAAAMLH ACGQHINLSA YHKHSWYLIR
HGELLGYSHS EHLMIAAIAR YHRKSLPKKR HDAWQALGSK EQRKIVSEMA LLLRLSVAVD
RRPQPVVASI DVASEENKVT IKLIPEQSTQ SLSLEQWSLS NCIPLVKSLT GVELKILLD