Gene P9303_06591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_06591 
Symbolppx 
ID4777589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp620151 
End bp621797 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content55% 
IMG OID640086166 
Productputative exopolyphosphatase 
Protein accessionYP_001016676 
Protein GI124022369 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCTTTCG GTGTGGTTGA TGTCGAGCAG GAGGAGGCCA GGTACGCCAG TGCTGGTCAG 
CTCCGTTCTG AGCGAGAGCT CCGCAATGTT GCTGCGATTG ATATTGGAAC CAATTCCACC
CATCTCCTGG TGGCGTCGGT TGATCCTTCT CTGCACACCT TCAGCGTTGA TCTTGCAGAG
AAATCAACCA CAAGACTGGG TGAGCGAGAC CCTGACTCTG GAGAGCTCAC GGGTCCTGCT
ATGGAGCGGG TATTTGAAAC ACTCAAACGT TTTAAGGAGC TAGCCGCCAG CCACCAGGTG
GAGCAGGTGG TTGCGTCGGC GACTAGCGCA GTACGTGAAG CTTCAAATGG GCGAGATTTT
TTGCAACGGA TTCAGGATCA GCTCGGCCTT GAGGTGGATC TGTTGAGCGG TGCTGAGGAA
GCACGACTGA TCTATTTGGG TGTTTTGTCT GGCATGCCTT TTGGGGAGTG TCCCCATCTT
TTGCTCGATA TCGGTGGCGG GTCCACAGAG CTCATCCTCG CGGATGGTCG CGATGCTCGG
GCTCTGACAA GCACGCGGGT TGGTGCGGTT CGGCTACAGA GGGATTTTGT CAAGCATGAA
CCAATCCCCC CTCAACGGCG TTCCTTTTTG CAGACATTTA TCCAGGGTTC TTTAGAGCCA
GCTGTTGATA AGGTCTGCCG TCGCATCAAT CCTGGGGAGA GCCCTGTGAT GGTTGCCACC
AGTGGAACCG CCATGGCCAT CGGTGCTTTG TTGGCTAGCG AAGATGATCG CCCTCCGTTG
AGATTGCATG GCTATCGATT CTCACGGCAG CGGCTTGATG GCCTGGTTGA GAAGCTGATA
GCGATGACGC CTGAGCAGCG TCGCATCCAG ACGCCGATTA ATGATCGCCG TTCCGAAATC
ATCGTTCCGG GTGCGCTGAT TCTTCAAACC GCCATGCAGA TGCTCGCTGC TGATGAGCTG
GTGCTGAGTG AAAGGGCTCT GAGGGAGGGT TTGATCGTTG ACTGGATGCT GCGACACGGC
CTACTGGAGG ATCGCTTTAG TTTTCAGAGC AGTATTCGTC AGCGCACCGT GATTCATCAG
GTGCAGCGTT TTGCAGTGAA TCACCGCCGT GCAGAGCGTG TAGCCAGCCA TGCACTCAGC
CTTTACGACC ACACACATGG TGTTCTTCAT CACGATGATG GAGGTGGTCG GGATCTGCTT
TGGGCATCGG CAATGTTGCA TGCCTGTGGT CAGCACATCA ATCTCAGTGC TTATCACAAG
CATTCCTGGT ATCTGATTCG TCATGGCGAG TTGCTCGGCT ATTCGGAAGC GGAGCATTTG
ATGGTTGCGG CTATTGCGCG CTATCACCGC CGCAGTCTTC CCAAGAAGCG CCATCTGGAA
TGGCAGGCTT TGGCCACCCG GGAGCATCGT CGTCTCGTTG CCGAGATGGC TTTGTTGCTG
CGATTAGCAG CAGCCATTGA TCGTCGTCCT GAGCCAGTGG TGGCTGCAAT CAGAGTGGAG
TCCGCTGATG ATGATCAGGT TGTCTTTGAA CTCGTGCCGG AAGGATTGAA TCAGAACCTC
AGCCTTGAAC AATGGAGCTT GAAGAGCTGT GCTTCGGTTG TGAAAGAAGC AAGCGGCGTA
ACCATGAAAG TTGTCGTTGA GGAATGA
 
Protein sequence
MPFGVVDVEQ EEARYASAGQ LRSERELRNV AAIDIGTNST HLLVASVDPS LHTFSVDLAE 
KSTTRLGERD PDSGELTGPA MERVFETLKR FKELAASHQV EQVVASATSA VREASNGRDF
LQRIQDQLGL EVDLLSGAEE ARLIYLGVLS GMPFGECPHL LLDIGGGSTE LILADGRDAR
ALTSTRVGAV RLQRDFVKHE PIPPQRRSFL QTFIQGSLEP AVDKVCRRIN PGESPVMVAT
SGTAMAIGAL LASEDDRPPL RLHGYRFSRQ RLDGLVEKLI AMTPEQRRIQ TPINDRRSEI
IVPGALILQT AMQMLAADEL VLSERALREG LIVDWMLRHG LLEDRFSFQS SIRQRTVIHQ
VQRFAVNHRR AERVASHALS LYDHTHGVLH HDDGGGRDLL WASAMLHACG QHINLSAYHK
HSWYLIRHGE LLGYSEAEHL MVAAIARYHR RSLPKKRHLE WQALATREHR RLVAEMALLL
RLAAAIDRRP EPVVAAIRVE SADDDQVVFE LVPEGLNQNL SLEQWSLKSC ASVVKEASGV
TMKVVVEE