Gene P9301_07361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_07361 
SymbolpurK 
ID4912440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp656308 
End bp657498 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content31% 
IMG OID640160318 
Productphosphoribosylaminoimidazole carboxylase ATPase subunit 
Protein accessionYP_001090960 
Protein GI126696074 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.291651 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTAA AAAAAAATAT AAACGATTTT AAGAAAAATT ATTCCCTGGG AATAATTGGA 
GGTGGTCAAT TGGCTTTGAT GTTAACCGAG GCAGCAAAAA AAAGAGATCT TGAAGTATGT
GTGCAAACAA AATCTTGTGA TGATCCTGCT GGGTTAAAAG CAGATCATGT CATAGAAGCT
GATCCTTTAA AGATAAGAGG TAATAAATCA TTAATTAATG AGTGTGAAAA AATAATTTTT
GAAAATGAAT GGATAAAAAT TGATAAATTA AATTTAATTG ACAATAAAGA TATTTTTGTT
CCAAGCCTTA ATGCAATTAA GCCATTAGTA GATAGGTTTT CTCAAAAAAA ATTAATAGAA
AGAATGAATA TTCCCTGCCC AAAATGGATA AGTATTGAAG ATTTTAAAAA TCTCTCGGAT
CAGGAAATCA ATAATTGGAC TTTTCCTCTA ATGGCAAAAT CAAATAAAGG TGGATATGAC
GGCAAAGGGA ACAGAAAAAT AAAGACAAAA GAAGATTTAG ATTCTTTTTT AAAAGAGAAC
AATTCTAATG AATGGTTAAT AGAAGAATGG ATAGAGTATG AAAAAGAACT GGCTCTTGTT
GGTTCGAGAG ATAGGACCGG TAAGATAAGA TTCTTTCCAA TAGTTGAGAC GTTCCAATCA
AACCACGTTT GTGATTGGGT TCTTGCCCCT GGAACAAATG AATATGATTT GAACTTATTT
GCAATAAATA TTTTCTCTTC AATAGTCAAT GAACTTAATT ACGTTGGAGT TTTAGCTATT
GAATTCTTCT ATGGAGATAA TGGTCTTTTA ATTAATGAAA TAGCTCCTAG AACACATAAC
TCAGCTCATT TCTCTATTGA AGCTTGCACT TCAAGTCAGT TTGATCAATA TGTTTGCATT
TCTTCTGGGA TAATGCCACC TGAAATTAAA ATGAACTGTG AAGGTGCAAT TATGATAAAT
TTACTGGGTT TAAAAAAGAA TTTCCCAATC TCAATGGAAA CCAGAATTAA AATGTTATCT
GAAATTGAGG GTTCTAATAT TCATTGTTAT GGCAAATCTC GCGAAATTCT GGGAAGAAAA
ATGGCTCACA TCACATTTTT ATTAAATGGT AAAACGCATT TAGAAAGATA TGATGAAGCT
CAAATTTTAT TAACTATGGT AAGAGACATT TGGCCATCTC CAAATGCATA A
 
Protein sequence
MSLKKNINDF KKNYSLGIIG GGQLALMLTE AAKKRDLEVC VQTKSCDDPA GLKADHVIEA 
DPLKIRGNKS LINECEKIIF ENEWIKIDKL NLIDNKDIFV PSLNAIKPLV DRFSQKKLIE
RMNIPCPKWI SIEDFKNLSD QEINNWTFPL MAKSNKGGYD GKGNRKIKTK EDLDSFLKEN
NSNEWLIEEW IEYEKELALV GSRDRTGKIR FFPIVETFQS NHVCDWVLAP GTNEYDLNLF
AINIFSSIVN ELNYVGVLAI EFFYGDNGLL INEIAPRTHN SAHFSIEACT SSQFDQYVCI
SSGIMPPEIK MNCEGAIMIN LLGLKKNFPI SMETRIKMLS EIEGSNIHCY GKSREILGRK
MAHITFLLNG KTHLERYDEA QILLTMVRDI WPSPNA