Gene A9601_07381 details

Gene Information

Locus tag: A9601_07381
Symbol: purK
ID: 4717443
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 657471
End bp: 658661
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 31%
IMG OID: 640078452
Product: phosphoribosylaminoimidazole carboxylase ATPase subunit
Protein accession: YP_001009131
Protein GI: 123968273
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase)
TIGRFAM ID: [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein
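
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon.

```python
# Values copied from the record above; names are illustrative, not part of any database API.
START_BP = 657471
END_BP = 658661
GENE_LENGTH_BP = 1191
PROTEIN_LENGTH_AA = 396


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS, minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```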


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.210398
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTTTA AAAAAAATAT AAACGATATC AAGAAAAATT ATTCCCTAGG AATAATTGGA 
GGTGGTCAAC TGGCACTGAT GTTAACCGAG GCAGCAAAAA AAAGAGATTT AGAAGTATGT
GTGCAAACAA AATCTTGTGA TGATCCTGCT GGCTCAAAAG CAGATCATGT CATAGAAGCT
GATCCTTTAA AGATAAGAGG AAATAAATCA TTAATTAATG AGTGTGAAAA AATAATTTTT
GAAAATGAAT GGATAAAAAT TGATAAATTA AATTTAATTG GCAATGAAGA TATTTTTGTT
CCAAGCCTTA ATGCAATTAA GCCATTAGTA GATAGGTTTT CTCAAAAAAA ATTAATAGAT
AGAATGAATA TTCCCTGTCC AAAATGGATA AGTATTGAAG ATTTTAAAAA TCTCTCGGAT
GAGGAAATCA ATAATTGGAC TTTTCCTCTA ATGGCAAAAT CAAATAAAGG TGGATACGAC
GGCAAAGGGA ACAGAAAAAT AAAGACAAAA GAAGATTTAG ATTCTTTTTT AACAGAGAAT
AATTCTGATG AATGGTTAAT AGAAGAATGG ATAGAGTATG AAAAAGAACT GGCTCTTGTT
GGTTCGAGAG ATAGGACCGG TAAGATAAGA TTCTTTCCAA TAGTTGAGAC GTTCCAATCA
AACCATGTTT GTGATTGGGT TCTTGCACCT GGATCAAATG AATATGATTT GAACTTATTT
GCAATAAATA TTTTCTCTTC AATAGTCAAT GAACTTAATT ACGTTGGAGT TTTAGCTATT
GAATTCTTCT ATGGAGATAA TGGTCTTTTA ATTAATGAAA TAGCTCCTAG AACACATAAC
TCAGCTCATT TCTCTATTGA AGCTTGCACA TCAAGTCAGT TTGATCAATA TGTTTGCATT
TCTTCTGGGA TAATACCACC TGAAATTAAA ATGAACTGTG AAGGTGCAAT TATGATAAAT
CTACTGGGGT TAAAAAAGAA TTTCCCAATC TCAATGGAAA CCAGAATTAA AATGTTATCT
GAAATTGAGG GTTCTAATAT TCATTGTTAT GGCAAATCTC GCGAAATTCT TGGACGAAAA
ATGGCTCACA TCACATTTTT ATTAAATGGT AAAACGCATT CAGAAAGATA TGATGAAGCT
CAAATTTTAT TAACTATGGT AAGAGACATT TGGCCATCTC CAAATGCATA A
 
Protein sequence
MSFKKNINDI KKNYSLGIIG GGQLALMLTE AAKKRDLEVC VQTKSCDDPA GSKADHVIEA 
DPLKIRGNKS LINECEKIIF ENEWIKIDKL NLIGNEDIFV PSLNAIKPLV DRFSQKKLID
RMNIPCPKWI SIEDFKNLSD EEINNWTFPL MAKSNKGGYD GKGNRKIKTK EDLDSFLTEN
NSDEWLIEEW IEYEKELALV GSRDRTGKIR FFPIVETFQS NHVCDWVLAP GSNEYDLNLF
AINIFSSIVN ELNYVGVLAI EFFYGDNGLL INEIAPRTHN SAHFSIEACT SSQFDQYVCI
SSGIIPPEIK MNCEGAIMIN LLGLKKNFPI SMETRIKMLS EIEGSNIHCY GKSREILGRK
MAHITFLLNG KTHSERYDEA QILLTMVRDI WPSPNA
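
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is a minimal illustration, not the database's own pipeline: the helper names are hypothetical, the codon assignments are those shared by the standard code and translation table 11, and the special handling of alternative start codons in table 11 is deliberately left out (this gene starts with ATG, so it does not matter here). A GC-fraction helper is included for comparison with the GC content field.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGAGTTTTA AAAAAAATAT AAACGATATC"))  # MSFKKNINDI
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence listed above (after removing its spaces) is a quick consistency check on the record.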