Gene P9303_15551 details


Gene Information

Locus tag: P9303_15551
Symbol: purK
ID: 4776013
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1354626
End bp: 1355837
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 52%
IMG OID: 640087064
Product: phosphoribosylaminoimidazole carboxylase ATPase subunit
Protein accession: YP_001017564
Protein GI: 124023257
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase)
TIGRFAM ID: [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein
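
The start, end, gene length, and protein length fields can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it is consistent with the stated length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1354626
end_bp = 1355837

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Each codon is three bases; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1212 403
```

Both results agree with the Gene Length and Protein Length fields.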


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.358737
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCAGGATT GCGTGATTGA TTCTCTCTGC ATGAGCAGCG TTACAAGCCC AATGATTGGG 
GTCGTCGGTG GTGGTCAGTT GGCTCAGATG TTGGCGCAAG CAGCAAAAAG ACGCGCTGTG
GATGTTGTCG TGCAGTCGGG ATCGGCAATC GATCCCGCTG CTGTTGAAGC AACTCGACTT
GTTTTGGCTG ACCCAGTCGA TGTAGAAGCT ACTAGCAAGC TCGTGCAGGG CTGTTGTGGC
GTCACGTTTG AGAACGAATG GGTCGATATT GAAGCTCTGA TTCCCCTTGA ACAACAAGGG
GTGTGTTTTT CTCCGTCTCT TACTGCGCTT GCTCCATTGG TCGACAAAAT CTCGCAGCGT
CAGTTGCTTC GTGAGCTTGA TCTTCCTAGC CCTGATTGGA CTTTGCTGAG TTCGATTTCT
TTTGATCAGC CCGAGCTTCC TACGGAGTGG AACTTTCCGG TGATGGCCAA GTCAAGCCGG
TGGGGATATG ACGGCAAAGG AACCAAGGTT CTCAAGAGTG TCGAGGATTT GTCGCAACTT
CAGCGCTCAG TGGATCCAAC TCAATGGCTG CTTGAGAGCT GGGTGCCGTT CGAAAAGGAA
TTAGCCATTG TTGTTAGTCG AGATGCTCAG GGCCGTGTTC GTAGCCTGCC ACTTGCTGAG
ACTCATCAGT TCCAACAGGT GTGTGATTGG GTGATAGCAC CTGCGAGTGT TGATCATGCT
GTGGAAATGA TGGCCTACAA CATGGCAGCG TCTCTTCTAA CAGAGCTCAA TTACGTGGGC
GTGCTTGCTG TTGAATTTTT CTACGGACCA GAGGGACTGC AGGTCAATGA AGTTGCACCT
CGCACTCACA ATTCCGCACA TTTTTCGATC GAAGCCTGCA GCAGCAGCCA GTTTGATCAA
CAACTTTGTA TCGCGGCGGG CTTGCCAGTG CCTGCAACCG ATCTCCATGC ACCTGGCGCC
TTAATGGTGA ACCTTCTAGG TTTGCAAAAA GGGGTTGAGC CCTCTCTAGA TGAGCGCCTA
GCGAAGCTGC GTAGTTGTGA TCGCTTCCAT TTGCACTGGT ATGGAAAAGA TTGTGAAACT
CCAGGACGCA AGCTCGGTCA TGTGACCGTG CTGCTTCATG GTGTTGATGC GCCCAGCCGT
CAGCTCGAGG CGGAAACTGC CTTAAAGCAT ATTCGCTCAA TCTGGCCGAC GCAGGACACC
GTTTGCGCTT AA
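
A figure like the GC content field in this record can be recomputed from the gene sequence once the 10-base groups are joined. The following is a minimal sketch, not the database's own method; `join_groups` and `gc_percent` are hypothetical helper names, and the demonstration uses only the short final line of the sequence.

```python
def join_groups(block: str) -> str:
    """Join a sequence printed in space-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    gc = dna.count("G") + dna.count("C")
    return 100.0 * gc / len(dna)


# Last line of the gene sequence above, used as a small worked example.
fragment = join_groups("GTTTGCGCTT AA")
print(len(fragment), round(gc_percent(fragment), 1))  # 12 41.7
```

Passing the full 1212-base sequence through the same two functions would give the value to compare with the GC content field.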
 
Protein sequence
MQDCVIDSLC MSSVTSPMIG VVGGGQLAQM LAQAAKRRAV DVVVQSGSAI DPAAVEATRL 
VLADPVDVEA TSKLVQGCCG VTFENEWVDI EALIPLEQQG VCFSPSLTAL APLVDKISQR
QLLRELDLPS PDWTLLSSIS FDQPELPTEW NFPVMAKSSR WGYDGKGTKV LKSVEDLSQL
QRSVDPTQWL LESWVPFEKE LAIVVSRDAQ GRVRSLPLAE THQFQQVCDW VIAPASVDHA
VEMMAYNMAA SLLTELNYVG VLAVEFFYGP EGLQVNEVAP RTHNSAHFSI EACSSSQFDQ
QLCIAAGLPV PATDLHAPGA LMVNLLGLQK GVEPSLDERL AKLRSCDRFH LHWYGKDCET
PGRKLGHVTV LLHGVDAPSR QLEAETALKH IRSIWPTQDT VCA
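
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below uses the standard amino-acid assignments, which translation table 11 shares with table 1; the two tables differ in which codons may act as start codons. The gene here begins with TTG, which is read as methionine at the start of the coding sequence. `START_CODONS` is an assumed, partial set of initiator codons that is sufficient for this record, and `translate` is a hypothetical helper, not part of any library.

```python
# Standard amino-acid assignments, ordered by codon with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a partial set of table 11 initiator codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, is_cds_start: bool = True) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence above. Each earlier line holds
# 60 bases, so the last line also starts in frame.
first_line = "TTGCAGGATT GCGTGATTGA TTCTCTCTGC ATGAGCAGCG TTACAAGCCC AATGATTGGG"
last_line = "GTTTGCGCTT AA"

print(translate(first_line))                     # MQDCVIDSLCMSSVTSPMIG
print(translate(last_line, is_cds_start=False))  # VCA
```

The two outputs match the first 20 and last 3 residues of the protein sequence listed above.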