Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_15551 |
Symbol | purK |
ID | 4776013 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1354626 |
End bp | 1355837 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640087064 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_001017564 |
Protein GI | 124023257 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.358737 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCAGGATT GCGTGATTGA TTCTCTCTGC ATGAGCAGCG TTACAAGCCC AATGATTGGG GTCGTCGGTG GTGGTCAGTT GGCTCAGATG TTGGCGCAAG CAGCAAAAAG ACGCGCTGTG GATGTTGTCG TGCAGTCGGG ATCGGCAATC GATCCCGCTG CTGTTGAAGC AACTCGACTT GTTTTGGCTG ACCCAGTCGA TGTAGAAGCT ACTAGCAAGC TCGTGCAGGG CTGTTGTGGC GTCACGTTTG AGAACGAATG GGTCGATATT GAAGCTCTGA TTCCCCTTGA ACAACAAGGG GTGTGTTTTT CTCCGTCTCT TACTGCGCTT GCTCCATTGG TCGACAAAAT CTCGCAGCGT CAGTTGCTTC GTGAGCTTGA TCTTCCTAGC CCTGATTGGA CTTTGCTGAG TTCGATTTCT TTTGATCAGC CCGAGCTTCC TACGGAGTGG AACTTTCCGG TGATGGCCAA GTCAAGCCGG TGGGGATATG ACGGCAAAGG AACCAAGGTT CTCAAGAGTG TCGAGGATTT GTCGCAACTT CAGCGCTCAG TGGATCCAAC TCAATGGCTG CTTGAGAGCT GGGTGCCGTT CGAAAAGGAA TTAGCCATTG TTGTTAGTCG AGATGCTCAG GGCCGTGTTC GTAGCCTGCC ACTTGCTGAG ACTCATCAGT TCCAACAGGT GTGTGATTGG GTGATAGCAC CTGCGAGTGT TGATCATGCT GTGGAAATGA TGGCCTACAA CATGGCAGCG TCTCTTCTAA CAGAGCTCAA TTACGTGGGC GTGCTTGCTG TTGAATTTTT CTACGGACCA GAGGGACTGC AGGTCAATGA AGTTGCACCT CGCACTCACA ATTCCGCACA TTTTTCGATC GAAGCCTGCA GCAGCAGCCA GTTTGATCAA CAACTTTGTA TCGCGGCGGG CTTGCCAGTG CCTGCAACCG ATCTCCATGC ACCTGGCGCC TTAATGGTGA ACCTTCTAGG TTTGCAAAAA GGGGTTGAGC CCTCTCTAGA TGAGCGCCTA GCGAAGCTGC GTAGTTGTGA TCGCTTCCAT TTGCACTGGT ATGGAAAAGA TTGTGAAACT CCAGGACGCA AGCTCGGTCA TGTGACCGTG CTGCTTCATG GTGTTGATGC GCCCAGCCGT CAGCTCGAGG CGGAAACTGC CTTAAAGCAT ATTCGCTCAA TCTGGCCGAC GCAGGACACC GTTTGCGCTT AA
|
Protein sequence | MQDCVIDSLC MSSVTSPMIG VVGGGQLAQM LAQAAKRRAV DVVVQSGSAI DPAAVEATRL VLADPVDVEA TSKLVQGCCG VTFENEWVDI EALIPLEQQG VCFSPSLTAL APLVDKISQR QLLRELDLPS PDWTLLSSIS FDQPELPTEW NFPVMAKSSR WGYDGKGTKV LKSVEDLSQL QRSVDPTQWL LESWVPFEKE LAIVVSRDAQ GRVRSLPLAE THQFQQVCDW VIAPASVDHA VEMMAYNMAA SLLTELNYVG VLAVEFFYGP EGLQVNEVAP RTHNSAHFSI EACSSSQFDQ QLCIAAGLPV PATDLHAPGA LMVNLLGLQK GVEPSLDERL AKLRSCDRFH LHWYGKDCET PGRKLGHVTV LLHGVDAPSR QLEAETALKH IRSIWPTQDT VCA
|
| |