Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_07431 |
Symbol | purK |
ID | 4781291 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 684650 |
End bp | 685795 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640084018 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_001014566 |
Protein GI | 124025450 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0031088 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGATTGGGG TCGTGGGTGG AGGACAGCTT GCAATGCTTT TGATTGAGGC TGGAAAGAAA AGAAATGTTG ATGTCGTTGT TCAGACGGCT GCTAAAACTG ATCCTGCTGC TAAAAAGACA AATCAACTCG TTTTGCATGA CCCTACGAAT CCTGTGGGTA CAAAACTTCT TGCAGAAAAG ACCCGCTTGA TTACTTTTGA AAATGAATGG GTTGATATCT CAAGTTTACT TTCTCTTGAA AATAATGGAG TTTCTTTTGT CCCTAGACTT CAATCAATAA GACCTTTAAT TAATAAAATA ACTCAAAGGG AGCTATTAAA CAGTCTTGAT ATTCCCTGCC CTGATTGGTT GTCTATACCA TTAAAAAAAT CAACAGAAAT TGATCTTCCT GCAGATTGGG GATTTCCTTT GATGGCAAAA GCTGCCAAAG GTGGATATGA CGGGAAAGGA ACTAAAATTA TTAAAAATCT AAAGCAACTT CAAGAATTTC TATCAGTTGA AAGAGAAGGG CAATGGATGT TAGAGAAATG GATCTCTTTT GATAAGGAAT TATCCATTGT TTCTAGTAGG GATTCAAAAG GAATTGTACG TAGTCTGCCA ATCGTAGAGA CATATCAATC TAAACAAGTA TGTGACTGGG TCCTTGCTCC AGCTGATATC AATCATGACG TTGATCTTAT GGTTAGAAAT ATCGCAGCTT CGTTGCTTGC TGAGTTGCAA TATGTTGGAG TTATTGCTAT TGAATTTTTC TATGGATCTG AAGGATTACT TGTAAATGAA ATAGCTCCAA GAACTCATAA CTCAGGTCAT TTTTCTATTG ATGCTTGTAG CAGCAGTCAG TTTGATCAAC AAATATGTAT CACCTCTGGT ATTGATGTAC CCATGCCTGA AATGCTTGTT AATGGTGCTT TAATGGCAAA CTTGCTTGGT TTGCAAAGTA ACTATCCAAC ATCACTTACC CAAAGATTGA ATGATTTGAG GGGTATTCCT GGCTTGAATG TTCATTGGTA TGAAAAAGAG GAAGAAAAAA AGGGCAGGAA GCTTGGTCAC GTTACATATC TCTTGAATAA TAAGGACGCT TTGTCTAGAA AAAAAGAAGC ATTAGATGTT TTACAAACCA TACGGTCAAT TTGGCCGACC TCTTGA
|
Protein sequence | MIGVVGGGQL AMLLIEAGKK RNVDVVVQTA AKTDPAAKKT NQLVLHDPTN PVGTKLLAEK TRLITFENEW VDISSLLSLE NNGVSFVPRL QSIRPLINKI TQRELLNSLD IPCPDWLSIP LKKSTEIDLP ADWGFPLMAK AAKGGYDGKG TKIIKNLKQL QEFLSVEREG QWMLEKWISF DKELSIVSSR DSKGIVRSLP IVETYQSKQV CDWVLAPADI NHDVDLMVRN IAASLLAELQ YVGVIAIEFF YGSEGLLVNE IAPRTHNSGH FSIDACSSSQ FDQQICITSG IDVPMPEMLV NGALMANLLG LQSNYPTSLT QRLNDLRGIP GLNVHWYEKE EEKKGRKLGH VTYLLNNKDA LSRKKEALDV LQTIRSIWPT S
|
| |