Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_07981 |
Symbol | purK |
ID | 5730075 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 703189 |
End bp | 704388 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641285162 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_001550683 |
Protein GI | 159903339 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00471712 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGAAATG AATCATTAAC AAGAAATAAC AAAGTAGGCC CAACAGTTGG TGTCGTTGGT GGAGGACAAC TTGCGCAGAT GTTGGCTAAA GCTGCCAGAG AGAGAGGGAT TGATTTAATT GTTCAGACTG GATTGAGGAG TGACCCTGCC GTTCAATATG CTAAGGGATT AGTACTTTCG GATACAAGCG ATATTAATGG AACAAAAGAA CTTGCAAGCA AATGTAGCTG CTTGACCTTT GAGAATGAAT GGATTGATGT AACGTCTCTC TCTTCGCTGG CTAACGATAG CTCCTTATTT CAGCCTAGCT TAAATTCAAT TAAGCCATTA GTAGACAAAC TTTCTCAAAG AAAGCTTTTA AACGATTTGA ATATCCCTGG TCCAGAATGG CTGCCCCTAG CATGTATTAA AAAAAAGGAT TTAGAGCTTC CAGACGGTTG GGCATATCCA GTAATGGCAA AGGCAGGTAA AGGAGGATAT GACGGTAAAG GAACAAGAGT TATAAATGAT GCCAATGAAC TTAAGGAGCT GTTTTTCTCT GTTGATGTTT CTAATTGGTT CTTAGAGAAA TGGATTAGCT ATCAAAAAGA GTTGGCAATT GTTGTCAGTC GAGATACCTG TGGACGAATA AACTCTTATC CCTTAACAGA GACTTTCCAA CATAAGCAAG TCTGTGATTG GGTTGTTGCA CCTGCGAATG TAAGTCATTC AGTCCTTGTC ACGGCTTATA ACGTAGGTGC TTCGTTATTG AGAGAGCTAA ATTATGTAGG TGTACTTGCG ATTGAATTCT TTTATGGGGA TGAGGGATTG CTCGTCAATG AAATAGCCCC ACGTACTCAC AACTCTGCTC ATTTTACAAT TGACGCATGC AGTAGTAGTC AATTTGACCA ACAAATATGC ATAGCTGCCG GTCTACCAGC CCCTCCGGTT AAATTAGTCG TACCAGGCGC AATCATGATC AATTTGTTAG GCTTGCGAGG CAAAGCAAAT TCCTTAGATG AACGATTGGA GAAGTTAAAG CAAATAAATG GCGCAAAACT TCATTGGTAT TCTAAAGATA AAGTCTTGCC TGGAAGAAAA TTAGGTCATT TAACAGTTCC CTTACTCGAT TTAGACCCCA CTTCAAGAAT TAATAAGGCG ACAAGCATTT TAAAAAAAGT AAGAGCAATC TGGCCCTTTT TTGTTCCCGA TATCAATTAG
|
Protein sequence | MRNESLTRNN KVGPTVGVVG GGQLAQMLAK AARERGIDLI VQTGLRSDPA VQYAKGLVLS DTSDINGTKE LASKCSCLTF ENEWIDVTSL SSLANDSSLF QPSLNSIKPL VDKLSQRKLL NDLNIPGPEW LPLACIKKKD LELPDGWAYP VMAKAGKGGY DGKGTRVIND ANELKELFFS VDVSNWFLEK WISYQKELAI VVSRDTCGRI NSYPLTETFQ HKQVCDWVVA PANVSHSVLV TAYNVGASLL RELNYVGVLA IEFFYGDEGL LVNEIAPRTH NSAHFTIDAC SSSQFDQQIC IAAGLPAPPV KLVVPGAIMI NLLGLRGKAN SLDERLEKLK QINGAKLHWY SKDKVLPGRK LGHLTVPLLD LDPTSRINKA TSILKKVRAI WPFFVPDIN
|
| |