Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_02891 |
Symbol | purH |
ID | 4911155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 265622 |
End bp | 267175 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640159857 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001090513 |
Protein GI | 126695627 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0953231 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCCAT TAGCTTTAGT AAGTGTCTCT GATAAAAAAA ATATAATCCC ATTTTGCAAG GAATTGATAG AGCAATTTAA TTATAAAATT CTATCAAGTG GAGGAACTGC CAAACATCTT ATAGATGCTA AGATTCCAGT TATTAAAGTT GCTGATTTTA CAAATTCTCC AGAAATTCTT GGAGGAAGAG TTAAAACTTT ACATCCAAAA ATACACGGGG GAATATTAGC TAAAAGAACT GATGAGGAAC ACAAAAAAGA TGTAGAAACT AACAACCTTG AGTTAATTGA CTTAGTAGTT GTCAATTTAT ATCCTTTTAA AAAAACCGTA GATCAAGGAG CACAATGGGA AGATGCTATT GAAAATATCG ATATCGGAGG GCCATCTATG ATTCGTTCTG CAGCTAAAAA TCATAAAGAT GTTTCTGTTT TAGTAGATCC TAGTCAGTAT CAAAATTTTC TTGAAGAAAG TAAAAAAGGT GAATTGAAAG ACGCATATAA AGCAAAATTA GCCCTTGAAG CTTTTCAACA TACAGCAGAC TATGACACTG CAATATCTAA TTGGATAAGA AAAGAAAGAG ATTTACAATC TTCCAAATAT ATTGAATCTT ATCCACTAAT CAAAACCTTG AGATATGGGG AGAATCCACA TCAAAAAGCT TTTTGGTACG GTTTAAGTAA CATTGGATGG AACTCAGCAG AACAATTACA AGGTAAAGAC TTAAGTTATA ACAATCTATT GGATCTAGAG TCGGCACTTT CAACAGTTTT AGAATTTGGC TACACAGAAA AAGATGAACT TAAAACGGAC ATGTTTGCCT CCGTTATTTT AAAACACAAT AATCCTTGTG GTGCCTCTAT AAGTAATTCA GCTTCTAAAG CATTTTTGAA TGCCTTGGAA TGTGACTCTG TTAGTGCATT CGGAGGAATA GTTGCTTTTA ATTCAAATGT TGATAGTGAC ACCGCTGTTC ACCTCAAAGA TATTTTCTTA GAGTGTGTCG TCGCTCCATC TTTTGATGAA GAAGCCTTAG AAATTTTAAA AGTTAAAAAG AATTTAAGAA TTTTAAAGTT TTCAAAAGAT CAACTTCCAA AAAAGAATCA AAATTCTACT AAATCAATAA TGGGAGGATT ACTAGTTCAA GATACTGACG ATAGTCAAGA AAAAACTGAG GATTGGATTT CAGTAACTAA TAAAAATGCG AATAATCAAG CTAACTTAGA TCTAAATTTT GCATGGAAAA TTTGTAAACA CGTGAAATCT AATGCCATTG TTATTGCAAA AGACCAAAAA ACTATTGGTA TTGGAGCTGG ACAAATGAAT AGAGTTGGAG CAGCAAAAAT TGCATTAAAA GCAGCTGGAA GTTTATGTTC TGATGCTGTC TTGGCTAGCG ATGGGTTTTT CCCATTTGCA GATACTGTAG AACTAGCACA CGAATATGGA ATAAAAGCTA TTATTCAACC TGGAGGAAGT CTAAGAGACC AAGAAAGTAT TGATATGTGT AATTTGAAAG GAATATCAAT GATATTTACC CAAAAAAGGC ATTTTTTACA TTAA
|
Protein sequence | MSPLALVSVS DKKNIIPFCK ELIEQFNYKI LSSGGTAKHL IDAKIPVIKV ADFTNSPEIL GGRVKTLHPK IHGGILAKRT DEEHKKDVET NNLELIDLVV VNLYPFKKTV DQGAQWEDAI ENIDIGGPSM IRSAAKNHKD VSVLVDPSQY QNFLEESKKG ELKDAYKAKL ALEAFQHTAD YDTAISNWIR KERDLQSSKY IESYPLIKTL RYGENPHQKA FWYGLSNIGW NSAEQLQGKD LSYNNLLDLE SALSTVLEFG YTEKDELKTD MFASVILKHN NPCGASISNS ASKAFLNALE CDSVSAFGGI VAFNSNVDSD TAVHLKDIFL ECVVAPSFDE EALEILKVKK NLRILKFSKD QLPKKNQNST KSIMGGLLVQ DTDDSQEKTE DWISVTNKNA NNQANLDLNF AWKICKHVKS NAIVIAKDQK TIGIGAGQMN RVGAAKIALK AAGSLCSDAV LASDGFFPFA DTVELAHEYG IKAIIQPGGS LRDQESIDMC NLKGISMIFT QKRHFLH
|
| |