Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_02881 |
Symbol | purH |
ID | 4716974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 266042 |
End bp | 267595 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640077989 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001008683 |
Protein GI | 123967825 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCCAT TAGCTTTAGT AAGTGTCTCT GATAAAAAAA ATATAATCCC ATTTTGCAAG GAATTGGTAG AGCATTTCAA TTATAAAATT CTATCAAGTG GAGGAACTGC CAAACATCTT ATAGAGGCAA AAATTCCAGT TATTAAAGTT GCTGATTTTA CTAATTCTCC GGAAATTCTT GGAGGAAGAG TTAAAACTTT ACATCCAAAA ATACACGGGG GAATATTAGC TATAAGAACT GATGAGGAAC ACAAAAAAGA TATAGAAGCT AACAATCTTG AGTTAATTGA TTTGGTAGTT GTCAATTTAT ATCCTTTTAA AAAAACTGTA GATGGGGGAG CTAAATGGGA AGATGCTATT GAAAATATCG ATATCGGAGG GCCATCTATG ATTCGTTCTG CAGCTAAAAA TCATAAAGAT GTTTCCGTTT TAGTAGATTC TAGTCAGTAT CAAAGTTTTC TTGAAGAAAG TAAAAAAGGT GAATTGAAAG ACTCATATAA AGCAAAATTA GCCCTTGAAG CTTTTCAACA TACAGCAGAT TATGACACTG CAATATCTAA TTGGATAAGA AAAGAAAGAG ATTTACAATC TTCTAAATAT ATTGAATCTT ATCCACTAAT CAAAACCTTA AGATATGGAG AGAATCCACA TCAAAAAGCT TTTTGGTATG GTTTAAGTAA CATTGGATGG AACTCAGCAG AGCAATTACA AGGAAAAGAC TTAAGTTATA ACAATCTATT AGATCTAGAG TCGGCACTTT CAACAGTTTT AGAATTTGGC TACACAGAAA AAGATGAACT TGCAACCGAT ATGGTTGCCT CTGTTATTTT AAAACACAAT AATCCTTGTG GTGCCTCTAT GAGTAATTCA GCTTCTAAAG CATTTTTGAA TGCTTTAGAA TGCGACTCTG TAAGTGCATT TGGAGGAATA GTTGCTTTTA ATTCAAATGT TGATAGTGAG ACAGCAATTC ACCTCAAAGA TATTTTCTTA GAGTGTGTCG TCGCTCCATC TTTTGATGAA GAAGCTTTAG AAATTTTAAA AGTTAAAAAG AATTTAAGAA TCTTAAAGAT TTCAAAAGAT CAACTTCCAC AAAAGAATCA AAATTCTACT AAATCAATAA TGGGAGGATT ACTAGTTCAA GATACTGACG ATAGTGAAGA AAAAACTGAA AATTGGATTT CAGTAACTAA TAAAAATCCA AGTAATCAAA TTAACTTAGA TCTAAATTTT GCATGGAAAA TTTGTAAACA TGTTAAATCT AATGCAATTG TTATTGCAAA AGACCAAAAA ACTATTGGTA TTGGAGCTGG GCAAATGAAC AGAGTTGGAG CAGCAAAAAT TGCATTAAAA GCAGCTGGAA GGTTATGTTC TGATGCTGTC TTGGCTAGCG ATGGGTTTTT CCCATTTGCA GATACTGTAG AAATAGCAAA TGAATATGGA ATAAAAGCTA TTATTCAACC TGGAGGAAGT CTAAGAGACC AAGAAAGTAT TGATATGTGT AATTCAAAAG GAATCTCAAT GGTATTTACG CAAAAAAGAC ATTTTTTACA TTAA
|
Protein sequence | MSPLALVSVS DKKNIIPFCK ELVEHFNYKI LSSGGTAKHL IEAKIPVIKV ADFTNSPEIL GGRVKTLHPK IHGGILAIRT DEEHKKDIEA NNLELIDLVV VNLYPFKKTV DGGAKWEDAI ENIDIGGPSM IRSAAKNHKD VSVLVDSSQY QSFLEESKKG ELKDSYKAKL ALEAFQHTAD YDTAISNWIR KERDLQSSKY IESYPLIKTL RYGENPHQKA FWYGLSNIGW NSAEQLQGKD LSYNNLLDLE SALSTVLEFG YTEKDELATD MVASVILKHN NPCGASMSNS ASKAFLNALE CDSVSAFGGI VAFNSNVDSE TAIHLKDIFL ECVVAPSFDE EALEILKVKK NLRILKISKD QLPQKNQNST KSIMGGLLVQ DTDDSEEKTE NWISVTNKNP SNQINLDLNF AWKICKHVKS NAIVIAKDQK TIGIGAGQMN RVGAAKIALK AAGRLCSDAV LASDGFFPFA DTVEIANEYG IKAIIQPGGS LRDQESIDMC NSKGISMVFT QKRHFLH
|
| |