Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PMT9312_0268 |
Symbol | purH |
ID | 3765055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9312 |
Kingdom | Bacteria |
Replicon accession | NC_007577 |
Strand | - |
Start bp | 256469 |
End bp | 258022 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 637796776 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_396765 |
Protein GI | 78778653 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACCAT TAGCTTTAGT AAGTGTCTCA GATAAAAAAC ATATAGTCCC ATTTTGTATG GAATTGGTAG AACAATTTAA TTATAGAATT CTATCAAGTG GAGGAACTGC CAAACATCTA ATAGAGGCAA ATATTCCAGT TATTAAAGTT GCAGATTTTA CTAATTCTCC AGAAATCCTT GGAGGAAGAG TGAAAACTTT ACATCCAAAA ATACACGGAG GAATATTAGC TAAAAGAACT GATGAAGAAC ATAAAAGAGA TATAGAAGCT AACGATCTTG AGTTAATTGA CTTAGTAGTT GTCAATTTAT ATCCTTTTAA GAAAACTGTA GAACAGGGAT CTAAATGGGA AGATTCTATT GAAAATATCG ATATAGGAGG GCCATCAATG ATTCGTTCTG CAGCTAAAAA TCATAAAGAC GTTTCTGTTT TAGTCGATCC TAGTCAATAT CAGAATTTTC TTGAAGAAAG TAAAAAAGGT GAGTTAAAGG ATTCATATAA AGCACAATTA GCCCTTGAAG CTTTTCAACA TACAGCAGAC TATGACACTG CAATATCTAA TTGGATAAGT AAAGAAAGAG GTTTACAATC CTCTAAATAT ATTGAATCTT ATCCACTAAT CAACACCTTA AGATATGGGG AGAATCCTCA TCAAAAAGCT TTGTGGTATG GTTTGAGTAA TATTGGATGG AACTCAGCAG AACAATTACA AGGTAAAGAT TTAAGTTATA ACAATTTATT AGATCTTGAG TCAGCACTTT CAACAGTTTT AGAATTTGGC TATGCAGAAA AAGATGAACT TACTACAGAT ACGTTTGCAT CTGTTATTCT AAAACACAAT AATCCTTGTG GTGCCTCTAT AAGTAATTCA GCCTCTCAAG CATTTTTGAA TGCTTTGGAA TGCGACTCAG TTAGTGCATT TGGAGGAATA GTTGCTTTTA ATTCAAATGT TGATAGTGAA ACCGCAATTA ACCTCAAAGA TATTTTCCTA GAGTGTGTCG TAGCTCCATC TTTTGATGCG GAAGCTTTAG AAATTTTAAA AATCAAAAAG AATTTAAGAA TTTTAAAGTT ATCAAAAGAT CAACTCCCAA AAAAGAAGCA AACTTCTACT AAATCAATAA TGGGAGGATT ACTTGTTCAA GACACTAATG ATAGTGAAGA TAAAACTGAA AGTTGGATGT CAGTAACTAA GAATAATCCG AGTAATCAAA TGAACTTAGA TCTAAATTTT GCATGGAAAA TTTGTAAACA TGTGAAATCG AATGCGATTG TTATTGCAAA AGACCAAAAA ACTATTGGTA TAGGAGCTGG ACAAATGAAT AGAGTTGGAG CAGCAAAAAT CGCATTACAA GCAGCTGGAA AGTTATGTTC TGATGCTGTC TTGGCCAGTG ATGGGTTTTT TCCATTTGCA GATACTGTAG AACTTGCAAA TGAGTATGGT ATAAAAGCAA TTATCCAACC TGGAGGGAGT CTAAGAGACC AAGAAAGTAT TGATATGTGT AATTCTAAAG GAATCTCAAT GGTAATTACG CAAAAAAGGC ATTTTTTGCA TTAG
|
Protein sequence | MSPLALVSVS DKKHIVPFCM ELVEQFNYRI LSSGGTAKHL IEANIPVIKV ADFTNSPEIL GGRVKTLHPK IHGGILAKRT DEEHKRDIEA NDLELIDLVV VNLYPFKKTV EQGSKWEDSI ENIDIGGPSM IRSAAKNHKD VSVLVDPSQY QNFLEESKKG ELKDSYKAQL ALEAFQHTAD YDTAISNWIS KERGLQSSKY IESYPLINTL RYGENPHQKA LWYGLSNIGW NSAEQLQGKD LSYNNLLDLE SALSTVLEFG YAEKDELTTD TFASVILKHN NPCGASISNS ASQAFLNALE CDSVSAFGGI VAFNSNVDSE TAINLKDIFL ECVVAPSFDA EALEILKIKK NLRILKLSKD QLPKKKQTST KSIMGGLLVQ DTNDSEDKTE SWMSVTKNNP SNQMNLDLNF AWKICKHVKS NAIVIAKDQK TIGIGAGQMN RVGAAKIALQ AAGKLCSDAV LASDGFFPFA DTVELANEYG IKAIIQPGGS LRDQESIDMC NSKGISMVIT QKRHFLH
|
| |