Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PMN2A_1632 |
Symbol | purH |
ID | 3607032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL2A |
Kingdom | Bacteria |
Replicon accession | NC_007335 |
Strand | - |
Start bp | 301066 |
End bp | 302622 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 637688512 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_292823 |
Protein GI | 72383468 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACCGA TAGCTCTGCT AAGTGTCTCA GACAAAACTG GCTTAATTCC ACTTGCGAAA GCATTAGTTA ATGATCTGGG CTTCAAAATC ATTTCAAGTG GCGGGACTGC AAAGTTAATT GAGAGTGAAA ATCTTCCTGT TACAAGAGTC GCAGATTACA CAGGATTCCC AGAGATTCTT GGAGGAAGAG TAAAAACTCT AAACCCAAAA ATTCATGGAG GGATATTAGC CAGACGAGAT AAACAATCTC ATTTAGATGA TTTAGATAAA CAAAATATCA ATCCAATAGA CTTGGTGGTT GTTAACTTAT ATCCATTTGT AAAAACAATT TCCAAAGAGA ATGTTTCATG GGAGGAAGCT ATCGAAAATA TTGATATTGG TGGTCCAACA ATGATCCGAG CAGCAGCAAA AAACCATCAA GATGTTCTTG TAGTTACTGA TCCAAGTCAA TACTCAAACT TAATTGATGC CTATAAATCA AAAAAGATCA CTACTGAATT ACGAAAAAAA TATTCGCAAC AAGCTTTTGA GCATACCGCG ACGTATGACC TAACAATAAG TAATTGGATT GCCAACCAAA GCTCCTCAAA AAAGGTTTCT TGGTTGCAAA GCTTGCCATT AAAGCAAGAA CTTAGGTATG GAGAAAATCC TCATCAAAAA GCTTCATGGT ATGGAGAGCC TGAAAAAGGA TGGAGTGGAG CTAATCAATT ACAAGGCAAA GAATTAAGTA CAAATAATCT TCTAGATCTG GAGGCTGCTT TATCTACTCT TCGTGAATTT GGGTATAAAA ATAATATTAG TAACCCTTCA TATCAAAAAG CAGCGGTAGT AATTAAGCAT ACAAATCCTT GTGGAGTAGC TATTGGAGAT TCTCCATCTT CAGCTCTTAA AAGAGCATTA GATGGCGATA GAGTAAGTGC TTTTGGGGGT ATTATTGCTA TCAATTGCCC CGTTGATGAA GCTGCAGCAA AAGAAATTGA AAATATATTT ATTGAATGTG TTGTAGCTCC ATATTTTGAT GAAACTGCAA AAGAAATACT TTCAAAAAAG AAAAATCTTA GGCTCTTAGA ATTAAAAGCT GAGTCTGTCC AAAAAGCAGA TAAAAATCAC ATAAGAAGCA TACTTGGTGG TTTATTAATT CAAGATTTAG ACGAACCAAG TATTGATCAA AAAAAATGGA AAAGTGTTAC TGAACTAATC CCAACAGATG AAGAAATGAA TGACTTATCT TTTGCTTGGA AAATTGTAAA ACATATACGA TCAAACGCAA TAGCTGTTGC ATCCAATCAG CAGAGTCTAG GGATTGGAGC TGGCCAAATG AATAGGGTAG GTTCAGCAAA ACTTGCATTA GAAGCTGCTG GTACAAAATC AAAAGGTGCT GTTTTGGCTA GTGATGGTTT TTTCCCATTC GACGATACTG TAAAGATGGC TTCTGATTAT GGTATTAGTT CAATTATTCA GCCTGGTGGA AGCATTAGAG ACGAAGATTC TATTAAAGCC TGCAATGAAT TAGGAATAAA AATGATTCTT ACTGGTAAAA GGCACTTTTT ACATTGA
|
Protein sequence | MSPIALLSVS DKTGLIPLAK ALVNDLGFKI ISSGGTAKLI ESENLPVTRV ADYTGFPEIL GGRVKTLNPK IHGGILARRD KQSHLDDLDK QNINPIDLVV VNLYPFVKTI SKENVSWEEA IENIDIGGPT MIRAAAKNHQ DVLVVTDPSQ YSNLIDAYKS KKITTELRKK YSQQAFEHTA TYDLTISNWI ANQSSSKKVS WLQSLPLKQE LRYGENPHQK ASWYGEPEKG WSGANQLQGK ELSTNNLLDL EAALSTLREF GYKNNISNPS YQKAAVVIKH TNPCGVAIGD SPSSALKRAL DGDRVSAFGG IIAINCPVDE AAAKEIENIF IECVVAPYFD ETAKEILSKK KNLRLLELKA ESVQKADKNH IRSILGGLLI QDLDEPSIDQ KKWKSVTELI PTDEEMNDLS FAWKIVKHIR SNAIAVASNQ QSLGIGAGQM NRVGSAKLAL EAAGTKSKGA VLASDGFFPF DDTVKMASDY GISSIIQPGG SIRDEDSIKA CNELGIKMIL TGKRHFLH
|
| |