Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_02931 |
Symbol | purH |
ID | 5731796 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 277278 |
End bp | 278834 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641284639 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001550178 |
Protein GI | 159902834 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.848111 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCGTA TAGCATTAAT AAGTGTTTCC AATAAAGATG GGCTAATTCC TTTTGCGAAA ACATTAACAA CCCTTCATGG TTTTGAGATT ATTTCCAGTG GAGGTACTGC TAGAGCGCTG AAAGAAGCAA ATATCCCTGT CAAAACAGTT TCTGATTACA CTGGAGCTCC AGAAATTCTT GGTGGCCGAG TAAAAACTCT TCATCCTCGA ATACATGGCG GAATACTTGC CAAGCAAGGA AATTCATCTC ATCAATTTGA TCTCGAAAAA GAAAACATCA AAAATATTGA TCTTGTAGTC GTAAACCTTT ATCCATTTCA AGAAACAATC TCTGATCCAG ATGTTACATG GGATAACGCA ATAGAAAATA TTGATATTGG CGGGCCTGCA ATGATTCGCG CAGCAGCCAA AAACCATGAA TCAGTAAGTA TTCTGACTAA TCCAAATCAA TATGACGCTT TTCTTGAAAA ATTAGAAGCT GGGGAGATTT CAACAACTAT CAAAGCAAAA CTCGCTCTTG AAGCTTTCGA GCACACCGCA AGTTATGACA TAGCAATTAG CCAATGGTTA AGCAAGCAAA TTGAATCTAA ATATTCTCCA TATTTAACTT CTCAACCAAT TAAACAAACT CTGAGGTATG GAGAGAATCC TCATCAAAAT GCAAATTGGT ATAGCGCAGT TAATCAAGGG TGGGGGCAAG CTGAACAATT ACAAGGTAAA GAGCTCAGCA CAAATAATCT TCTAGATCTA GAAGCTGCTG TTGCAACAAT AAGAGAATTT GGATATGACT TAGGCAATAA AGGCAATTCG TGCGAGAAAG CTGCAGTCAT TATTAAGCAC ACTAATCCTT GTGGAGTAGC TGTAAGCAAT AATCTGAGCA ATGCATTCAA CCTGGCCCTT GAGTGCGACT CAATTAGTGC ATTTGGAGGA ATTGTTGCTC TTAACTGCAA TTTAGATGCT GCTACAGCAA AAGAACTAAG CAGTCTATTT TTAGAATGTG TAGTAGCTCC AGACTATGAC GCTAACGCTT TAGAGATCCT TTCAACGAAA AAAAATTTAA GGATAATTAA ACTTAGTCAC AGCTCTATAA AGTCGTCCGA ACGTAAGTAT ATAAGAAGCA TTTTAGGAGG AATATTGGTT CAGGAAGTTG ATGACAAATT AATTGAACCT AATGAATGGA AAGTTCCTAC AAAATTACAA ATGTCTATTG AAGACAAAGC TGATCTAGCT TTCGCCTGGC GAGTAGTAAG ACATGTTAGA TCAAATGCAA TAGTAGTTGC ATCTGCTGGT CAAACTTTAG GAATAGGTGC AGGGCAAATG AATAGAATAG GGGCAGCAAA AATAGCTCTG GAAGCTGCAG GAGAAAAAGC TCAAGGTGCT GTATTAGCTA GTGATGGCTT CTTTCCCTTT GATGACACAG TACATTTGGC ATCAAGATAT GGAATCAAAT CAATAATTCA ACCAGGAGGA AGTATTCGAG ACCAATCATC TATAGATGCA TGCAATCAAT TAGGTCTCTC TATGATATTT ACTGGTAAAA GACATTTCCT TCATTAA
|
Protein sequence | MARIALISVS NKDGLIPFAK TLTTLHGFEI ISSGGTARAL KEANIPVKTV SDYTGAPEIL GGRVKTLHPR IHGGILAKQG NSSHQFDLEK ENIKNIDLVV VNLYPFQETI SDPDVTWDNA IENIDIGGPA MIRAAAKNHE SVSILTNPNQ YDAFLEKLEA GEISTTIKAK LALEAFEHTA SYDIAISQWL SKQIESKYSP YLTSQPIKQT LRYGENPHQN ANWYSAVNQG WGQAEQLQGK ELSTNNLLDL EAAVATIREF GYDLGNKGNS CEKAAVIIKH TNPCGVAVSN NLSNAFNLAL ECDSISAFGG IVALNCNLDA ATAKELSSLF LECVVAPDYD ANALEILSTK KNLRIIKLSH SSIKSSERKY IRSILGGILV QEVDDKLIEP NEWKVPTKLQ MSIEDKADLA FAWRVVRHVR SNAIVVASAG QTLGIGAGQM NRIGAAKIAL EAAGEKAQGA VLASDGFFPF DDTVHLASRY GIKSIIQPGG SIRDQSSIDA CNQLGLSMIF TGKRHFLH
|
| |