Gene P9301_02891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_02891 
SymbolpurH 
ID4911155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp265622 
End bp267175 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content33% 
IMG OID640159857 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001090513 
Protein GI126695627 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0953231 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTCCAT TAGCTTTAGT AAGTGTCTCT GATAAAAAAA ATATAATCCC ATTTTGCAAG 
GAATTGATAG AGCAATTTAA TTATAAAATT CTATCAAGTG GAGGAACTGC CAAACATCTT
ATAGATGCTA AGATTCCAGT TATTAAAGTT GCTGATTTTA CAAATTCTCC AGAAATTCTT
GGAGGAAGAG TTAAAACTTT ACATCCAAAA ATACACGGGG GAATATTAGC TAAAAGAACT
GATGAGGAAC ACAAAAAAGA TGTAGAAACT AACAACCTTG AGTTAATTGA CTTAGTAGTT
GTCAATTTAT ATCCTTTTAA AAAAACCGTA GATCAAGGAG CACAATGGGA AGATGCTATT
GAAAATATCG ATATCGGAGG GCCATCTATG ATTCGTTCTG CAGCTAAAAA TCATAAAGAT
GTTTCTGTTT TAGTAGATCC TAGTCAGTAT CAAAATTTTC TTGAAGAAAG TAAAAAAGGT
GAATTGAAAG ACGCATATAA AGCAAAATTA GCCCTTGAAG CTTTTCAACA TACAGCAGAC
TATGACACTG CAATATCTAA TTGGATAAGA AAAGAAAGAG ATTTACAATC TTCCAAATAT
ATTGAATCTT ATCCACTAAT CAAAACCTTG AGATATGGGG AGAATCCACA TCAAAAAGCT
TTTTGGTACG GTTTAAGTAA CATTGGATGG AACTCAGCAG AACAATTACA AGGTAAAGAC
TTAAGTTATA ACAATCTATT GGATCTAGAG TCGGCACTTT CAACAGTTTT AGAATTTGGC
TACACAGAAA AAGATGAACT TAAAACGGAC ATGTTTGCCT CCGTTATTTT AAAACACAAT
AATCCTTGTG GTGCCTCTAT AAGTAATTCA GCTTCTAAAG CATTTTTGAA TGCCTTGGAA
TGTGACTCTG TTAGTGCATT CGGAGGAATA GTTGCTTTTA ATTCAAATGT TGATAGTGAC
ACCGCTGTTC ACCTCAAAGA TATTTTCTTA GAGTGTGTCG TCGCTCCATC TTTTGATGAA
GAAGCCTTAG AAATTTTAAA AGTTAAAAAG AATTTAAGAA TTTTAAAGTT TTCAAAAGAT
CAACTTCCAA AAAAGAATCA AAATTCTACT AAATCAATAA TGGGAGGATT ACTAGTTCAA
GATACTGACG ATAGTCAAGA AAAAACTGAG GATTGGATTT CAGTAACTAA TAAAAATGCG
AATAATCAAG CTAACTTAGA TCTAAATTTT GCATGGAAAA TTTGTAAACA CGTGAAATCT
AATGCCATTG TTATTGCAAA AGACCAAAAA ACTATTGGTA TTGGAGCTGG ACAAATGAAT
AGAGTTGGAG CAGCAAAAAT TGCATTAAAA GCAGCTGGAA GTTTATGTTC TGATGCTGTC
TTGGCTAGCG ATGGGTTTTT CCCATTTGCA GATACTGTAG AACTAGCACA CGAATATGGA
ATAAAAGCTA TTATTCAACC TGGAGGAAGT CTAAGAGACC AAGAAAGTAT TGATATGTGT
AATTTGAAAG GAATATCAAT GATATTTACC CAAAAAAGGC ATTTTTTACA TTAA
 
Protein sequence
MSPLALVSVS DKKNIIPFCK ELIEQFNYKI LSSGGTAKHL IDAKIPVIKV ADFTNSPEIL 
GGRVKTLHPK IHGGILAKRT DEEHKKDVET NNLELIDLVV VNLYPFKKTV DQGAQWEDAI
ENIDIGGPSM IRSAAKNHKD VSVLVDPSQY QNFLEESKKG ELKDAYKAKL ALEAFQHTAD
YDTAISNWIR KERDLQSSKY IESYPLIKTL RYGENPHQKA FWYGLSNIGW NSAEQLQGKD
LSYNNLLDLE SALSTVLEFG YTEKDELKTD MFASVILKHN NPCGASISNS ASKAFLNALE
CDSVSAFGGI VAFNSNVDSD TAVHLKDIFL ECVVAPSFDE EALEILKVKK NLRILKFSKD
QLPKKNQNST KSIMGGLLVQ DTDDSQEKTE DWISVTNKNA NNQANLDLNF AWKICKHVKS
NAIVIAKDQK TIGIGAGQMN RVGAAKIALK AAGSLCSDAV LASDGFFPFA DTVELAHEYG
IKAIIQPGGS LRDQESIDMC NLKGISMIFT QKRHFLH