Gene P9515_02991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9515_02991 
SymbolpurH 
ID4720172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9515 
KingdomBacteria 
Replicon accessionNC_008817 
Strand
Start bp275364 
End bp276917 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content34% 
IMG OID640079964 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001010615 
Protein GI123965534 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACCAT TAGCCCTAGT AAGTGTCTCT GATAAAACAA ATATCATTCC ATTTTGTAAG 
GATTTAGTTG AAAAATTTGG TTATAATATT TTATCCAGCG GAGGGACCGC TGAGTACTTG
ACAGAGGCAA AAGTTCCTGT TCTTAAAGTA GCAGATTTTA CTGAATCTCC AGAGATTCTT
GACGGGAGAG TTAAAACTTT ACATCCAAAG ATTCATGGAG GAATTCTTGC TAAAAGATCT
AATGAAGAGC ATCAAAGGGA AATATTAGAA AACAAACTAG AATTGATTGA TTTGGTAGTT
GTTAATTTGT ATCCCTTTAA GAAAAAAGTG GAAGAGCAAT GTCCTTGGGA AGAAGCAATT
GAGAATATTG ATATAGGAGG TCCATCTATG ATACGCTCTG CAGCTAAAAA TCATGCTGAT
GTTGCAGTTT TAGTAGATCC TAATCAATAT CAAAATTATA TTGAAGAGAT CAAAAAAGGA
CCACTTAGCA AAGACTTTAA AACGAAATTA GCATTTGAGG CGTTTCAACA TACCGCAAGT
TATGACTCTG CAATATCAAA TTGGATTAGT AAAGAAAAGG ATTTAAGACC TTCGAATTTT
ATAGAATCAT ACCCGCTTAT AAAGCAATTA AGGTATGGAG AAAATCCCCA TCAAAAAGCA
TTATGGTATG GATTAAATAA TATTGGATGG AATTCAGCAG AACAATTACA AGGCAAAGAG
TTAAGCTATA ACAACATACT CGATCTTGAA TCAGCTCTAT TAACTGTATT AGAATTTGGA
TATGAAACAA AGCCTAACAT TAAGACCGAA TCAATAGCTG CAGTTATTCT CAAACATAAT
AATCCTTGTG GGGCTTCGAT TAGCAACTCA GCATCTAGTT CTTTTAAGAA TGCGTTAAAG
TGCGATTCAG TTAGTGCCTT TGGGGGCATA GTAGCATTTA ATGCCAATGT TGATAAAGAA
ACTGCCCTTA TTCTGAAAGA CATTTTTTTA GAGTGCGTAG TAGCACCATC CTTTGATAAA
GAAGCTTTAG AAATATTTAA AACCAAAAAG AATTTGAGAG TTTTAAAGTT AACAAAAGAA
ATGCTGCCTA AAGAAAACCA AACTTGTTCC AAATCAATTA TGGGAGGAAT ACTCATACAA
GATTCTGATA ATCAGGAAAA TTCAGAAGAT TCTTGGATTT CAGTAACCAA AAAGAATCCA
ACTGAACAGG AATATTTAGA TTTGAAATTT GCTTGGAAAA TTTGTAAACA TGTTAAGTCG
AACGCTATTG TAGTTGCAAA AGATCAACAA ACTCTTGGCA TAGGAGCTGG GCAAATGAAT
AGAGTTGGGG CTTCAAAAAT AGCTTTAGAA GCAGCTAAAG AAATTGATTC TGGAGGGGTT
TTAGCAAGCG ATGGTTTTTT CCCGTTCGCA GATACAGTGC GACTTGCAGA TAAGTATGGA
ATAAGTTCTA TTATTCAGCC GGGAGGTAGT ATAAGAGATG AAGAAAGCAT AAAAATGTGT
GATTCAAGGG GTATTTCCAT GATATTTACC CACAAAAGAC ACTTTTTACA TTAA
 
Protein sequence
MSPLALVSVS DKTNIIPFCK DLVEKFGYNI LSSGGTAEYL TEAKVPVLKV ADFTESPEIL 
DGRVKTLHPK IHGGILAKRS NEEHQREILE NKLELIDLVV VNLYPFKKKV EEQCPWEEAI
ENIDIGGPSM IRSAAKNHAD VAVLVDPNQY QNYIEEIKKG PLSKDFKTKL AFEAFQHTAS
YDSAISNWIS KEKDLRPSNF IESYPLIKQL RYGENPHQKA LWYGLNNIGW NSAEQLQGKE
LSYNNILDLE SALLTVLEFG YETKPNIKTE SIAAVILKHN NPCGASISNS ASSSFKNALK
CDSVSAFGGI VAFNANVDKE TALILKDIFL ECVVAPSFDK EALEIFKTKK NLRVLKLTKE
MLPKENQTCS KSIMGGILIQ DSDNQENSED SWISVTKKNP TEQEYLDLKF AWKICKHVKS
NAIVVAKDQQ TLGIGAGQMN RVGASKIALE AAKEIDSGGV LASDGFFPFA DTVRLADKYG
ISSIIQPGGS IRDEESIKMC DSRGISMIFT HKRHFLH