Gene A9601_02881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_02881 
SymbolpurH 
ID4716974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp266042 
End bp267595 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content32% 
IMG OID640077989 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001008683 
Protein GI123967825 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTCCAT TAGCTTTAGT AAGTGTCTCT GATAAAAAAA ATATAATCCC ATTTTGCAAG 
GAATTGGTAG AGCATTTCAA TTATAAAATT CTATCAAGTG GAGGAACTGC CAAACATCTT
ATAGAGGCAA AAATTCCAGT TATTAAAGTT GCTGATTTTA CTAATTCTCC GGAAATTCTT
GGAGGAAGAG TTAAAACTTT ACATCCAAAA ATACACGGGG GAATATTAGC TATAAGAACT
GATGAGGAAC ACAAAAAAGA TATAGAAGCT AACAATCTTG AGTTAATTGA TTTGGTAGTT
GTCAATTTAT ATCCTTTTAA AAAAACTGTA GATGGGGGAG CTAAATGGGA AGATGCTATT
GAAAATATCG ATATCGGAGG GCCATCTATG ATTCGTTCTG CAGCTAAAAA TCATAAAGAT
GTTTCCGTTT TAGTAGATTC TAGTCAGTAT CAAAGTTTTC TTGAAGAAAG TAAAAAAGGT
GAATTGAAAG ACTCATATAA AGCAAAATTA GCCCTTGAAG CTTTTCAACA TACAGCAGAT
TATGACACTG CAATATCTAA TTGGATAAGA AAAGAAAGAG ATTTACAATC TTCTAAATAT
ATTGAATCTT ATCCACTAAT CAAAACCTTA AGATATGGAG AGAATCCACA TCAAAAAGCT
TTTTGGTATG GTTTAAGTAA CATTGGATGG AACTCAGCAG AGCAATTACA AGGAAAAGAC
TTAAGTTATA ACAATCTATT AGATCTAGAG TCGGCACTTT CAACAGTTTT AGAATTTGGC
TACACAGAAA AAGATGAACT TGCAACCGAT ATGGTTGCCT CTGTTATTTT AAAACACAAT
AATCCTTGTG GTGCCTCTAT GAGTAATTCA GCTTCTAAAG CATTTTTGAA TGCTTTAGAA
TGCGACTCTG TAAGTGCATT TGGAGGAATA GTTGCTTTTA ATTCAAATGT TGATAGTGAG
ACAGCAATTC ACCTCAAAGA TATTTTCTTA GAGTGTGTCG TCGCTCCATC TTTTGATGAA
GAAGCTTTAG AAATTTTAAA AGTTAAAAAG AATTTAAGAA TCTTAAAGAT TTCAAAAGAT
CAACTTCCAC AAAAGAATCA AAATTCTACT AAATCAATAA TGGGAGGATT ACTAGTTCAA
GATACTGACG ATAGTGAAGA AAAAACTGAA AATTGGATTT CAGTAACTAA TAAAAATCCA
AGTAATCAAA TTAACTTAGA TCTAAATTTT GCATGGAAAA TTTGTAAACA TGTTAAATCT
AATGCAATTG TTATTGCAAA AGACCAAAAA ACTATTGGTA TTGGAGCTGG GCAAATGAAC
AGAGTTGGAG CAGCAAAAAT TGCATTAAAA GCAGCTGGAA GGTTATGTTC TGATGCTGTC
TTGGCTAGCG ATGGGTTTTT CCCATTTGCA GATACTGTAG AAATAGCAAA TGAATATGGA
ATAAAAGCTA TTATTCAACC TGGAGGAAGT CTAAGAGACC AAGAAAGTAT TGATATGTGT
AATTCAAAAG GAATCTCAAT GGTATTTACG CAAAAAAGAC ATTTTTTACA TTAA
 
Protein sequence
MSPLALVSVS DKKNIIPFCK ELVEHFNYKI LSSGGTAKHL IEAKIPVIKV ADFTNSPEIL 
GGRVKTLHPK IHGGILAIRT DEEHKKDIEA NNLELIDLVV VNLYPFKKTV DGGAKWEDAI
ENIDIGGPSM IRSAAKNHKD VSVLVDSSQY QSFLEESKKG ELKDSYKAKL ALEAFQHTAD
YDTAISNWIR KERDLQSSKY IESYPLIKTL RYGENPHQKA FWYGLSNIGW NSAEQLQGKD
LSYNNLLDLE SALSTVLEFG YTEKDELATD MVASVILKHN NPCGASMSNS ASKAFLNALE
CDSVSAFGGI VAFNSNVDSE TAIHLKDIFL ECVVAPSFDE EALEILKVKK NLRILKISKD
QLPQKNQNST KSIMGGLLVQ DTDDSEEKTE NWISVTNKNP SNQINLDLNF AWKICKHVKS
NAIVIAKDQK TIGIGAGQMN RVGAAKIALK AAGRLCSDAV LASDGFFPFA DTVEIANEYG
IKAIIQPGGS LRDQESIDMC NSKGISMVFT QKRHFLH