Gene P9211_02931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_02931 
SymbolpurH 
ID5731796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp277278 
End bp278834 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content38% 
IMG OID641284639 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001550178 
Protein GI159902834 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.848111 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCGTA TAGCATTAAT AAGTGTTTCC AATAAAGATG GGCTAATTCC TTTTGCGAAA 
ACATTAACAA CCCTTCATGG TTTTGAGATT ATTTCCAGTG GAGGTACTGC TAGAGCGCTG
AAAGAAGCAA ATATCCCTGT CAAAACAGTT TCTGATTACA CTGGAGCTCC AGAAATTCTT
GGTGGCCGAG TAAAAACTCT TCATCCTCGA ATACATGGCG GAATACTTGC CAAGCAAGGA
AATTCATCTC ATCAATTTGA TCTCGAAAAA GAAAACATCA AAAATATTGA TCTTGTAGTC
GTAAACCTTT ATCCATTTCA AGAAACAATC TCTGATCCAG ATGTTACATG GGATAACGCA
ATAGAAAATA TTGATATTGG CGGGCCTGCA ATGATTCGCG CAGCAGCCAA AAACCATGAA
TCAGTAAGTA TTCTGACTAA TCCAAATCAA TATGACGCTT TTCTTGAAAA ATTAGAAGCT
GGGGAGATTT CAACAACTAT CAAAGCAAAA CTCGCTCTTG AAGCTTTCGA GCACACCGCA
AGTTATGACA TAGCAATTAG CCAATGGTTA AGCAAGCAAA TTGAATCTAA ATATTCTCCA
TATTTAACTT CTCAACCAAT TAAACAAACT CTGAGGTATG GAGAGAATCC TCATCAAAAT
GCAAATTGGT ATAGCGCAGT TAATCAAGGG TGGGGGCAAG CTGAACAATT ACAAGGTAAA
GAGCTCAGCA CAAATAATCT TCTAGATCTA GAAGCTGCTG TTGCAACAAT AAGAGAATTT
GGATATGACT TAGGCAATAA AGGCAATTCG TGCGAGAAAG CTGCAGTCAT TATTAAGCAC
ACTAATCCTT GTGGAGTAGC TGTAAGCAAT AATCTGAGCA ATGCATTCAA CCTGGCCCTT
GAGTGCGACT CAATTAGTGC ATTTGGAGGA ATTGTTGCTC TTAACTGCAA TTTAGATGCT
GCTACAGCAA AAGAACTAAG CAGTCTATTT TTAGAATGTG TAGTAGCTCC AGACTATGAC
GCTAACGCTT TAGAGATCCT TTCAACGAAA AAAAATTTAA GGATAATTAA ACTTAGTCAC
AGCTCTATAA AGTCGTCCGA ACGTAAGTAT ATAAGAAGCA TTTTAGGAGG AATATTGGTT
CAGGAAGTTG ATGACAAATT AATTGAACCT AATGAATGGA AAGTTCCTAC AAAATTACAA
ATGTCTATTG AAGACAAAGC TGATCTAGCT TTCGCCTGGC GAGTAGTAAG ACATGTTAGA
TCAAATGCAA TAGTAGTTGC ATCTGCTGGT CAAACTTTAG GAATAGGTGC AGGGCAAATG
AATAGAATAG GGGCAGCAAA AATAGCTCTG GAAGCTGCAG GAGAAAAAGC TCAAGGTGCT
GTATTAGCTA GTGATGGCTT CTTTCCCTTT GATGACACAG TACATTTGGC ATCAAGATAT
GGAATCAAAT CAATAATTCA ACCAGGAGGA AGTATTCGAG ACCAATCATC TATAGATGCA
TGCAATCAAT TAGGTCTCTC TATGATATTT ACTGGTAAAA GACATTTCCT TCATTAA
 
Protein sequence
MARIALISVS NKDGLIPFAK TLTTLHGFEI ISSGGTARAL KEANIPVKTV SDYTGAPEIL 
GGRVKTLHPR IHGGILAKQG NSSHQFDLEK ENIKNIDLVV VNLYPFQETI SDPDVTWDNA
IENIDIGGPA MIRAAAKNHE SVSILTNPNQ YDAFLEKLEA GEISTTIKAK LALEAFEHTA
SYDIAISQWL SKQIESKYSP YLTSQPIKQT LRYGENPHQN ANWYSAVNQG WGQAEQLQGK
ELSTNNLLDL EAAVATIREF GYDLGNKGNS CEKAAVIIKH TNPCGVAVSN NLSNAFNLAL
ECDSISAFGG IVALNCNLDA ATAKELSSLF LECVVAPDYD ANALEILSTK KNLRIIKLSH
SSIKSSERKY IRSILGGILV QEVDDKLIEP NEWKVPTKLQ MSIEDKADLA FAWRVVRHVR
SNAIVVASAG QTLGIGAGQM NRIGAAKIAL EAAGEKAQGA VLASDGFFPF DDTVHLASRY
GIKSIIQPGG SIRDQSSIDA CNQLGLSMIF TGKRHFLH