Gene PMN2A_1632 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPMN2A_1632 
SymbolpurH 
ID3607032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL2A 
KingdomBacteria 
Replicon accessionNC_007335 
Strand
Start bp301066 
End bp302622 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content36% 
IMG OID637688512 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_292823 
Protein GI72383468 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACCGA TAGCTCTGCT AAGTGTCTCA GACAAAACTG GCTTAATTCC ACTTGCGAAA 
GCATTAGTTA ATGATCTGGG CTTCAAAATC ATTTCAAGTG GCGGGACTGC AAAGTTAATT
GAGAGTGAAA ATCTTCCTGT TACAAGAGTC GCAGATTACA CAGGATTCCC AGAGATTCTT
GGAGGAAGAG TAAAAACTCT AAACCCAAAA ATTCATGGAG GGATATTAGC CAGACGAGAT
AAACAATCTC ATTTAGATGA TTTAGATAAA CAAAATATCA ATCCAATAGA CTTGGTGGTT
GTTAACTTAT ATCCATTTGT AAAAACAATT TCCAAAGAGA ATGTTTCATG GGAGGAAGCT
ATCGAAAATA TTGATATTGG TGGTCCAACA ATGATCCGAG CAGCAGCAAA AAACCATCAA
GATGTTCTTG TAGTTACTGA TCCAAGTCAA TACTCAAACT TAATTGATGC CTATAAATCA
AAAAAGATCA CTACTGAATT ACGAAAAAAA TATTCGCAAC AAGCTTTTGA GCATACCGCG
ACGTATGACC TAACAATAAG TAATTGGATT GCCAACCAAA GCTCCTCAAA AAAGGTTTCT
TGGTTGCAAA GCTTGCCATT AAAGCAAGAA CTTAGGTATG GAGAAAATCC TCATCAAAAA
GCTTCATGGT ATGGAGAGCC TGAAAAAGGA TGGAGTGGAG CTAATCAATT ACAAGGCAAA
GAATTAAGTA CAAATAATCT TCTAGATCTG GAGGCTGCTT TATCTACTCT TCGTGAATTT
GGGTATAAAA ATAATATTAG TAACCCTTCA TATCAAAAAG CAGCGGTAGT AATTAAGCAT
ACAAATCCTT GTGGAGTAGC TATTGGAGAT TCTCCATCTT CAGCTCTTAA AAGAGCATTA
GATGGCGATA GAGTAAGTGC TTTTGGGGGT ATTATTGCTA TCAATTGCCC CGTTGATGAA
GCTGCAGCAA AAGAAATTGA AAATATATTT ATTGAATGTG TTGTAGCTCC ATATTTTGAT
GAAACTGCAA AAGAAATACT TTCAAAAAAG AAAAATCTTA GGCTCTTAGA ATTAAAAGCT
GAGTCTGTCC AAAAAGCAGA TAAAAATCAC ATAAGAAGCA TACTTGGTGG TTTATTAATT
CAAGATTTAG ACGAACCAAG TATTGATCAA AAAAAATGGA AAAGTGTTAC TGAACTAATC
CCAACAGATG AAGAAATGAA TGACTTATCT TTTGCTTGGA AAATTGTAAA ACATATACGA
TCAAACGCAA TAGCTGTTGC ATCCAATCAG CAGAGTCTAG GGATTGGAGC TGGCCAAATG
AATAGGGTAG GTTCAGCAAA ACTTGCATTA GAAGCTGCTG GTACAAAATC AAAAGGTGCT
GTTTTGGCTA GTGATGGTTT TTTCCCATTC GACGATACTG TAAAGATGGC TTCTGATTAT
GGTATTAGTT CAATTATTCA GCCTGGTGGA AGCATTAGAG ACGAAGATTC TATTAAAGCC
TGCAATGAAT TAGGAATAAA AATGATTCTT ACTGGTAAAA GGCACTTTTT ACATTGA
 
Protein sequence
MSPIALLSVS DKTGLIPLAK ALVNDLGFKI ISSGGTAKLI ESENLPVTRV ADYTGFPEIL 
GGRVKTLNPK IHGGILARRD KQSHLDDLDK QNINPIDLVV VNLYPFVKTI SKENVSWEEA
IENIDIGGPT MIRAAAKNHQ DVLVVTDPSQ YSNLIDAYKS KKITTELRKK YSQQAFEHTA
TYDLTISNWI ANQSSSKKVS WLQSLPLKQE LRYGENPHQK ASWYGEPEKG WSGANQLQGK
ELSTNNLLDL EAALSTLREF GYKNNISNPS YQKAAVVIKH TNPCGVAIGD SPSSALKRAL
DGDRVSAFGG IIAINCPVDE AAAKEIENIF IECVVAPYFD ETAKEILSKK KNLRLLELKA
ESVQKADKNH IRSILGGLLI QDLDEPSIDQ KKWKSVTELI PTDEEMNDLS FAWKIVKHIR
SNAIAVASNQ QSLGIGAGQM NRVGSAKLAL EAAGTKSKGA VLASDGFFPF DDTVKMASDY
GISSIIQPGG SIRDEDSIKA CNELGIKMIL TGKRHFLH