Gene Information |
Locus tag | NATL1_20391 |
Symbol | purM |
ID | 4779866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1681835 |
End bp | 1682875 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640085333 |
Product | phosphoribosylaminoimidazole synthetase |
Protein accession | YP_001015859 |
Protein GI | 124026744 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase |
TIGRFAM ID | [TIGR00878] phosphoribosylaminoimidazole synthetase |
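
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon. Both assumptions are consistent with the values in this record, but they are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1681835, 1682875)
print(length)                  # 1041
print(protein_length(length))  # 346
```

Both results agree with the Gene Length (1041 bp) and Protein Length (346 aa) fields.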
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.812351 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTACA AAACTGCGGG AGTTGATGTT ACTGCTGGAA GAGCTTTTGT GGAGAGAATT AAATCATGCG TTGAAAAAAC TCACAAAAGT GAGGTCATAG GAGGGTTAGG AGGTTTTGGA GGATGTATAA GAATTCCAAA AGGATTTGAA AGTCCAGTCT TGGTCTCTGG GACAGATGGT GTTGGTACAA AATTAGAATT AGCTCAGCAA TATGGTTGTC ATTTTGGAGT CGGAATAGAT TTGGTTGCAA TGTGTGTAAA TGATGTAATA ACTAATGGGG CTCGACCTTT ATTTTTTCTT GATTACATAG CAAGCGGAAC TTTGACTCCT GATGCTTTGG CTGAAGTGAT AGAGGGCATT GCAGCAGGTT GTTGTCAATC GGATTGTTCT CTCTTGGGAG GCGAAACAGC TGAAATGCCA GGTTTTTATC CCAGTGGAAG ATATGACCTG GCAGGTTTCT GTGTTGGGAT CGTTGAAAAT CATCACTTAA TAGACGGCAC GAAAATTAAT TGTGGAGATC AGATCATTGG GATTAAAAGT AACGGTGTTC ATAGCAATGG TTTTAGTCTT GTTCGTAAAG TTCTTTCTAT GGCGAATGTA GATGAAAACA CTCTTTATGG GAAAGACAAA AGGAACTTGA TCCAATCTTT GCTGGAACCA ACAGCAATTT ATGTTCAACT TGTTGAGAAA TTGTTGAGAG AAAATTTACC AATTCATGGA ATGACGCATA TTACAGGTGG AGGATTGCCA GAGAATCTTC CTAGGATTTT CCCTTCTGGA TTGTTACCAC ATATAGATAT AACTACTTGG GAAATAACTG AAATCTTTAA TTGGTTACAA AATGCTGGAG ATATTCCTGA AATTGATCTT TGGAATACTT TTAATATGGG TATTGGTTTT TGTCTAATTG TTCCTAAAAA TGAGGTGAAT TCTGCTTTAG AAATATGTAT GAAAAATGAT TTTGAAGCTT GGAATATAGG TCAAGTTGTT GAAAGTCAGA ACAATTCAAA ACATAGCGGT ATTTTAGGAA TACCTAGCTG A
|
Protein sequence | MDYKTAGVDV TAGRAFVERI KSCVEKTHKS EVIGGLGGFG GCIRIPKGFE SPVLVSGTDG VGTKLELAQQ YGCHFGVGID LVAMCVNDVI TNGARPLFFL DYIASGTLTP DALAEVIEGI AAGCCQSDCS LLGGETAEMP GFYPSGRYDL AGFCVGIVEN HHLIDGTKIN CGDQIIGIKS NGVHSNGFSL VRKVLSMANV DENTLYGKDK RNLIQSLLEP TAIYVQLVEK LLRENLPIHG MTHITGGGLP ENLPRIFPSG LLPHIDITTW EITEIFNWLQ NAGDIPEIDL WNTFNMGIGF CLIVPKNEVN SALEICMKND FEAWNIGQVV ESQNNSKHSG ILGIPS
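
The relationship between the two sequence fields can be sketched in plain Python. Although the gene lies on the minus strand, the gene sequence as displayed begins with ATG and ends with TGA, so it appears to be given in coding orientation already and is translated directly here. The codon assignments are those shared by NCBI translation tables 1 and 11, which differ only in permitted start codons. The helper names `clean`, `gc_content`, and `translate` are illustrative, not part of any database tool.

```python
# Amino acid assignments for NCBI translation table 11, listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGGATTACA AAACTGCGGG AGTTGATGTT"
print(translate(first_blocks))  # MDYKTAGVDV
```

Passing the full gene sequence to `translate` should reproduce the protein sequence above, and `gc_content` on the same input can be compared against the reported GC content of 37%.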