Gene NATL1_20391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_20391 
SymbolpurM 
ID4779866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1681835 
End bp1682875 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content37% 
IMG OID640085333 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_001015859 
Protein GI124026744 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.812351 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTACA AAACTGCGGG AGTTGATGTT ACTGCTGGAA GAGCTTTTGT GGAGAGAATT 
AAATCATGCG TTGAAAAAAC TCACAAAAGT GAGGTCATAG GAGGGTTAGG AGGTTTTGGA
GGATGTATAA GAATTCCAAA AGGATTTGAA AGTCCAGTCT TGGTCTCTGG GACAGATGGT
GTTGGTACAA AATTAGAATT AGCTCAGCAA TATGGTTGTC ATTTTGGAGT CGGAATAGAT
TTGGTTGCAA TGTGTGTAAA TGATGTAATA ACTAATGGGG CTCGACCTTT ATTTTTTCTT
GATTACATAG CAAGCGGAAC TTTGACTCCT GATGCTTTGG CTGAAGTGAT AGAGGGCATT
GCAGCAGGTT GTTGTCAATC GGATTGTTCT CTCTTGGGAG GCGAAACAGC TGAAATGCCA
GGTTTTTATC CCAGTGGAAG ATATGACCTG GCAGGTTTCT GTGTTGGGAT CGTTGAAAAT
CATCACTTAA TAGACGGCAC GAAAATTAAT TGTGGAGATC AGATCATTGG GATTAAAAGT
AACGGTGTTC ATAGCAATGG TTTTAGTCTT GTTCGTAAAG TTCTTTCTAT GGCGAATGTA
GATGAAAACA CTCTTTATGG GAAAGACAAA AGGAACTTGA TCCAATCTTT GCTGGAACCA
ACAGCAATTT ATGTTCAACT TGTTGAGAAA TTGTTGAGAG AAAATTTACC AATTCATGGA
ATGACGCATA TTACAGGTGG AGGATTGCCA GAGAATCTTC CTAGGATTTT CCCTTCTGGA
TTGTTACCAC ATATAGATAT AACTACTTGG GAAATAACTG AAATCTTTAA TTGGTTACAA
AATGCTGGAG ATATTCCTGA AATTGATCTT TGGAATACTT TTAATATGGG TATTGGTTTT
TGTCTAATTG TTCCTAAAAA TGAGGTGAAT TCTGCTTTAG AAATATGTAT GAAAAATGAT
TTTGAAGCTT GGAATATAGG TCAAGTTGTT GAAAGTCAGA ACAATTCAAA ACATAGCGGT
ATTTTAGGAA TACCTAGCTG A
 
Protein sequence
MDYKTAGVDV TAGRAFVERI KSCVEKTHKS EVIGGLGGFG GCIRIPKGFE SPVLVSGTDG 
VGTKLELAQQ YGCHFGVGID LVAMCVNDVI TNGARPLFFL DYIASGTLTP DALAEVIEGI
AAGCCQSDCS LLGGETAEMP GFYPSGRYDL AGFCVGIVEN HHLIDGTKIN CGDQIIGIKS
NGVHSNGFSL VRKVLSMANV DENTLYGKDK RNLIQSLLEP TAIYVQLVEK LLRENLPIHG
MTHITGGGLP ENLPRIFPSG LLPHIDITTW EITEIFNWLQ NAGDIPEIDL WNTFNMGIGF
CLIVPKNEVN SALEICMKND FEAWNIGQVV ESQNNSKHSG ILGIPS