Gene P9301_00031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_00031 
SymbolpurF 
ID4912393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp4427 
End bp5887 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content33% 
IMG OID640159567 
Productamidophosphoribosyltransferase 
Protein accessionYP_001090227 
Protein GI126695341 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0034] Glutamine phosphoribosylpyrophosphate amidotransferase 
TIGRFAM ID[TIGR01134] amidophosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGCGGAA TAGTTGGAAT CGTTTCTTCA GATGATGTAA ATCAGCAAAT TTACGATAGT 
CTTTTGCTTT TGCAGCATAG AGGTCAAGAT TCAACAGGTA TAGCTACAAT GGAAAATACT
GTTTTCCATA TACATAAGGC TAAAGGTCAG GTTAATACTG CTTATAGAAC GAGAGATATG
AGGAATTTAA TCGGCAAAAT TGGATTGGGT CATGTTAGGT ATGCAACAAA AGGATCAGCA
GAAAGTGTAG AAGAAGCACA GCCTTTTTAT GTTAATGCTC CTTATGGAAT TGTTTTGATA
CATAATGGAA ATTTGACGAA CACTAGAGTT CTAGAAAAAC AATTATTTAA TATTGATAAA
AGGCATACAA ATTCTTCAAG TGATACTGAA ATGTTGTTAA ATGTATTTGC GACAGAATTA
CAAGAACAAA TTCATAATCA AGAATTAGAA CCTGATATTA TTTTTAGTGC CGTTAAATCT
TTACATAAAA GAATTCAGGG ATCATATGCT TCAATTGCAT TAATTTCAGG ACATGGTTTA
TTAGCATTCA GAGATCCCTT CGGTATTAGG CCTTTAGTCA TAGGAAAAAG ATTTTCGTTA
ACTACAAAAA AAGAAGAGTG GATGGTTGCT AGCGAATCTC TAGTTCTTGA GAATAACGAT
TATCAAGTAG TGAGAGACGT AGATCCTGGA GAAGCTGTTT TTATAAATCT TAATGGTGAG
TTTTTTTCTA AGCAATGTTC TGAAAATCCA ATGTTATTTC CTTGTTCTTT TGAATATGTT
TATTTAGCTA GACCAGATTC AATAATGAAT GGAATTTCAG TATATAAAGC TCGCTTAAAG
ATGGGAGATT ATTTGGCAGA AACAATAAAA CAGACAATTA ATTCTGGAGA CGTTGATGTA
GTTATGCCTA TTCCTGATTC TTCTAGACCT GCTGCAATGC AAGTTGCAAG ACAGTTAGGG
ATTGAATATA GGGAGGGTTT TTTTAAAAAC AGATATGTTG GCCGAACATT CATAATGCCT
GGTCAACAGA AACGTAAGAA ATCTGTAAGA CAAAAGTTAA ATGCAATGAG CGCAGAGTTT
AAAAATAAAA ATGTATTAAT TGTTGATGAC TCGATAGTAA GAGGTACCAC TTCAAAAGAA
ATTGTCCAGA TGGCTAAAGA TGCAGGAGCA AATAAGGTTT TCTTCACATC AGCAGCCCCT
CCTGTTCGTT TTCCTCATGT TTATGGAATT AATATGCCGA ATAGAGATGA ATTAATAGCT
CATGACAGAA CAATAGCTGA AATTGCTGAT CATCTTTCAA TTGATAACCT TGTTTATCAA
AGTGTTGAAA GTTTACGGAA ATCTATAATA AGTGATTCTC CTATTCAGGA TTTGGAGATG
AGTTGCTTTA CCGGGTCTTA TGTAACCGGA ACAGTAAATC AAGAATACTT AAATTGGGTT
GAAAATGAAT ATAAATCTTA G
 
Protein sequence
MCGIVGIVSS DDVNQQIYDS LLLLQHRGQD STGIATMENT VFHIHKAKGQ VNTAYRTRDM 
RNLIGKIGLG HVRYATKGSA ESVEEAQPFY VNAPYGIVLI HNGNLTNTRV LEKQLFNIDK
RHTNSSSDTE MLLNVFATEL QEQIHNQELE PDIIFSAVKS LHKRIQGSYA SIALISGHGL
LAFRDPFGIR PLVIGKRFSL TTKKEEWMVA SESLVLENND YQVVRDVDPG EAVFINLNGE
FFSKQCSENP MLFPCSFEYV YLARPDSIMN GISVYKARLK MGDYLAETIK QTINSGDVDV
VMPIPDSSRP AAMQVARQLG IEYREGFFKN RYVGRTFIMP GQQKRKKSVR QKLNAMSAEF
KNKNVLIVDD SIVRGTTSKE IVQMAKDAGA NKVFFTSAAP PVRFPHVYGI NMPNRDELIA
HDRTIAEIAD HLSIDNLVYQ SVESLRKSII SDSPIQDLEM SCFTGSYVTG TVNQEYLNWV
ENEYKS