Gene A9601_00031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_00031 
SymbolpurF 
ID4716685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp4425 
End bp5885 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content33% 
IMG OID640077700 
Productamidophosphoribosyltransferase 
Protein accessionYP_001008398 
Protein GI123967540 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0034] Glutamine phosphoribosylpyrophosphate amidotransferase 
TIGRFAM ID[TIGR01134] amidophosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.486133 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGCGGAA TAGTTGGAAT CGTTTCTTCG AATGATGTAA ATCAACAAAT TTACGATAGT 
CTTTTGCTTC TGCAGCATAG AGGTCAAGAC TCAACAGGGA TAGCAACAAT GGAAAATACT
GTTTTCCATA TACATAAGGT TAAAGGTCAG GTTAATACTG CTTATAGAAC TAGAGATATG
AGGAATTTAA TTGGCAAAAT TGGATTGGGT CATGTTAGGT ATGCAACTAA GGGATCAGCA
GAAAGTGTAG AAGAAGCACA GCCTTTTTAT GTTAATGCTC CTTATGGAAT TGTTTTGATA
CATAACGGAA ATTTGACTAA TACCAGAGAT TTAGAAAAAC AGTTATTTAA TGTGGACAAG
CGGCATACAA ATTCTTCAAG TGATACTGAA ATGTTGTTAA ATGTATTTGC GACAGAATTA
CAAGAACAAA TTCATAATCA AGAATTAGAA CCTGATATTA TTTTTAGTGC GGTCAAATCT
TTACATAAAA GAATTCAGGG ATCATATGCT TCAATTGCAT TAATTTCAGG ACATGGTTTA
TTGGCATTTA GAGATCCTTT TGGGATAAGG CCTTTAGTCA TAGGGAAAAG ACTTTCCTTA
ACCACAAAAA AAGAAGAATG GATGGTTGCA AGCGAATCTT TAGTACTTGA GAATAATGAT
TATCAAGTAG TGAGAGATGT AGATCCAGGA GAAGCTGTTT TTATAAATCT TGATGGGGAG
TTTTTCTCTA AGCAATGTTC TGATAATCCC ATGCTATTTC CCTGTGCTTT TGAATATGTT
TACTTAGCAA GGCCAGATTC AATTATGAAT GGAATTTCCG TTTATAAAGC TCGTTTAAAG
ATGGGAGATT ATTTAGCAGA AACAATAAAA GAAACAATTA ATTCTGGAGA TATTGATGTA
GTTATGCCTA TTCCTGATTC TTCTCGACCT GCGGCAATGC AAGTTGCAAG ACAGTTAGGG
ATAGAATATA GGGAAGGTTT TTTTAAAAAT AGATATGTTG GCAGAACATT TATAATGCCT
GGTCAGCAGA AACGTAAGAA ATCTGTAAGG CAAAAATTAA ATGCTATGAG TGCAGAGTTT
AAAAATAAAA ATGTATTAAT TGTTGATGAC TCGATAGTAA GAGGTACTAC TTCAAAAGAA
ATTGTGCAGA TGGCTAAAGA TGCAGGAGCA AACAAAGTTT TTTTTACATC AGCAGCTCCT
CCTGTTCGTT ATCCTCACGT TTATGGAATT AATATGCCTA ATAGAGATGA ATTAATAGCA
CATAATAGGA CAATAAGTGA AATCGCCGAT AAACTTGAAA TTGATAATCT TGTTTATCAA
AGTGTTGAAA GTTTGCGTAA ATCTATAATT AGTGATTCTC CTATTAAAGG TTTAGAGATG
AGTTGTTTCA CTGGTGATTA TGTAACTGGA ACAGTAAATC AAGAATACTT AAATTGGGTT
GAAAATGAAT ATAAATCTTA G
 
Protein sequence
MCGIVGIVSS NDVNQQIYDS LLLLQHRGQD STGIATMENT VFHIHKVKGQ VNTAYRTRDM 
RNLIGKIGLG HVRYATKGSA ESVEEAQPFY VNAPYGIVLI HNGNLTNTRD LEKQLFNVDK
RHTNSSSDTE MLLNVFATEL QEQIHNQELE PDIIFSAVKS LHKRIQGSYA SIALISGHGL
LAFRDPFGIR PLVIGKRLSL TTKKEEWMVA SESLVLENND YQVVRDVDPG EAVFINLDGE
FFSKQCSDNP MLFPCAFEYV YLARPDSIMN GISVYKARLK MGDYLAETIK ETINSGDIDV
VMPIPDSSRP AAMQVARQLG IEYREGFFKN RYVGRTFIMP GQQKRKKSVR QKLNAMSAEF
KNKNVLIVDD SIVRGTTSKE IVQMAKDAGA NKVFFTSAAP PVRYPHVYGI NMPNRDELIA
HNRTISEIAD KLEIDNLVYQ SVESLRKSII SDSPIKGLEM SCFTGDYVTG TVNQEYLNWV
ENEYKS