Gene P9211_00031 details


Gene Information

Locus tag: P9211_00031
Symbol: purF
ID: 5730037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 4620
End bp: 6077
Gene Length: 1458 bp
Protein Length: 485 aa
Translation table: 11
GC content: 34%
IMG OID: 641284345
Product: amidophosphoribosyltransferase
Protein accession: YP_001549888
Protein GI: 159902544
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0034] Glutamine phosphoribosylpyrophosphate amidotransferase
TIGRFAM ID: [TIGR01134] amidophosphoribosyltransferase
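
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp, includes_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if includes_stop_codon else codons


length = gene_length_bp(4620, 6077)
print(length, protein_length_aa(length))
```

With the values listed for this gene, the two helpers reproduce the reported gene length (1458 bp) and protein length (485 aa).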


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.495809
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00403524
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTGCGGCA TAGTAGGTAT TTTTTCTAAT CATCAGATTA ATCAATTGAT TTATGACAGC 
TTGTTATTAT TGCAGCATAG AGGGCAAGAC TCTACAGGTA TAGCAACGAT GGAAGGAAGT
TTATTCCATA TATGTAAATC AAAAGGACAA GTAAAAGAAG CTTATAGAAC GCGAGATATG
AGAAGTTTAC TTGGCAATAT AGGTATTGGT CATGTCAGAT ATGCGACTAA AGGAACTGCA
GATAGTGAAA ATGAAGCACA ACCCTTTTAT GTCAATGCAC CTTATGGCAT TATTCTTGTA
CATAATGGAA ATTTGACTAA TACAAGAGAA CTAGAAAAGC AGCTTTTTAA TATAGATCGA
AGACATACAA ATTCTTCTAG TGATACTGAA ATGTTGCTAA ATATCTTTGC AACTGAAATA
CAAGCTCAAA TTCATGGAAG TTCTTTATCA CCAGAACATA TTTTTTCTGC TATTAAATCA
TTACATAAAA GAGTGGAAGG TTCTTATGCT GCTATTGCTT TAATTGCTGG CCATGGATTA
GTTGCTTTTA GAGATCCGTA TGGAATTAGA CCATTAGTTT TAGGTAAAAG AATTTCGGAA
GATAATCGGG ATGAATGGAT TCTTGCAAGC GAATCTCTGG TCTTGGAAAA TAACGATTTT
CAAATTGTTC GTGATTTAGA TCCAGGGGAG GCAGTATTTA TATCAGTCAA TGGTGAACTT
CATTCCCAAC AATGTTCAGA TAATCCCAAA CTATTTCCTT GTTCTTTCGA ATATGTTTAT
TTAGCTAGGC CTGATTCAAT TATGAATGGA ATCTCTGTCT ATGAAGCAAG GCTTCGGATG
GGTGATCGAT TAGCGAATAC TATAAAAAAG ACACTTAATT CTGGAGATAT TGATGTTGTT
ATGCCTATTC CCGATTCTTC CAGGCCTTCT GCCATGCAAG TTGCTAGACA ATTAGGTGTT
GAATATAGAG AAGGATTTTT TAAAAATAGA TATGTAGGCA GAACTTTTAT TATGCCAGGG
CAATCTCAAC GAAAGAAATC AGTACGACAA AAGTTAAATG CTATGAGTAC AGAGTTTAAA
GGAAAAAATA TTCTCATTGT TGACGACTCT ATAGTTCGAG GAACTACTTC TAGAGAAATA
GTTCAAATGG CAAGACTTGC TGGAGCGAAT AAAGTTACAT TTACTTCAGC AGCTCCGCCT
ATACGATTTC CGCATGTTTA TGGGATTAAT ATGCCTTCAA AAGATGAGCT AATAGCATAT
GATCGATCCA TTCTTGAAAT TCAGAATATT TTATTAGTTG ATCAATTAGT TTATCAAGAG
GTCGGTGATC TTAAGACTGC AATTTTGGAT GACTCCAAGA TAGAAGATTT AGATATGTCT
TGCTTTACAG GTCATTATGT TACAGGAACA GTTACTAATG AATATTTAAA TTGGGTTGAA
ACTGAATATA TTTCTTAA
 
Protein sequence
MCGIVGIFSN HQINQLIYDS LLLLQHRGQD STGIATMEGS LFHICKSKGQ VKEAYRTRDM 
RSLLGNIGIG HVRYATKGTA DSENEAQPFY VNAPYGIILV HNGNLTNTRE LEKQLFNIDR
RHTNSSSDTE MLLNIFATEI QAQIHGSSLS PEHIFSAIKS LHKRVEGSYA AIALIAGHGL
VAFRDPYGIR PLVLGKRISE DNRDEWILAS ESLVLENNDF QIVRDLDPGE AVFISVNGEL
HSQQCSDNPK LFPCSFEYVY LARPDSIMNG ISVYEARLRM GDRLANTIKK TLNSGDIDVV
MPIPDSSRPS AMQVARQLGV EYREGFFKNR YVGRTFIMPG QSQRKKSVRQ KLNAMSTEFK
GKNILIVDDS IVRGTTSREI VQMARLAGAN KVTFTSAAPP IRFPHVYGIN MPSKDELIAY
DRSILEIQNI LLVDQLVYQE VGDLKTAILD DSKIEDLDMS CFTGHYVTGT VTNEYLNWVE
TEYIS
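
Both sequences above are printed in blocks of ten characters, so the spacing has to be removed before any analysis. The sketch below, with hypothetical helper names, shows one way to strip the block formatting, compute a GC fraction, and run a basic check that a nucleotide sequence looks like a complete coding sequence. It is demonstrated on the first two blocks of the gene sequence only; the start-codon check is a simplification, since translation table 11 also permits several alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq):
    """Rough check: whole codons, ATG start (simplified), terminal stop codon."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# First two ten-base blocks of the gene sequence, copied from the listing.
first_blocks = clean_sequence("ATGTGCGGCA TAGTAGGTAT")
print(len(first_blocks), gc_fraction(first_blocks))
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` gives a value that can be compared with the GC content field in the gene information section.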