Gene NATL1_00031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_00031 
SymbolpurF 
ID4780992 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp4597 
End bp6054 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content33% 
IMG OID640083266 
Productamidophosphoribosyltransferase 
Protein accessionYP_001013832 
Protein GI124024716 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0034] Glutamine phosphoribosylpyrophosphate amidotransferase 
TIGRFAM ID[TIGR01134] amidophosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.763757 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.837003 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTGGAA TAGTAGGTGT TGTTTCAACT GATCAATGTA ATCAACAAAT TTACGATAGT 
TTGCTATTGC TTCAGCATAG AGGTCAAGAT TCCACTGGTA TAGCAACAAT GGATGGCAGT
GTTTTTCATA TTAATAAATC TAAAGGTCAA GTTCGAGAAG CTTATAGAAC TAGAGATATG
AGAGCTTTGA TCGGAAGATC AGGATTGGGA CATGTACGTT ATGCAACTAA AGGTGCTGCT
CATAGAGAAG AAGAAGCACA ACCTTTTTAT GTCAATGCTC CTTACGGAAT TATTCTTGTT
CATAATGGAA ACTTAACAAA TACAAGAGAG CTAGAAAAAG AACTTTTTAA AGTTGATAAA
AGACATACAA ATACATCGAG CGATACTGAA ATGTTATTAA ATGTTTTGGC AACAGAATTA
AATAGTGAAG TAAAAGGAAA AGATGTAGAT CCAGAGGATC TTTTTAATGC TGTTAAAAGA
CTTCACTGTA GAGTAGAGGG ATCCTATGCT TCAATCGCTT TAATAGCAGG TCATGGTCTT
TTAGCATTTA GAGATCCTTT TGGAATTCGC CCTTTGGTTA TTGGTAAAAG GGTAAAAGAA
AATAACAAAC CTGAGTGGGT AATCGCTAGT GAATCGCTAG TCATTGAAAA CAATGATTAT
GTAATTGTTA GAGATGTTGA ACCAGGTGAG GCGATATTTA TAACTTCAAA TGGTGAGTTT
TTTTCTAAAC AATGCTCAGA TAATCCTCAA CTATTCCCCT GTTCATTTGA ATATGTTTAT
TTAGCAAGAC CTGACTCAAT AATGAATGGT ATTTCTGTTT ATGAGTCAAG ATTAAGAATG
GGGGATTTAT TAGCAGAAAC TATAAAAAAA CAAATTTCTC TTGGTGATAT AGATGTCGTA
ATGCCAATTC CTGATTCTTC TAGACCTGCT GCAATGCAGG TAGCAAGACA GTTAGGGATT
GAATATAGGG AGGGTTTTTT CAAGAATCGT TATGTTGGTA GAACATTTAT AATGCCTGGT
CAATCTCTAA GAAAACGTTC AGTGCGTCAG AAATTAAATG CAATGAGTAC TGAATTTAAA
AATAAAAATG TATTGATAGT TGATGATTCA GTGGTTAGAG GAACGACATC TCAGCAAATA
GTTCAAATGG CAAGAAGTGC AGGGGCTAAT AAAGTTACTT TTACATCTGC TGCACCACCA
ATAAGATATC CACATGTTTA TGGAATTAAT ATGCCAAGTA GAAATGAATT GATTGCTTAT
AATAGAGATA TAAATCAAAT AGAAAATAAT TTATTTATTG ATAAAATGAT TTATCAAGAG
GTAAATGATC TTACTCAGGC TATTACGCAA AATTCTAAAA TTAAGGAATT AGATTTATCA
TGCTTTACAG GAAAATATAT TACAGGAACA GTCACTGATG AATATTTAAC TTGGGTTGAA
GAAACATCTT TGTCTTAA
 
Protein sequence
MCGIVGVVST DQCNQQIYDS LLLLQHRGQD STGIATMDGS VFHINKSKGQ VREAYRTRDM 
RALIGRSGLG HVRYATKGAA HREEEAQPFY VNAPYGIILV HNGNLTNTRE LEKELFKVDK
RHTNTSSDTE MLLNVLATEL NSEVKGKDVD PEDLFNAVKR LHCRVEGSYA SIALIAGHGL
LAFRDPFGIR PLVIGKRVKE NNKPEWVIAS ESLVIENNDY VIVRDVEPGE AIFITSNGEF
FSKQCSDNPQ LFPCSFEYVY LARPDSIMNG ISVYESRLRM GDLLAETIKK QISLGDIDVV
MPIPDSSRPA AMQVARQLGI EYREGFFKNR YVGRTFIMPG QSLRKRSVRQ KLNAMSTEFK
NKNVLIVDDS VVRGTTSQQI VQMARSAGAN KVTFTSAAPP IRYPHVYGIN MPSRNELIAY
NRDINQIENN LFIDKMIYQE VNDLTQAITQ NSKIKELDLS CFTGKYITGT VTDEYLTWVE
ETSLS