Gene P9211_13941 details


Gene Information

Locus tag: P9211_13941
Symbol: purD
ID: 5730749
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1260989
End bp: 1262329
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 40%
IMG OID: 641285770
Product: phosphoribosylamine--glycine ligase
Protein accession: YP_001551279
Protein GI: 159903935
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0151] Phosphoribosylamine-glycine ligase
TIGRFAM ID: [TIGR00877] phosphoribosylamine--glycine ligase
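
The coordinates and lengths in this record are mutually consistent, which a short sketch can check. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(1260989, 1262329)
print(length, protein_length(length))  # 1341 446
```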


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.144546
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCTA CAAACACAAG CAATGATTCT CTGCCTTCAT TCCAAAGAGT CCTTGTAGTT 
GGTAATGGTG GGCGCGAAAA TTCATTGGCA TGGGCGCTAA GCAAATGTGA AGGAATCTGC
GAAGTATTTG TTGCTCCAGG CAATGGGGGT ACAGAGGATC ATCATCGCTG CCATTGCCTC
AGTATTGATA CCTCAAATGT TGAAGCATTA ATTAGTTTTT GCCAGTCTAG AGAGATTCAA
TTAGTAGTGA TAGGACCAGA AGCCCCTCTT GCTTCAGGTT TGGCTGACAA ACTTCGAAAA
GCAGGATTGT TGGTATTTGG CCCTGGTGCT GATGGGGCCC AAATAGAAGC CAGTAAAGAT
TGGGCCAAGA AATTAATGAT TGAAGCTGGC ATTCCAACTG CGCTTTATTG GTCTGCAAAC
TCAAAAGAAC AAGCAATAGG ATTACTCAAA AATTTTGAGC AATCCTTAGT TATCAAGGCC
GATGGGCTTG CTTCAGGGAA AGGGGTGACG GTATGTAAAT CCAAGGAAGA AGCTTTAAAT
GCAATAAATA ATATCTTCGA GGGTAAGTTT GGTACTGCAG GAGAAACTGT CTTACTCGAA
GAATGCCTTG AAGGTCCAGA AGTCTCTGTT TTTGCACTAT GTGATGGAGA AGAGCTTTTA
GTCTTACCAA CAGCACAAGA CCACAAACGC CTACTTGATA AAGATCAAGG TCCAAATACA
GGAGGCATGG GTTCTTATGC ACCCGCAAAT ATTCTTAGCA AACAACAATT AGAGGAAGTA
CAAGAAAAAA TTCTTGATCC AACTTTAAAA GCTCTTAAAA GTAATAATAT CGATTATCGA
GGAGTTATAT ATGTAGGTCT AATGATTACT ACTCAAGGAC CAAAAGTTAT TGAATTCAAT
TGTCGATTTG GGGACCCAGA ATGCCAGGCT TTGATGCCAT TAATGGGACC AGAATTTGCT
CATATTCTTC AAGCTTGTGC AATGGGCTGT CTCAGAAAAG CTCCTAAGCT AACTGTCAAT
GATCTTTGTA GCGTTTGTAT AGTTGCATCC TCCGCTGGGT ACCCAGAAGC TCCTAAGAAA
GGTGACATCA TAAATATTGA GGTTATATCA AACCCATTAT TTCAGATCTT TCAGGCTGGT
ACTAAAAAAA TTGAATCTGG AGAATTATTA ACTTCAGGTG GAAGAGTACT ATCAGTTGTT
GCTCAAGGGA ATAACTTTGA CGAGGCATTT AACCTCGCAT ATAAAGAGTT GAGTAAAATT
AAATTCAAAG GAATGCATTA CCGAAATGAT ATTGGCCATC AAATAAGAAA AAGTTCTTTT
CTTCCCGAAA ATTCTCTTTA A
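
A GC content figure like the one reported for this gene can be recomputed from the sequence. The sketch below is a minimal illustration, assuming the sequence is pasted as displayed here, in 10-base blocks separated by spaces and line breaks. The helper names are hypothetical, and only the first two blocks are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used only as a small example.
example = clean_sequence("ATGAAAGCTA CAAACACAAG")
print(len(example), gc_percent(example))  # 20 35.0
```

Applying `gc_percent` to the full cleaned gene sequence gives the value to compare with the reported GC content.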
 
Protein sequence
MKATNTSNDS LPSFQRVLVV GNGGRENSLA WALSKCEGIC EVFVAPGNGG TEDHHRCHCL 
SIDTSNVEAL ISFCQSREIQ LVVIGPEAPL ASGLADKLRK AGLLVFGPGA DGAQIEASKD
WAKKLMIEAG IPTALYWSAN SKEQAIGLLK NFEQSLVIKA DGLASGKGVT VCKSKEEALN
AINNIFEGKF GTAGETVLLE ECLEGPEVSV FALCDGEELL VLPTAQDHKR LLDKDQGPNT
GGMGSYAPAN ILSKQQLEEV QEKILDPTLK ALKSNNIDYR GVIYVGLMIT TQGPKVIEFN
CRFGDPECQA LMPLMGPEFA HILQACAMGC LRKAPKLTVN DLCSVCIVAS SAGYPEAPKK
GDIINIEVIS NPLFQIFQAG TKKIESGELL TSGGRVLSVV AQGNNFDEAF NLAYKELSKI
KFKGMHYRND IGHQIRKSSF LPENSL
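
The protein sequence corresponds to the gene sequence read codon by codon. The following sketch illustrates this under stated assumptions: it uses the standard codon assignments, which translation table 11 shares, and it ignores the alternative start codons that table 11 permits (this gene starts with ATG). The `translate` helper is illustrative and is not taken from any library.

```python
from itertools import product

# Amino acids for the 64 codons, enumerated with bases in TCAG order.
# Codon assignments in translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final 21 bases of the gene sequence.
print(translate("ATGAAAGCTA CAAACACAAG CAATGATTCT"))  # MKATNTSNDS
print(translate("CTTCCCGAAA ATTCTCTTTA A"))  # LPENSL
```

The two outputs match the first ten and last six residues of the protein sequence listed above.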