Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_13941 |
Symbol | purD |
ID | 5730749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1260989 |
End bp | 1262329 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641285770 |
Product | phosphoribosylamine--glycine ligase |
Protein accession | YP_001551279 |
Protein GI | 159903935 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0151] Phosphoribosylamine-glycine ligase |
TIGRFAM ID | [TIGR00877] phosphoribosylamine--glycine ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.144546 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCTA CAAACACAAG CAATGATTCT CTGCCTTCAT TCCAAAGAGT CCTTGTAGTT GGTAATGGTG GGCGCGAAAA TTCATTGGCA TGGGCGCTAA GCAAATGTGA AGGAATCTGC GAAGTATTTG TTGCTCCAGG CAATGGGGGT ACAGAGGATC ATCATCGCTG CCATTGCCTC AGTATTGATA CCTCAAATGT TGAAGCATTA ATTAGTTTTT GCCAGTCTAG AGAGATTCAA TTAGTAGTGA TAGGACCAGA AGCCCCTCTT GCTTCAGGTT TGGCTGACAA ACTTCGAAAA GCAGGATTGT TGGTATTTGG CCCTGGTGCT GATGGGGCCC AAATAGAAGC CAGTAAAGAT TGGGCCAAGA AATTAATGAT TGAAGCTGGC ATTCCAACTG CGCTTTATTG GTCTGCAAAC TCAAAAGAAC AAGCAATAGG ATTACTCAAA AATTTTGAGC AATCCTTAGT TATCAAGGCC GATGGGCTTG CTTCAGGGAA AGGGGTGACG GTATGTAAAT CCAAGGAAGA AGCTTTAAAT GCAATAAATA ATATCTTCGA GGGTAAGTTT GGTACTGCAG GAGAAACTGT CTTACTCGAA GAATGCCTTG AAGGTCCAGA AGTCTCTGTT TTTGCACTAT GTGATGGAGA AGAGCTTTTA GTCTTACCAA CAGCACAAGA CCACAAACGC CTACTTGATA AAGATCAAGG TCCAAATACA GGAGGCATGG GTTCTTATGC ACCCGCAAAT ATTCTTAGCA AACAACAATT AGAGGAAGTA CAAGAAAAAA TTCTTGATCC AACTTTAAAA GCTCTTAAAA GTAATAATAT CGATTATCGA GGAGTTATAT ATGTAGGTCT AATGATTACT ACTCAAGGAC CAAAAGTTAT TGAATTCAAT TGTCGATTTG GGGACCCAGA ATGCCAGGCT TTGATGCCAT TAATGGGACC AGAATTTGCT CATATTCTTC AAGCTTGTGC AATGGGCTGT CTCAGAAAAG CTCCTAAGCT AACTGTCAAT GATCTTTGTA GCGTTTGTAT AGTTGCATCC TCCGCTGGGT ACCCAGAAGC TCCTAAGAAA GGTGACATCA TAAATATTGA GGTTATATCA AACCCATTAT TTCAGATCTT TCAGGCTGGT ACTAAAAAAA TTGAATCTGG AGAATTATTA ACTTCAGGTG GAAGAGTACT ATCAGTTGTT GCTCAAGGGA ATAACTTTGA CGAGGCATTT AACCTCGCAT ATAAAGAGTT GAGTAAAATT AAATTCAAAG GAATGCATTA CCGAAATGAT ATTGGCCATC AAATAAGAAA AAGTTCTTTT CTTCCCGAAA ATTCTCTTTA A
|
Protein sequence | MKATNTSNDS LPSFQRVLVV GNGGRENSLA WALSKCEGIC EVFVAPGNGG TEDHHRCHCL SIDTSNVEAL ISFCQSREIQ LVVIGPEAPL ASGLADKLRK AGLLVFGPGA DGAQIEASKD WAKKLMIEAG IPTALYWSAN SKEQAIGLLK NFEQSLVIKA DGLASGKGVT VCKSKEEALN AINNIFEGKF GTAGETVLLE ECLEGPEVSV FALCDGEELL VLPTAQDHKR LLDKDQGPNT GGMGSYAPAN ILSKQQLEEV QEKILDPTLK ALKSNNIDYR GVIYVGLMIT TQGPKVIEFN CRFGDPECQA LMPLMGPEFA HILQACAMGC LRKAPKLTVN DLCSVCIVAS SAGYPEAPKK GDIINIEVIS NPLFQIFQAG TKKIESGELL TSGGRVLSVV AQGNNFDEAF NLAYKELSKI KFKGMHYRND IGHQIRKSSF LPENSL
|
| |