Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_09541 |
Symbol | dxs |
ID | 4717663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 821829 |
End bp | 823718 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640078667 |
Product | 1-deoxy-D-xylulose-5-phosphate synthase |
Protein accession | YP_001009345 |
Protein GI | 123968487 |
COG category | [H] Coenzyme transport and metabolism [I] Lipid transport and metabolism |
COG ID | [COG1154] Deoxyxylulose-5-phosphate synthase |
TIGRFAM ID | [TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.277651 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTTTAA GTGAGTTAAG TCATCCAAAT CAACTTCATG GCTTAACAGT TTCACAATTA GAGGAAATTG CTTGTCAAAT TAGAGAAAGA CATCTTCAGG TAGTATCTAC TAGTGGTGGA CATCTTGGTC CTGGATTAGG TGTGGTTGAG TTGACATTGG CTCTATATCA AACTCTCGAT CTTGATTTTG ACAAAGTTGT TTGGGATGTA GGACATCAAG GTTACCCTCA TAAATTAATT ACAGGACGTT TCAGTCAATT TGATTCTCTA AGGCAACAAA ATGGAGTCGC TGGATATTTA AAAAGAAGTG AAAGTAAATT TGATCATTTT GGTGCTGGAC ATGCAAGTAC ATCTATTTCT GCTGCTTTAG GAATGGCAAT AGCGAGAGAT AGAAAAGGTG AAAATTATAA ATGTGTTGCT GTTATTGGAG ATGGAGCACT AACTGGAGGA ATGGCATTAG AAGCTATAAA TCATGCAGGT CACTTACCAA ATACTCCTTT AGTTGTAGTA TTGAACGATA ATGACATGTC TATTTCACCT CCTGTTGGAG CCCTTTCATC GTACTTAAAT AAGGTAAGAG TTAGTCCACC ATTGCAATTT TTGTCCGATA GTGTTCAAGA AAGTGTAAAA AATATACCCT TAATTGGTAA GGATATCCCA GAAGAACTCA AAAATATTAA AGGAAGTGTT AGACGACTAT CTGTGCCTAA GGTTGGAGCT GTTTTTGAAG AACTTGGATT TACATATATG GGTCCAATTG AAGGTCATGA TATTGCTAAT TTAATTAAGA CCTTTAACGC AGCCCATAAA CTTAAAAGAC CTGTACTTGT TCATGTTGTC ACAACAAAAG GGAAGGGATA CCCATATGCA GAAGCCGATC AGGTTGGATA TCATGCACAG TCTGCATTTG ATCTTACAAC TGGGAAATCT ATTCCATCAA AGAAACCTAA ACCTGTTAGT TATAGTAAAA TTTTTGGTCA AACCTTATTA AAAATATGTG AGCAAGATAG CAAAGTCATT GGTATCACAG CTGCAATGGC TACAGGTACT GGTTTAGATA TATTGCAAAA AAACATCCCT GATCAATATA TTGATGTAGG AATAGCAGAA CAACATGCAG TTACTCTTGC GGCAGGAATG TCTTGCGATG GTCTTAAACC TGTTGTAGCT ATTTATAGTA CTTTTCTTCA ACGTGCCTTT GATCAATTAA TTCATGATGT AGGGATACAA AATTTACCTG TATCATTCGT ACTTGATAGA GCTGGGATAG TTGGAGCTGA CGGTCCTACT CACCAAGGTC AGTACGATAT CAGTTATATG AGATCTATAC CTAATTTTGT ATTGATGGCT CCAAAGGATG AGTCTGAATT ACAGAGAATG TTAATAACTT CAATAAACCA TAATGGTCCT ACAGCTCTAA GAATACCAAG AGGCTCTGGA TTAGGAGTAG CTGTAATGGA TGAGGGTTGG GAACCTATGA ATATAGGCGA AGCTGAAATA CTTGAAGAAG GAGAAGATAT TTTAATTATT GCTTATGGTT CAATGGTCGC ATCAGCAATC GAAACAGCAA AGATATTAAA AAATATGAAC ATTAATGCAT GCATTGTTAA TGCGAGATTT GTTAAACCTC TTGATAAAAA TCTTATTATG CCTCTAGCAA GTAGGATTCA AAAAGTTGTA ACTATGGAAG AAGGAACTTT AATAGGTGGT TTTGGTTCCG CGATAGTTGA ACTATTTAAC GATAATGAAA TAAACATTCC TGTATACAGA ATAGGTATTC CTGATGTTTT AGTTGATCAT GCTTCACCTG ATCAGAGTAA AGAAAAATTA GGGCTTTTGC CTGATCAGAT GGCAGATAAA ATTATTAAGA AATTTAAGTT AGTTATTTAA
|
Protein sequence | MLLSELSHPN QLHGLTVSQL EEIACQIRER HLQVVSTSGG HLGPGLGVVE LTLALYQTLD LDFDKVVWDV GHQGYPHKLI TGRFSQFDSL RQQNGVAGYL KRSESKFDHF GAGHASTSIS AALGMAIARD RKGENYKCVA VIGDGALTGG MALEAINHAG HLPNTPLVVV LNDNDMSISP PVGALSSYLN KVRVSPPLQF LSDSVQESVK NIPLIGKDIP EELKNIKGSV RRLSVPKVGA VFEELGFTYM GPIEGHDIAN LIKTFNAAHK LKRPVLVHVV TTKGKGYPYA EADQVGYHAQ SAFDLTTGKS IPSKKPKPVS YSKIFGQTLL KICEQDSKVI GITAAMATGT GLDILQKNIP DQYIDVGIAE QHAVTLAAGM SCDGLKPVVA IYSTFLQRAF DQLIHDVGIQ NLPVSFVLDR AGIVGADGPT HQGQYDISYM RSIPNFVLMA PKDESELQRM LITSINHNGP TALRIPRGSG LGVAVMDEGW EPMNIGEAEI LEEGEDILII AYGSMVASAI ETAKILKNMN INACIVNARF VKPLDKNLIM PLASRIQKVV TMEEGTLIGG FGSAIVELFN DNEINIPVYR IGIPDVLVDH ASPDQSKEKL GLLPDQMADK IIKKFKLVI
|
| |