Gene Information |
Locus tag | P9301_17661 |
Symbol | ppc |
ID | 4911812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 1487262 |
End bp | 1490231 |
Gene Length | 2970 bp |
Protein Length | 989 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640161367 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_001091990 |
Protein GI | 126697104 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATCTT TTCGACAGAT AAAAAATAAT AATGTGGATC TGATAAGTAA CAATGATCCA CTTGATAAAA ATCGTCTTCT AATTGAAGAT CTCTGGGAAT CTGTGCTCAG AGAAGAATGC CCAGATGATC AAGCAGAGAG ATTGATACAG CTTAAAGAAT TAAGTTATTC AAAACAAATT GATGGTAATA GTTCAAAAAC TTTTAAAAAT GAAATAGTTG ATATTGTAAA TTCTATGGAT TTAGCAGAAT CCATAGCTGC TGCAAGGGCG TTTTCATTAT ATTTTCAACT CGTAAATATT TTGGAACAAA GAGTTGAGGA AGATAGATAT ATTCAAAGCT TTACTAATAA GGATGTTCAA AAATCGCCAG ACAATCTTGA TCCTTTTGCC CCAGCATTGG CTAGGCAAAA TGCTCCAGTA ACTTTTAGAG AATTATTTTA CAGGCTGAGA AAATTAAATG TACCACCAGG AAAATTAGAG GAGTTATTAC AGGAAATGGA TATTCGTTTA GTTTTTACTG CGCATCCGAC CGAGATAGTA AGACATACGA TTAGACATAA GCAAACCAGA GTAGCAAATT TGTTAAAAAA AATACAGATT GAGCAATTTC TGACAAAAGA TGAAAAAATT TCTCTAAAAA CCCAATTAAA AGAGGAAGTA AGGCTTTGGT GGAGAACAGA TGAATTGCAT CAATTTAAAC CTTCAGTTTT AGATGAAGTT GATTATTCAT TGCATTATTT TCAGCAAGTT TTATTTAATG CGATGCCTCA ATTGAGAGGC AGGATTACTG AAGCACTTAC CGAAAATTAT CCAGATGTTC AGTTGCCCCC AGAATCTTTT TGTAACTTCG GTTCTTGGGT AGGCTCTGAT AGGGACGGTA ATCCATCAGT AACTCCTGAT ATAACATGGC GAACTGCATG TTACCAAAGG CAGTTGATGT TGGATAGATA TATTGTTGCG ACGTCTAATC TTAGAGATCA ATTAAGTGTA TCTATGCAAT GGAGTCAAGT AAGTTCCTCC TTATTAGAAT CACTCGAAAC TGACAGGGTT AAGTTCCCTG AAATATATGA AGCTAGAGCT ACAAGGTATA GATCAGAACC TTACAGATTA AAATTAAGTT ATATTTTAGA GAAATTAAGG TTAACACAAG AAAGAAATAA TTTATTAGCT GATAGTGGGT GGAAATTTGA CTTTGAGGGG GAAATTGATA ACAAAAATAT AGATAAAGTT GAAAATTTAT ATTACAAGTC AGTAAACGAA TTTACATATG ATCTTGAGCT GATTAAAAAT AGCTTAATTA GTACAGATTT AACTTGCGAT TCTGTAAATA ACTTACTTAC TCAGGTTCAT ATTTTTGGAT TTTCTTTAGC AAGTTTAGAT ATTCGTCAAG AGAGTACAAG GCATAGTGAT GCTATTCAAG AGCTTACAAA TTATCTTGAT TTGTCTGTTC AATATGACCA AATGTCTGAG GAAGAGAAAA TTAAATGGCT CATAGACGAA TTAAATACAA AAAGGCCTTT AATTCCATCT GACGTTAACT GGACAAAAAC TACAGAAGAA ACCTTTTCAG TTTTTAAGAT GGTTAAGAGA CTACAGCAAG AATTTGGAAG TCGCATTTGT CATTCTTATG TAATTTCAAT GAGTCATAGT GCATCTGATT TGCTTGAAGT TCTCTTACTT GCAAAAGAAA TGGGACTTCT TGATCAAGAT TCACAAAAGT CAAAATTATT AGTTGTTCCT CTTTTTGAAA CTGTGGAAGA TCTTCAAAGA GCACCAGAAG TAATGGAAAA GTTGTTTAAA 
TTAGATTTCT ATAAATCATT ATTGCCAAAA GTAGGAGAAT CTTTTAAACC TCTACAAGAA TTAATGCTTG GATATTCTGA TAGTAATAAA GATTCAGGAT TTGTTTCTAG TAATTGGGAA ATTCATAGAG CCCAAATAGC TCTTCAAAAT CTCTCAAGTA GAAATAACAT ATTGCTAAGA CTTTTTCATG GAAGAGGAGG GTCTGTTGGT AGAGGAGGCG GACCAGCATA TCAGGCAATA TTAGCTCAAC CAAGCGGCAC TTTAAAAGGA CGAATAAAAA TAACAGAACA AGGTGAAGTT TTAGCTTCCA AATATAGTCT TCCCGAACTG GCTTTATACA ATCTTGAGAC TGTAACTACA GCGGTAATTC AAAATAGTTT GGTAAATAGT AGACTTGACG CTACGCCAGA ATGGAATCAA TTAATGTCTA GGCTGGCAGA AACATCAAGA TCTCATTACA GAAAATTAGT GCATGAGAAT CCTGATTTGT TAAATTTCTT TCAAGAGGTC ACTCCGATAG AAGAAATAAG TAAATTACAG ATATCTAGTA GGCCTGCTAG AAGAAAAAAA GGTGCAAAAG ATTTGTCAAG TTTAAGAGCT ATTCCATGGG TATTTGGCTG GACACAAAGT AGATTTCTTT TGCCAAGTTG GTTTGGAGTA GGCACTGCTT TGTCAGCTGA ATTAAATTCA GATCCACAAC AAATTGAACT ATTAAGAGTT CTGCATCAAA GATGGCCATT TTTTAGGATG CTTATATCTA AAGTGGAAAT GACATTATCG AAGGTGGATC TGGAAGTTGC TAGATATTAT GTTGATACTC TTGGCAGCAA AGAAAATAAA GACTCTTTTG ATAATATTTT TGAAGTAATT TCTAAAGAAT ATAATCTTAC AAAATCTTTA ATACTTGAAA TTACTGGTAA AAATAAGCTC CTAGAATCTG ATAGAGACTT GAAATCATCA GTAAGCTTGA GAAATAAGAC AATAATTCCA TTAGGGTTTT TGCAGGTTTC ACTTCTAAGA AGATTAAGAG ACCAGACAAG ACAACCTCCA ATAAGCGAAT TCCTTATAGA TAAGGACGAA TCCAGAAGAG CTTACAGCAG AAGTGAACTA TTAAGGGGTG CACTTTTAAC TATTAATGGG ATAGCAGCTG GCATGAGAAA TACAGGTTGA
|
Protein sequence | MESFRQIKNN NVDLISNNDP LDKNRLLIED LWESVLREEC PDDQAERLIQ LKELSYSKQI DGNSSKTFKN EIVDIVNSMD LAESIAAARA FSLYFQLVNI LEQRVEEDRY IQSFTNKDVQ KSPDNLDPFA PALARQNAPV TFRELFYRLR KLNVPPGKLE ELLQEMDIRL VFTAHPTEIV RHTIRHKQTR VANLLKKIQI EQFLTKDEKI SLKTQLKEEV RLWWRTDELH QFKPSVLDEV DYSLHYFQQV LFNAMPQLRG RITEALTENY PDVQLPPESF CNFGSWVGSD RDGNPSVTPD ITWRTACYQR QLMLDRYIVA TSNLRDQLSV SMQWSQVSSS LLESLETDRV KFPEIYEARA TRYRSEPYRL KLSYILEKLR LTQERNNLLA DSGWKFDFEG EIDNKNIDKV ENLYYKSVNE FTYDLELIKN SLISTDLTCD SVNNLLTQVH IFGFSLASLD IRQESTRHSD AIQELTNYLD LSVQYDQMSE EEKIKWLIDE LNTKRPLIPS DVNWTKTTEE TFSVFKMVKR LQQEFGSRIC HSYVISMSHS ASDLLEVLLL AKEMGLLDQD SQKSKLLVVP LFETVEDLQR APEVMEKLFK LDFYKSLLPK VGESFKPLQE LMLGYSDSNK DSGFVSSNWE IHRAQIALQN LSSRNNILLR LFHGRGGSVG RGGGPAYQAI LAQPSGTLKG RIKITEQGEV LASKYSLPEL ALYNLETVTT AVIQNSLVNS RLDATPEWNQ LMSRLAETSR SHYRKLVHEN PDLLNFFQEV TPIEEISKLQ ISSRPARRKK GAKDLSSLRA IPWVFGWTQS RFLLPSWFGV GTALSAELNS DPQQIELLRV LHQRWPFFRM LISKVEMTLS KVDLEVARYY VDTLGSKENK DSFDNIFEVI SKEYNLTKSL ILEITGKNKL LESDRDLKSS VSLRNKTIIP LGFLQVSLLR RLRDQTRQPP ISEFLIDKDE SRRAYSRSEL LRGALLTING IAAGMRNTG
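
The numeric fields in this record can be cross-checked against one another. The following is a minimal Python sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; both assumptions are consistent with the values listed above. The function names (gene_length, protein_length, clean, gc_percent) are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean(seq: str) -> str:
    """Drop the spaces and line breaks that group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


# Values taken from the Gene Information fields of this record.
print(gene_length(1487262, 1490231))  # 2970, matches "Gene Length"
print(protein_length(2970))           # 989, matches "Protein Length"
```

To compare against the reported GC content, pass the full text of the Gene sequence field to gc_percent; the spaces between the 10-base blocks are removed by clean before counting.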