Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_17821 |
Symbol | ppc |
ID | 4718516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 1513603 |
End bp | 1516572 |
Gene Length | 2970 bp |
Protein Length | 989 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640079512 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_001010172 |
Protein GI | 123969314 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATCTT TTCGCCAGGT AAAAAATAAT AATGTGGATC TGATAAGTAA CAATGATCCA CTTGATAAAA ATCGTCTTCT AATTGAAGAT CTCTGGGAAT CTGTGCTAAG AGAAGAATGC CCAGATGATC AAGCAGAAAG GTTGATACAG CTTAAAGAAT TAAGTTATTC AAAACAAATT GATGGCGATA GTTCAAAAAC TTTTAAAAAT GAAATAGTTG ATATTGTAAA TTCTATGGAT TTAGCAGAAT CCATAGCTGC AGCAAGGGCT TTTTCATTAT ATTTTCAACT TGTGAATATT TTGGAACAAA GAGTTGAGGA AGATAGATAT ATTCAAAGCT TTACCAATAA GGATGTTCAA AAATCGCCCG ACAATCTTGA TCCTTTTGCC CCAGCATTGG CTAGGCAAAA TGCTCCTGTA ACTTTTAGAG AATTGTTTTA CAGGCTTAGG AAACTAAATG TGCCACCAGG CAAATTAGAA GAGTTATTAC AGGAAATGGA TATTCGTTTA GTTTTTACTG CACATCCGAC GGAGATAGTA AGACATACGA TTAGACATAA GCAGACAAGA GTAGCAAATT TGTTAAAAAA AATACAGATT GAGCAATTTC TGACAAAAGA AGAAAAAAAT TCTTTAAAAA CCCAATTAAA AGAGGAAGTA AGACTTTGGT GGAGGACTGA TGAATTGCAT CAATTTAAAC CTTCAGTTTT AGATGAAGTT GATTATGCCT TGCATTATTT TCAGCAAGTT TTATTTAATG CGATGCCTCA GTTGAGGGGA AGGATCGCTG AAGCACTTAC CGATAATTAT CCAGATGTTC AGTTGCCCTC AGAATCTTTT TGCAACTTCG GTTCTTGGGT AGGCTCTGAT AGGGACGGTA ATCCATCGGT CACTCCTGAG ATAACATGGA GAACTGCTTG CTACCAAAGG CAGTTGATGT TGGAAAGATA TATTATTGCG ACGTCTAATC TTAGAGATCA ATTAAGTGTT TCGATGCAAT GGAGTCAAGT CAGTTCCTCC TTATTAGAGT CACTTGAAAC TGACAGGGTT AAGTTCCCTG AAATATATGA AGCTAGAGCT ACAAGGTATA GATCAGAACC CTACAGATTA AAATTAAGTT ATATCCTAGA GAAATTAAGA TTAACACAAG AAAGAAATAA TTTATTAGCT GATAGTGGAT GGAAATTTGA CTTGGAAGGA GAAACTGATA ACAAAAATTT AGATAAAGTT GAAAGCTTAT ATTACAAGTC AGTAAAAGAA TTTACTTATG ATCTAGAACT TATCAAAAAT AGTTTAATTA GTACAGATTT AAATTGTGAG TCTGTAAACA CCTTACTTAC TCAAGTTCAT ATTTTTGGAT TTTCCTTAGC AAGTTTAGAT ATTCGTCAAG AGAGTACAAG GCATAGTGAC GCTATTCAAG AGCTTACAAA TTATCTTGAT TTATCTGTTC AATATGACCA AATGTCTGAG GAAGAGAAAA TTAAATGGCT TATAGACGAA TTAAATACAA AAAGGCCTTT AATTCCATCT GACGTTCACT GGACAAAAAC CACAGAAGAA ACATTTTCAG TTTTTAAAAT GGTTAAGAGA CTACAGCAAG AATTTGGAAG TCGCATTTGT CATTCTTATG TAATTTCAAT GAGTCATAGT GCATCTGATT TGCTTGAAGT TCTCTTACTG GCAAAAGAAA TGGGACTTCT TGATCAAAAT TCACAAAAGT CAAAATTATT AGTTGTTCCT CTTTTTGAAA CTGTGGAAGA CCTTAAAAGA GCACCAGAAG TAATGGAAAA GTTGTTTAAA TTAGATTTCT ATAGATCATT ATTGCCAAAA GTAGGAGAAT CTTTTAAACC TCTGCAAGAA TTAATGCTTG GATATTCTGA TAGCAATAAA GATTCGGGGT TTGTTTCTAG TAATTGGGAA ATTCATAGAG CCCAAATAGC TCTTCAAAAT CTTTCAAGTA GGAATAACAT ATTGTTAAGA CTGTTTCATG GAAGAGGTGG TTCTGTAGGT AGAGGAGGAG GACCAGCCTA TCAGGCAATA TTGGCTCAAC CAAGCGGTAC TTTAAAAGGG CGAATAAAAA TAACAGAACA AGGAGAAGTT TTAGCTTCTA AATATAGTCT TCCGGAACTT GCTTTATACA ATCTTGAAAC TGTAACTACA GCGGTAATTC AAAATAGCTT GGTAAATAAT AGACTTGACG CTACTCCAGA ATGGAATCAA TTAATGTCTA GGTTGGCAGA AACATCAAGG TCTCACTACC GAAAATTAGT GCATGAGAAT CCTGACTTGT TGAATTTCTT TCAAGAGGTC ACTCCAATAG AAGAAATAAG TAAATTACAG ATATCCAGTA GGCCTGCAAG AAGAAAAAAA GGTGCAAAAG ATTTATCAAG TTTACGAGCT ATTCCATGGG TATTTGGATG GACACAAAGT AGATTTCTTT TACCAAGTTG GTTTGGAGTA GGTACTGCAT TGTCATCTGA ATTAAATTTA GATCCACAAC AAATTGAATT ACTAAGAGTC TTGCATCAAA GATGGCCATT TTTTAGGATG CTTATATCTA AGGTAGAAAT GACATTATCT AAGGTGGATT TAGAAGTGGC AAGATATTAT GTTGATACTC TTGGCAGTAA AGAAAATAAA GATTCTTTTG ATAATATTTT TGAAGTAATT TCTAAAGAAT ATAATCTCAC GAAATCTTTA ATACTTGAAA TTACTGGTAA AAATAAGCTT CTTGAATCTG ATAGAGACTT GAAGTCATCT GTAAGCTTGA GAAATAAGAC AATCATTCCA TTGGGGTTTT TGCAAGTTTC ACTTTTAAGA AGATTAAGAG ACCAGACAAG ACAACCCCCA ATAAGCGAGT TTTTTCTGGA TAAAGATGAA TCTACAAGAG CTTACAGCAG AAGTGAACTA TTAAGGGGAG CACTTTTAAC TATTAATGGG ATAGCAGCTG GTATGAGAAA TACAGGATAA
|
Protein sequence | MESFRQVKNN NVDLISNNDP LDKNRLLIED LWESVLREEC PDDQAERLIQ LKELSYSKQI DGDSSKTFKN EIVDIVNSMD LAESIAAARA FSLYFQLVNI LEQRVEEDRY IQSFTNKDVQ KSPDNLDPFA PALARQNAPV TFRELFYRLR KLNVPPGKLE ELLQEMDIRL VFTAHPTEIV RHTIRHKQTR VANLLKKIQI EQFLTKEEKN SLKTQLKEEV RLWWRTDELH QFKPSVLDEV DYALHYFQQV LFNAMPQLRG RIAEALTDNY PDVQLPSESF CNFGSWVGSD RDGNPSVTPE ITWRTACYQR QLMLERYIIA TSNLRDQLSV SMQWSQVSSS LLESLETDRV KFPEIYEARA TRYRSEPYRL KLSYILEKLR LTQERNNLLA DSGWKFDLEG ETDNKNLDKV ESLYYKSVKE FTYDLELIKN SLISTDLNCE SVNTLLTQVH IFGFSLASLD IRQESTRHSD AIQELTNYLD LSVQYDQMSE EEKIKWLIDE LNTKRPLIPS DVHWTKTTEE TFSVFKMVKR LQQEFGSRIC HSYVISMSHS ASDLLEVLLL AKEMGLLDQN SQKSKLLVVP LFETVEDLKR APEVMEKLFK LDFYRSLLPK VGESFKPLQE LMLGYSDSNK DSGFVSSNWE IHRAQIALQN LSSRNNILLR LFHGRGGSVG RGGGPAYQAI LAQPSGTLKG RIKITEQGEV LASKYSLPEL ALYNLETVTT AVIQNSLVNN RLDATPEWNQ LMSRLAETSR SHYRKLVHEN PDLLNFFQEV TPIEEISKLQ ISSRPARRKK GAKDLSSLRA IPWVFGWTQS RFLLPSWFGV GTALSSELNL DPQQIELLRV LHQRWPFFRM LISKVEMTLS KVDLEVARYY VDTLGSKENK DSFDNIFEVI SKEYNLTKSL ILEITGKNKL LESDRDLKSS VSLRNKTIIP LGFLQVSLLR RLRDQTRQPP ISEFFLDKDE STRAYSRSEL LRGALLTING IAAGMRNTG
|
| |