Gene Information |
Locus tag | P9303_22741 |
Symbol | ppc |
ID | 4778665 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2009618 |
End bp | 2012626 |
Gene Length | 3009 bp |
Protein Length | 1002 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640087792 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_001018274 |
Protein GI | 124023967 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
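The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon that is not counted in the protein length. The helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 2009618
end_bp = 2012626
gene_length_bp = 3009
protein_length_aa = 1002


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 3009
print(expected_protein_length(gene_length_bp))  # 1002
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields of the record.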
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.114136 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAAAC CTGAGTCCAC TAGCGCATCG ATGCAGCAGT CTTCCGCCCA AAAACCTGAT TGCGATCAGC CCAGGGCAAT CGGGGAAGGG CAGCAGGCAG GACGCTTACT GCAGAACCGC CTTGAACTGG TGGAAGACCT TTGGCAAACG GTGCTACGCA GCGAATGTCC ACCGGATCAG GCAGAACGAC TACTTCGACT GAAGCAACTC AGTGAACCTT TGGCCCTTGA AGGTGCAGAT GAGAACAGTG CCAGCACAGC AATCGTGCTG TTGATCAAAG AGATGGACCT GGCAGAAGCG ATCACCGCAG CACGCGCCTT CTCCCTCTAT TTCCAACTGG TGAATATCCT CGAGCAACGG ATCGAGGAAG ACAGCTACCT CGCAAGTATG TCTTCTGGCA AAGAAAACAA TCGCCAAGAC AAACCCTATG ACCCCTTTGC TCCACCGCTC GCCACGCAGA CAGACCCAGC CACATTCAGC GAGCTGTTCG AACGACTTCG CCTGCTCAAC GTGCCACCAG CTCAGCTGGA GACCCTTCTA CAGGAGATGG ATATCCGCCT AGTTTTCACT GCTCATCCCA CTGAAATCGT CCGTCATACC GTTCGCCATA AGCAACGCAA GGTTGCAAAC CTGTTGCAAC AGTTGCAATC AGATCCAACA AAATCGTCAT CTGAAAAAGA AAGCCTGAGA CTGCAGCTTG AGGAGGAGAT CAGGCTCTGG TGGCGGACCG ATGAGCTTCA TCAGTTCAAG CCCTCCGTTC TGGATGAGGT GGATTACGCC CTCCACTATT TCCAACAGGT GCTTTTTGAT GCCATGCCCC AGCTACGGCG TCGCCTGATC ACTGCAATGG CCGAGAGCTA TCCAGATGTA CACATTCCAC AAGCTGCCTT CTGCACTTTT GGATCTTGGG TTGGATCAGA CCGAGACGGC AATCCTTCCG TTACACCAGA AATCACCTGG CGCACCGCCT GTTATCAACG TCAGCTGATG CTGGAGCGCT ATGTCAATGC CGTTCAAAAA CTCCGTGATC AACTCAGCAT CTCCATGCAA TGGAGCCAGG TAAGCACTCC ACTGTTGGAG TCACTGGAAA TGGACCGGCT TCGATTCCCC GAGGTTTACG AGGAACGAGC AGCCCGCTAT CGACTCGAGC CTTATCGCCT CAAACTCAGC TACACCCTGG AGAGGTTGAA ACTCACTCAA GAACGCAACC AGCAATTGGC CGAAGCAGGC TGGCAAACCC CACCAGAAGG CCTCAACCCC AGTCTCAATC TAATTAATGC GGGAGAAGCC CTTCACTACA AATCAGTAGC AGAATTCCGC AGCGACCTGG AACTGATCCG CAACAGCCTG GTCAGCACAG ATCTGAGCTG TGAGCCTCTG GATACCCTCT TAAATCAGGT CCACATCTTC GCTTTCTCAC TAGCCAGCCT CGACATCCGT CAGGAGAGCA CTCGCCACAG CGATGCTCTC GACGAATTAA CCCGCTACCT AAACCTGCCT AAGGCCTATG GCGACATGGC CGAAAATGAG CGGGTGCAAT GGTTAATAGA GGAACTACAG ACTCGCCGCC CCCTGATCCC CTCTGCCGTT ATCTGGTCGC CCAGTACAGC AGAAACGGTG GCCGTGTTTC GCATGCTGCA CCGACTTCAG GAGGAATTCG GCAGCCGAAT CTGCCGCACA TATGTGATCT CAATGAGTCA CACAGTTTCC GATCTACTCG AGGTGCTGCT GCTAGCCAAA GAAGCCGGCC TCGTGGATCC GGCGGCTGGC CATGCCGAAC TGCTTGTGGT GCCCCTATTC 
GAAACCGTAG AGGATCTCCA GCGGGCTCCA GCAGTGATGG AGGCACTTTT AAGTTCACCT GTCTACCGCA ATTTGCTCCC TCGTGTCAGC GAACAGGTCC AACCTCTGCA GGAGCTGATG CTTGGCTATT CGGACAGCAA CAAAGACTCC GGATTTCTAT CCAGCAATTG GGAGATCCAT CAAGCCCAAA TTGCCCTGCA AGATCTAGCT AACCGCCAGG GTGTGGCTCT GCGCCTTTTC CATGGACGCG GTGGATCTGT TGGCAGAGGA GGTGGCCCTG CCTACCAAGC AATCCTTGCC CAACCAAGCG GAACCGTGCG CGGCCGAATC AAGATCACTG AGCAGGGAGA AGTGCTGGCA TCGAAATACA GCCTGCCCGA GCTAGCCCTT TACAACCTAG AAACATTCAC TACGGCCGTC CTGCAAAACA GTCTGGTAAC CAACCAGCTG GATGCCACCC CAAGCTGGAA CCAACTGATG ACCAGACTGG CTGGTCGTTC ACGTGAGCAC TATCGGGCCC TCGTCCACAA CAACCCTGAT CTGGTGGCCT TCTTCCAACA GGTCACGCCG ATCGAAGAAA TCAGCAAATT GCAAATCTCC AGCCGTCCTG CTAGACGCAA AAGCGGCGCC AAGGACCTCT CTAGCCTACG AGCGATCCCC TGGGTATTTG GCTGGACGCA AAGTCGCTTT CTACTACCGA GCTGGTTTGG CGTGGGAACA GCACTAGCCG CGGAAGTCGA ATCAGACGCT GACCAGCTTG ACCTTCTGCG CAGACTGCAC CAACGCTGGC CATTCTTCCG AATGCTGATC TCCAAGGTGG AGATGACCCT TTCAAAAGTA GACCTAGACC TGGCCCATCA CTACATGACT AGCCTTGGCA GCGAAGATTA CCGTGAAGCC TTTAACCGTA TCTTCGAGAT CATCGAAACG GAGTACAGCC TCACCCGCCG GTTGGTCTTA AACATCACCG GACAACCCAG GTTGCTCGGG GCCGATCCCG CCCTACAGCA ATCAGTAGAT CTTCGCAATC GCACGATCGT GCCACTTGGT TTTCTGCAAG TGGCCCTGCT TCGCAAACTG CGTGATCAGA ACCGGCAGCC ACCAATGAAC GAGGCTGGCG ATGGTCGCAC ATACAGCCGC AGCGAACTTC TAAGAGGCGC ACTGCTCACC ATTAACGGCA TTGCTGCAGG CATGCGCAAC ACCGGTTGA
|
Protein sequence | MAKPESTSAS MQQSSAQKPD CDQPRAIGEG QQAGRLLQNR LELVEDLWQT VLRSECPPDQ AERLLRLKQL SEPLALEGAD ENSASTAIVL LIKEMDLAEA ITAARAFSLY FQLVNILEQR IEEDSYLASM SSGKENNRQD KPYDPFAPPL ATQTDPATFS ELFERLRLLN VPPAQLETLL QEMDIRLVFT AHPTEIVRHT VRHKQRKVAN LLQQLQSDPT KSSSEKESLR LQLEEEIRLW WRTDELHQFK PSVLDEVDYA LHYFQQVLFD AMPQLRRRLI TAMAESYPDV HIPQAAFCTF GSWVGSDRDG NPSVTPEITW RTACYQRQLM LERYVNAVQK LRDQLSISMQ WSQVSTPLLE SLEMDRLRFP EVYEERAARY RLEPYRLKLS YTLERLKLTQ ERNQQLAEAG WQTPPEGLNP SLNLINAGEA LHYKSVAEFR SDLELIRNSL VSTDLSCEPL DTLLNQVHIF AFSLASLDIR QESTRHSDAL DELTRYLNLP KAYGDMAENE RVQWLIEELQ TRRPLIPSAV IWSPSTAETV AVFRMLHRLQ EEFGSRICRT YVISMSHTVS DLLEVLLLAK EAGLVDPAAG HAELLVVPLF ETVEDLQRAP AVMEALLSSP VYRNLLPRVS EQVQPLQELM LGYSDSNKDS GFLSSNWEIH QAQIALQDLA NRQGVALRLF HGRGGSVGRG GGPAYQAILA QPSGTVRGRI KITEQGEVLA SKYSLPELAL YNLETFTTAV LQNSLVTNQL DATPSWNQLM TRLAGRSREH YRALVHNNPD LVAFFQQVTP IEEISKLQIS SRPARRKSGA KDLSSLRAIP WVFGWTQSRF LLPSWFGVGT ALAAEVESDA DQLDLLRRLH QRWPFFRMLI SKVEMTLSKV DLDLAHHYMT SLGSEDYREA FNRIFEIIET EYSLTRRLVL NITGQPRLLG ADPALQQSVD LRNRTIVPLG FLQVALLRKL RDQNRQPPMN EAGDGRTYSR SELLRGALLT INGIAAGMRN TG
|
| |
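The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last few codons. This is a minimal sketch under stated assumptions, not the tool used to produce the record. The function name `translate` is hypothetical. The codon-to-amino-acid assignments used are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). An alternative start codon would therefore be translated literally by this sketch rather than as M. That does not matter here, because the gene begins with ATG.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon.

    Whitespace (such as the 10-base grouping used in the record) is removed,
    and any trailing bases that do not form a full codon are ignored.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGGCCAAAC CTGAGTCCAC TAGCGCATCG"))  # MAKPESTSAS
print(translate("ACCGGTTGA"))                         # TG*
```

The first ten residues match the start of the protein sequence, and the final codon TGA is a stop codon, consistent with the protein ending in TG.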