Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_16951 |
Symbol | ppc |
ID | 5730126 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1519264 |
End bp | 1522287 |
Gene Length | 3024 bp |
Protein Length | 1007 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641286077 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_001551580 |
Protein GI | 159904236 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCATGT CAAAATCAGA ATTAGGCAAT ACACCCTTAG AGCAATCTTC AGATTTGCAT TCATCTTTCC GCAATAACCA AGCAAGTACT ACGAAAAACT CTGGAAGCTT GTTGCAACAG AGATTGGCAC TTGTTGAAGA TCTGTGGGAG ACAGTTTTAT GCAGTGAATG CCCATCAGAT CAAATTAATC GGGTTCTTCG TTTAAAGAAA TTGAGTAACC CGAATAATTT TGCAGGAGAA GAAGAGCAAA AAAACCCATT TAATGAAATT GTTTTGTTAA TTGAAGAGAT GGACTTGGCT GAGTCAATTG CAGCCGCGCG TGCGTTCTCT TTATACTTTC AACTTGTCAA TATTTTGGAG CAGAGAATTG AGGAAGACAC ATATCTAGAA ACTATTGGCA GGATCGAAGA AGATGCGTCT AATCAATTTG ATCCATTTGC CCCTCCTCTT GCGAGTCAGA CTGCACCAGC AACTTTCAGT CAATTATTTG AACGTTTAAA ACGTCTTAAT GTCCCTCCTG GGCAGCTAGA GGCATTAATG CAAGAAATAG ATATACGTCT TGTCTTTACC GCCCATCCTA CAGAAATAGT CCGACATACC GTTAGGCATA AGCAACGCAG GGTAGCCAAT TTGTTACAGC AATTTCACTC TGAAGCAGCA ATTTCTTTCT CTCAGAAAGA AAGCTTGCGA CAACAGTTAG AAGAAGAAAT ACGTCTTTGG TGGCGTACAG ATGAGCTACA TCAATTTAAA CCAACTGTTT TAGACGAGGT AGATTATGCA CTGCATTATT TCCAACAAGT TCTATTTGAC GCGATGCCCC AATTGCGTCG TCGGATTAAT GCAGCATTAG TTAAAAGTTA TCCTGATGTT GAGGTGCCTA GAGAATCTTT TTGCACATTT GGTTCTTGGG TGGGGTCAGA TCGTGATGGC AACCCATCAG TTACTCCGGA AATAACTTGG AGAGCAGCAT GTTATCAACG TCAATTAATG CTTGAGCGGT ATATCAACTC TGTTCAAGAC CTTAGGGATC AATTGAGTAT TTCTATGCAA TGGAGTCAGG TTGGCTCACA ATTGTTAGAG TCTTTAGAAA TGGATAGAGT TCAGTTTCCA GAGGTTTACG AGGAAAGAGC AGCGAGATAC AGGCTAGAGC CATATCGACT AAAGATTAGC TATATTTTGC AACGACTTAA GCTAACTCAT CAACGTAATC AAGAATTAGC AGCTGCAGGA TGGAAGATAG TTCCTGAAGC TATGAACTCC TTGATTACTT CTGGTAAGCC ATTTGAGGAT TTGTATTACT GCTCTATAGC CGAGTTTCGT AGTGATCTAG AGTTAATTCG CAATAGCCTG GTGGCAACAG ATCTTAGTTG TGAGTCGTTA GATACGCTAT TGACTCAGGT ACATATTTTT GGCTTTTCTC TTGCTAGTCT GGATATTCGA CAAGAGAGTA ACCGACATAG TGATGCTATA GACGAGGTTA CAAGATTTAT AGACCTCCCT GTCATCTACT CAGAAATGGA TGAGAAAGAA AAAGTTCAAT GGTTAATGCA AGAATTAGAA ACTCGTAGGC CATTAATTCC TTCTTCAGTT GATTGGTCAC CTTCTACTCA GGAAACAATA TCTGTTTTTC AGATGCTCCA CCGTTTACAG GAAGAGTTTG GTAGTCGGAT TTGCAGATCT TATGTAATTT CAATGAGTCA TTCAGTTTCT GATCTTTTAG AAGTACTTCT TCTCGCAAAA GAAGCTGGAC TAGTAGATAT TTCGTCTGGT TCAGCTGATT TACTAGTGGT GCCTTTATTT GAAACTGTTG AGGATCTGCA GCGAGCTCCC TCAGTGATGG AGGAACTCTT TAGCTCTAGT TTTTATCTCA ACTTATTACC TCGTGTTGGA GAGAAACTTC AGCCTTTACA AGAACTAATG CTTGGTTATT CAGATAGCAA TAAAGATTCT GGTTTTTTAT CAAGTAATTG GGAAATTCAT CAGGCACAAA TTGCACTTCA GAACCTTGCA AGTAGTCATG GTGTAGCTTT ACGTTTATTT CACGGTCGTG GAGGTTCTGT TGGGCGAGGT GGTGGCCCTG CTTATCAGGC AATCTTGGCT CAACCTAGTG GCACTTTAAA AGGACGAATT AAGATTACTG AGCAAGGGGA AGTTCTTGCT TCAAAATATA GCCTACCTGA ACTAGCAATG TATAACCTCG AAACTGTTAC GACAGCTGTG CTGCAAAATA GTTTGGTTAC TAATCAGTTA GATGCAACCC CTAGTTGGAA TGAGCTGATG ACTCGATTGG CAGCGCGTTC TAGGCGGCAT TACAGGTCGC TTGTACATGA CAACCCTGAC TTGGTGCCAT TTTTTCAAGA AGTGACGCCA ATTGAAGAAA TCAGTAAATT GCAAATTTCT AGTCGACCTA CTCGTAGAAA ATCAGGCAGT AAGGATTTAT CTAGTCTTAG GGCCATACCT TGGGTCTTTG GTTGGACTCA GAGTAGGTTT CTTTTGCCCA GTTGGTTTGG AGTCGGGCAT GCTTTGTTTA TAGAACTTGA AGAAGATCCT GCACAAATTG AGCTTTTGAA AATGCTTCAT CAGCGTTGGC CTTTCTTTCG AATGTTGATT TCTAAGGTAG AGATGACTCT TTCGAAAGTA GATCTAGAGG TGGCTAATCA CTATGTCACT AGTCTTGGTA GTAGGCAGAA TAAAGAAGCT TTTAATAAAA TTTTTGAGGT GATTTCTGAT GAATACCATC TCACTAAGAG GTTGGTTTTA AGAATAACTG GAAAATCAAA ACTTCTTAGC GCTGATCCTG CATTGCAAGC GTCTGTAGAG CTTAGAAACC GTACGATTGT ACCTTTAGGG TTTCTTCAAG TGGCACTATT ACGTCGATTA CGTGATCAAA AGCGTCAACC ACCTGTGAGC GAGGAGCTTT TTAATGAACG AGATCTTGCT CGTACTTATA GTCGCAGTGA ATTATTGAGA GGAGCTTTGT TGACTATTAA TGGCATTGCT GCGGGAATGC GAAATACAGG TTGA
|
Protein sequence | MIMSKSELGN TPLEQSSDLH SSFRNNQAST TKNSGSLLQQ RLALVEDLWE TVLCSECPSD QINRVLRLKK LSNPNNFAGE EEQKNPFNEI VLLIEEMDLA ESIAAARAFS LYFQLVNILE QRIEEDTYLE TIGRIEEDAS NQFDPFAPPL ASQTAPATFS QLFERLKRLN VPPGQLEALM QEIDIRLVFT AHPTEIVRHT VRHKQRRVAN LLQQFHSEAA ISFSQKESLR QQLEEEIRLW WRTDELHQFK PTVLDEVDYA LHYFQQVLFD AMPQLRRRIN AALVKSYPDV EVPRESFCTF GSWVGSDRDG NPSVTPEITW RAACYQRQLM LERYINSVQD LRDQLSISMQ WSQVGSQLLE SLEMDRVQFP EVYEERAARY RLEPYRLKIS YILQRLKLTH QRNQELAAAG WKIVPEAMNS LITSGKPFED LYYCSIAEFR SDLELIRNSL VATDLSCESL DTLLTQVHIF GFSLASLDIR QESNRHSDAI DEVTRFIDLP VIYSEMDEKE KVQWLMQELE TRRPLIPSSV DWSPSTQETI SVFQMLHRLQ EEFGSRICRS YVISMSHSVS DLLEVLLLAK EAGLVDISSG SADLLVVPLF ETVEDLQRAP SVMEELFSSS FYLNLLPRVG EKLQPLQELM LGYSDSNKDS GFLSSNWEIH QAQIALQNLA SSHGVALRLF HGRGGSVGRG GGPAYQAILA QPSGTLKGRI KITEQGEVLA SKYSLPELAM YNLETVTTAV LQNSLVTNQL DATPSWNELM TRLAARSRRH YRSLVHDNPD LVPFFQEVTP IEEISKLQIS SRPTRRKSGS KDLSSLRAIP WVFGWTQSRF LLPSWFGVGH ALFIELEEDP AQIELLKMLH QRWPFFRMLI SKVEMTLSKV DLEVANHYVT SLGSRQNKEA FNKIFEVISD EYHLTKRLVL RITGKSKLLS ADPALQASVE LRNRTIVPLG FLQVALLRRL RDQKRQPPVS EELFNERDLA RTYSRSELLR GALLTINGIA AGMRNTG
|
| |