Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_20211 |
Symbol | ppc |
ID | 4779666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1663493 |
End bp | 1666477 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640085314 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_001015841 |
Protein GI | 124026726 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGAAAA ATCCTTCTAA CGAAAATATC TCTAATCACT CGACAGTATG TGTTGAGGAT CAAGATCCTG GATCTTTATT GCAACAAAGG CTTGAACTAG TGGAGGATCT ATGGAAAACC GTTCTCAAAA GTGAATGTCC ACCTGATCAA ACAGAGAGAT TATTGCGATT AAAACAATTA AGTGATCCTA GTAAGTCAAA TCAAGACAAT TCCTCAAAGG CAATAGTCCA ATTGATTACA AAAATGGATT TGGCTGAGGC GATATCTGCT GCTAGGGCTT TTTCTCTTTA TTTTCAGTTG GTAAACATTC TGGAGCAACG CATAGAAGAG GATAGTTATT TAGAAAGCAT AGAAAAAGGG AAGTTAGACA CCAGTAATTA TAAAATAGAT CCATTCGCTC CAGCTTTAGC TAGTCAGACT GCTCCAGCAA CTTTTACACA ATTATTTGAG CGATTACGTC GTTTGAATGT CCCTCCAGCT CAGCTTGATG GTTTGATGAG AGAAATGGAT ATTCGTCTTG TTTTTACAGC TCATCCAACT GAAATAGTTA GACACACGGT TCGCCATAAG CAACGCAGGG TTGCCACTCT TTTGCAGCAA CTTCAATCTA ATAGCTTGAT TTCCGAATCA GAGAAAGAAA TATTTAGGTT GCAGCTAGAG GAGGAGATAA GACTTTGGTG GAGAACTGAT GAGCTTCATC AGTTTAAACC GACTGTCCTT GATGAGGTTG ACTATGCCCT TCATTATTTT CAGCAGGTTT TGTTTGATGC AATGCCTCAA TTAAGAAGGA GACTGACTAC CGCACTTGCT TCAAGTTATC CAGACGTAGA GATCCCTAAT GAAGCTTTTT GCACATTTGG CTCTTGGGTA GGTTCAGATC GTGATGGGAA TCCGTCAGTT ACTCCTGAAA TCACATGGAG AACAGCGTGT TACCAAAGAC AATTAATGTT GGATAGGTAT ATTGCATCAG TTCAAGATCT AAGAGACCAA CTCAGTATAT CTATGCAATG GAGTCAAGTA AGTTCTCCTT TGTTAGAGTC ATTGGAAATG GACAGAGTTC GTTTCCCTGA GGTTTATGAG GAAAGGGCTG CAAGATATAG ACTGGAACCT TATCGCTTGA AGCTTAGCTA TACACTTGAG AGGCTACGAC TTACTCAACT ACGTAATAAG CAGCTAGCGG ATGCTGGATG GCAATTCTCA CCCGATGGGA AGCCGCAAAT ATCTACTAAT AATAGTTTTG ATGAAGTACT CCACTACAAA TCTGTAGAAG AATTAAAAAA TGAATTAGAG CTTATTAGAA ATAGTTTGGT TAGTACAGAT CTTACTTGTG AACCATTAGA TACTTTGCTA AATCAAGTTC ATATTTTTGG GTTCTCGTTG GCTAGTTTAG ACATTAGACA AGAAAGCACA CGACATAGTG ATGCGTTGGA TGAGCTCACT CGCTATTTAG ATCTCCCTGA GTCGTATGGA GTGATGAGCG AGGAAAGTCG AGTTCAATGG TTGATGAAAG AATTAAGAAC TCGGAGACCA CTTATTCCGC CTTCTTTTGA GTGGTCTAAA AGTACTCAAG AAACTATCTC AGTTTTTCAT ATGCTTCATA GGCTTCAGAA AGAATTTGGT ACTCGTATAT GTCGCTCGTA TGTAATTTCG ATGAGTCATA CGGCATCAGA TTTATTAGAA GTTCTTCTTT TAGCTAAAGA GTCGGGTTTG ATTGATCCAA CTTTAGGAGC TTCTGATCTT CTTGTTGTTC CATTATTTGA AACGGTTGAG GACTTACAAC ATGCTCCTTC TGTAATGGAG TCGTTACTAC AATCTGATGT TTATCGCGAA TTACTTCCAC GAGTAGGAGA GAAAAAACAA CCGCTTCAAG AACTTATGCT GGGATATTCC GATAGTAATA AGGATTCTGG TTTTCTCTCA AGTAATTGGG AAATTCATAA GGCCCAAATA GCACTCCAAG ACCTAGCTAG TAGACAAGGA ATAGCATTAC GTATTTTTCA TGGTAGAGGT GGGTCCGTAG GAAGAGGCGG TGGACCAGCT TATCAAGCTA TTTTGGCTCA ACCTAGTGGT ACACTTCAGG GACGTATAAA AATAACAGAG CAAGGGGAAG TACTTGCTTC AAAATATAGT CTTCCAGAAT TAGCTTTATA TAATCTGGAA ACTGTAACCA CTGCAGTTAT TCAAAATAGC TTGGTTACCA ATAAATTGGA TGCTACGCCA AGTTGGAATG AATTGATGAC CAGACTTGCA GCTCGTTCAA GGGAGCATTA CCGAGCTTTA GTTCATGATA ATCCAGATTT AGTTCAATTT TTTCAGGTAG TTACTCCAAT AGAAGAGATA AGTAAGTTGC AAATTTCTAG TCGTCCTGCT CGACGAAAGA GTGGTGCAAA AGACTTATCA AGTCTTCGAG CTATCCCATG GGTCTTTGGT TGGACTCAAA GTCGTTTCCT TTTGCCAAGT TGGTTTGGTG TTGGTACGGC TTTAGCTACT GAATTAAAGG CTGACCCCGA CCAAATGGAG ATGTTGCGAA TGTTGAATCA GAGATGGCCA TTCTTTAGAA TGTTGATATC TAAAGTAGAG ATGACACTTT CAAAAGTTGA TTTAGATGTT GCCCATCATT ATGTGGTTAG TTTGGGTGGA AGTGATGATC GGGATGCTTT CGCTAGCATT TTCGATATTA TCTCAAGCGA ATACAGCTTG ACTAAGAAAT TAATTTTAGA AATTACTGGC AAGTCAAAAC TATTAAGTGC AGACCCTGCT TTGCAGTTGT CTGTCAACCT GAGAAATAGG ACTATTGTCC CTTTAGGATT TTTACAAGTT GCTCTTCTCA AGCGATTAAG AGATCAGAAT CGTCAACCAC CAATTAGTGA AGATGTAAGT ATTGACTCTA CTCAAAGTTC TCGTACATAT AGCCGTAGTG AATTATTGCG TGGTGCATTG TTGACTATCA ATGGTATCGC TGCAGGTATG AGAAACACAG GATGA
|
Protein sequence | MLKNPSNENI SNHSTVCVED QDPGSLLQQR LELVEDLWKT VLKSECPPDQ TERLLRLKQL SDPSKSNQDN SSKAIVQLIT KMDLAEAISA ARAFSLYFQL VNILEQRIEE DSYLESIEKG KLDTSNYKID PFAPALASQT APATFTQLFE RLRRLNVPPA QLDGLMREMD IRLVFTAHPT EIVRHTVRHK QRRVATLLQQ LQSNSLISES EKEIFRLQLE EEIRLWWRTD ELHQFKPTVL DEVDYALHYF QQVLFDAMPQ LRRRLTTALA SSYPDVEIPN EAFCTFGSWV GSDRDGNPSV TPEITWRTAC YQRQLMLDRY IASVQDLRDQ LSISMQWSQV SSPLLESLEM DRVRFPEVYE ERAARYRLEP YRLKLSYTLE RLRLTQLRNK QLADAGWQFS PDGKPQISTN NSFDEVLHYK SVEELKNELE LIRNSLVSTD LTCEPLDTLL NQVHIFGFSL ASLDIRQEST RHSDALDELT RYLDLPESYG VMSEESRVQW LMKELRTRRP LIPPSFEWSK STQETISVFH MLHRLQKEFG TRICRSYVIS MSHTASDLLE VLLLAKESGL IDPTLGASDL LVVPLFETVE DLQHAPSVME SLLQSDVYRE LLPRVGEKKQ PLQELMLGYS DSNKDSGFLS SNWEIHKAQI ALQDLASRQG IALRIFHGRG GSVGRGGGPA YQAILAQPSG TLQGRIKITE QGEVLASKYS LPELALYNLE TVTTAVIQNS LVTNKLDATP SWNELMTRLA ARSREHYRAL VHDNPDLVQF FQVVTPIEEI SKLQISSRPA RRKSGAKDLS SLRAIPWVFG WTQSRFLLPS WFGVGTALAT ELKADPDQME MLRMLNQRWP FFRMLISKVE MTLSKVDLDV AHHYVVSLGG SDDRDAFASI FDIISSEYSL TKKLILEITG KSKLLSADPA LQLSVNLRNR TIVPLGFLQV ALLKRLRDQN RQPPISEDVS IDSTQSSRTY SRSELLRGAL LTINGIAAGM RNTG
|
| |