Gene A9601_17821 details


Gene Information

Locus tag: A9601_17821
Symbol: ppc
ID: 4718516
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1513603
End bp: 1516572
Gene Length: 2970 bp
Protein Length: 989 aa
Translation table: 11
GC content: 34%
IMG OID: 640079512
Product: phosphoenolpyruvate carboxylase
Protein accession: YP_001010172
Protein GI: 123969314
COG category: [C] Energy production and conversion
COG ID: [COG2352] Phosphoenolpyruvate carboxylase
TIGRFAM ID:
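
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # the stop codon is not translated


# Values taken from the record above.
length = gene_length_bp(1513603, 1516572)
print(length)                     # 2970
print(protein_length_aa(length))  # 989
```

Both results agree with the Gene Length and Protein Length fields of the record.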


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAATCTT TTCGCCAGGT AAAAAATAAT AATGTGGATC TGATAAGTAA CAATGATCCA 
CTTGATAAAA ATCGTCTTCT AATTGAAGAT CTCTGGGAAT CTGTGCTAAG AGAAGAATGC
CCAGATGATC AAGCAGAAAG GTTGATACAG CTTAAAGAAT TAAGTTATTC AAAACAAATT
GATGGCGATA GTTCAAAAAC TTTTAAAAAT GAAATAGTTG ATATTGTAAA TTCTATGGAT
TTAGCAGAAT CCATAGCTGC AGCAAGGGCT TTTTCATTAT ATTTTCAACT TGTGAATATT
TTGGAACAAA GAGTTGAGGA AGATAGATAT ATTCAAAGCT TTACCAATAA GGATGTTCAA
AAATCGCCCG ACAATCTTGA TCCTTTTGCC CCAGCATTGG CTAGGCAAAA TGCTCCTGTA
ACTTTTAGAG AATTGTTTTA CAGGCTTAGG AAACTAAATG TGCCACCAGG CAAATTAGAA
GAGTTATTAC AGGAAATGGA TATTCGTTTA GTTTTTACTG CACATCCGAC GGAGATAGTA
AGACATACGA TTAGACATAA GCAGACAAGA GTAGCAAATT TGTTAAAAAA AATACAGATT
GAGCAATTTC TGACAAAAGA AGAAAAAAAT TCTTTAAAAA CCCAATTAAA AGAGGAAGTA
AGACTTTGGT GGAGGACTGA TGAATTGCAT CAATTTAAAC CTTCAGTTTT AGATGAAGTT
GATTATGCCT TGCATTATTT TCAGCAAGTT TTATTTAATG CGATGCCTCA GTTGAGGGGA
AGGATCGCTG AAGCACTTAC CGATAATTAT CCAGATGTTC AGTTGCCCTC AGAATCTTTT
TGCAACTTCG GTTCTTGGGT AGGCTCTGAT AGGGACGGTA ATCCATCGGT CACTCCTGAG
ATAACATGGA GAACTGCTTG CTACCAAAGG CAGTTGATGT TGGAAAGATA TATTATTGCG
ACGTCTAATC TTAGAGATCA ATTAAGTGTT TCGATGCAAT GGAGTCAAGT CAGTTCCTCC
TTATTAGAGT CACTTGAAAC TGACAGGGTT AAGTTCCCTG AAATATATGA AGCTAGAGCT
ACAAGGTATA GATCAGAACC CTACAGATTA AAATTAAGTT ATATCCTAGA GAAATTAAGA
TTAACACAAG AAAGAAATAA TTTATTAGCT GATAGTGGAT GGAAATTTGA CTTGGAAGGA
GAAACTGATA ACAAAAATTT AGATAAAGTT GAAAGCTTAT ATTACAAGTC AGTAAAAGAA
TTTACTTATG ATCTAGAACT TATCAAAAAT AGTTTAATTA GTACAGATTT AAATTGTGAG
TCTGTAAACA CCTTACTTAC TCAAGTTCAT ATTTTTGGAT TTTCCTTAGC AAGTTTAGAT
ATTCGTCAAG AGAGTACAAG GCATAGTGAC GCTATTCAAG AGCTTACAAA TTATCTTGAT
TTATCTGTTC AATATGACCA AATGTCTGAG GAAGAGAAAA TTAAATGGCT TATAGACGAA
TTAAATACAA AAAGGCCTTT AATTCCATCT GACGTTCACT GGACAAAAAC CACAGAAGAA
ACATTTTCAG TTTTTAAAAT GGTTAAGAGA CTACAGCAAG AATTTGGAAG TCGCATTTGT
CATTCTTATG TAATTTCAAT GAGTCATAGT GCATCTGATT TGCTTGAAGT TCTCTTACTG
GCAAAAGAAA TGGGACTTCT TGATCAAAAT TCACAAAAGT CAAAATTATT AGTTGTTCCT
CTTTTTGAAA CTGTGGAAGA CCTTAAAAGA GCACCAGAAG TAATGGAAAA GTTGTTTAAA
TTAGATTTCT ATAGATCATT ATTGCCAAAA GTAGGAGAAT CTTTTAAACC TCTGCAAGAA
TTAATGCTTG GATATTCTGA TAGCAATAAA GATTCGGGGT TTGTTTCTAG TAATTGGGAA
ATTCATAGAG CCCAAATAGC TCTTCAAAAT CTTTCAAGTA GGAATAACAT ATTGTTAAGA
CTGTTTCATG GAAGAGGTGG TTCTGTAGGT AGAGGAGGAG GACCAGCCTA TCAGGCAATA
TTGGCTCAAC CAAGCGGTAC TTTAAAAGGG CGAATAAAAA TAACAGAACA AGGAGAAGTT
TTAGCTTCTA AATATAGTCT TCCGGAACTT GCTTTATACA ATCTTGAAAC TGTAACTACA
GCGGTAATTC AAAATAGCTT GGTAAATAAT AGACTTGACG CTACTCCAGA ATGGAATCAA
TTAATGTCTA GGTTGGCAGA AACATCAAGG TCTCACTACC GAAAATTAGT GCATGAGAAT
CCTGACTTGT TGAATTTCTT TCAAGAGGTC ACTCCAATAG AAGAAATAAG TAAATTACAG
ATATCCAGTA GGCCTGCAAG AAGAAAAAAA GGTGCAAAAG ATTTATCAAG TTTACGAGCT
ATTCCATGGG TATTTGGATG GACACAAAGT AGATTTCTTT TACCAAGTTG GTTTGGAGTA
GGTACTGCAT TGTCATCTGA ATTAAATTTA GATCCACAAC AAATTGAATT ACTAAGAGTC
TTGCATCAAA GATGGCCATT TTTTAGGATG CTTATATCTA AGGTAGAAAT GACATTATCT
AAGGTGGATT TAGAAGTGGC AAGATATTAT GTTGATACTC TTGGCAGTAA AGAAAATAAA
GATTCTTTTG ATAATATTTT TGAAGTAATT TCTAAAGAAT ATAATCTCAC GAAATCTTTA
ATACTTGAAA TTACTGGTAA AAATAAGCTT CTTGAATCTG ATAGAGACTT GAAGTCATCT
GTAAGCTTGA GAAATAAGAC AATCATTCCA TTGGGGTTTT TGCAAGTTTC ACTTTTAAGA
AGATTAAGAG ACCAGACAAG ACAACCCCCA ATAAGCGAGT TTTTTCTGGA TAAAGATGAA
TCTACAAGAG CTTACAGCAG AAGTGAACTA TTAAGGGGAG CACTTTTAAC TATTAATGGG
ATAGCAGCTG GTATGAGAAA TACAGGATAA
 
Protein sequence
MESFRQVKNN NVDLISNNDP LDKNRLLIED LWESVLREEC PDDQAERLIQ LKELSYSKQI 
DGDSSKTFKN EIVDIVNSMD LAESIAAARA FSLYFQLVNI LEQRVEEDRY IQSFTNKDVQ
KSPDNLDPFA PALARQNAPV TFRELFYRLR KLNVPPGKLE ELLQEMDIRL VFTAHPTEIV
RHTIRHKQTR VANLLKKIQI EQFLTKEEKN SLKTQLKEEV RLWWRTDELH QFKPSVLDEV
DYALHYFQQV LFNAMPQLRG RIAEALTDNY PDVQLPSESF CNFGSWVGSD RDGNPSVTPE
ITWRTACYQR QLMLERYIIA TSNLRDQLSV SMQWSQVSSS LLESLETDRV KFPEIYEARA
TRYRSEPYRL KLSYILEKLR LTQERNNLLA DSGWKFDLEG ETDNKNLDKV ESLYYKSVKE
FTYDLELIKN SLISTDLNCE SVNTLLTQVH IFGFSLASLD IRQESTRHSD AIQELTNYLD
LSVQYDQMSE EEKIKWLIDE LNTKRPLIPS DVHWTKTTEE TFSVFKMVKR LQQEFGSRIC
HSYVISMSHS ASDLLEVLLL AKEMGLLDQN SQKSKLLVVP LFETVEDLKR APEVMEKLFK
LDFYRSLLPK VGESFKPLQE LMLGYSDSNK DSGFVSSNWE IHRAQIALQN LSSRNNILLR
LFHGRGGSVG RGGGPAYQAI LAQPSGTLKG RIKITEQGEV LASKYSLPEL ALYNLETVTT
AVIQNSLVNN RLDATPEWNQ LMSRLAETSR SHYRKLVHEN PDLLNFFQEV TPIEEISKLQ
ISSRPARRKK GAKDLSSLRA IPWVFGWTQS RFLLPSWFGV GTALSSELNL DPQQIELLRV
LHQRWPFFRM LISKVEMTLS KVDLEVARYY VDTLGSKENK DSFDNIFEVI SKEYNLTKSL
ILEITGKNKL LESDRDLKSS VSLRNKTIIP LGFLQVSLLR RLRDQTRQPP ISEFFLDKDE
STRAYSRSEL LRGALLTING IAAGMRNTG
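
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, translate the DNA, and compute GC content. It assumes the gene sequence is given 5' to 3' on the coding strand and starts in frame. The codon table is built from the standard amino acid string, which translation table 11 shares with the standard code (table 11 differs only in its permitted start codons). All function names are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence above.
first_row = "ATGGAATCTT TTCGCCAGGT AAAAAATAAT AATGTGGATC TGATAAGTAA CAATGATCCA"
print(translate(first_row))  # MESFRQVKNNNVDLISNNDP
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein listing and the GC content field.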