Gene P9303_22741 details

Gene Information

Locus tag: P9303_22741
Symbol: ppc
ID: 4778665
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2009618
End bp: 2012626
Gene Length: 3009 bp
Protein Length: 1002 aa
Translation table: 11
GC content: 55%
IMG OID: 640087792
Product: phosphoenolpyruvate carboxylase
Protein accession: YP_001018274
Protein GI: 124023967
COG category: [C] Energy production and conversion
COG ID: [COG2352] Phosphoenolpyruvate carboxylase
TIGRFAM ID:
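
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues for an unspliced CDS whose length includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
gene_length = span_length(2009618, 2012626)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 3009 1002
```

Both results agree with the Gene Length and Protein Length fields of the record.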


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.114136
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAAAC CTGAGTCCAC TAGCGCATCG ATGCAGCAGT CTTCCGCCCA AAAACCTGAT 
TGCGATCAGC CCAGGGCAAT CGGGGAAGGG CAGCAGGCAG GACGCTTACT GCAGAACCGC
CTTGAACTGG TGGAAGACCT TTGGCAAACG GTGCTACGCA GCGAATGTCC ACCGGATCAG
GCAGAACGAC TACTTCGACT GAAGCAACTC AGTGAACCTT TGGCCCTTGA AGGTGCAGAT
GAGAACAGTG CCAGCACAGC AATCGTGCTG TTGATCAAAG AGATGGACCT GGCAGAAGCG
ATCACCGCAG CACGCGCCTT CTCCCTCTAT TTCCAACTGG TGAATATCCT CGAGCAACGG
ATCGAGGAAG ACAGCTACCT CGCAAGTATG TCTTCTGGCA AAGAAAACAA TCGCCAAGAC
AAACCCTATG ACCCCTTTGC TCCACCGCTC GCCACGCAGA CAGACCCAGC CACATTCAGC
GAGCTGTTCG AACGACTTCG CCTGCTCAAC GTGCCACCAG CTCAGCTGGA GACCCTTCTA
CAGGAGATGG ATATCCGCCT AGTTTTCACT GCTCATCCCA CTGAAATCGT CCGTCATACC
GTTCGCCATA AGCAACGCAA GGTTGCAAAC CTGTTGCAAC AGTTGCAATC AGATCCAACA
AAATCGTCAT CTGAAAAAGA AAGCCTGAGA CTGCAGCTTG AGGAGGAGAT CAGGCTCTGG
TGGCGGACCG ATGAGCTTCA TCAGTTCAAG CCCTCCGTTC TGGATGAGGT GGATTACGCC
CTCCACTATT TCCAACAGGT GCTTTTTGAT GCCATGCCCC AGCTACGGCG TCGCCTGATC
ACTGCAATGG CCGAGAGCTA TCCAGATGTA CACATTCCAC AAGCTGCCTT CTGCACTTTT
GGATCTTGGG TTGGATCAGA CCGAGACGGC AATCCTTCCG TTACACCAGA AATCACCTGG
CGCACCGCCT GTTATCAACG TCAGCTGATG CTGGAGCGCT ATGTCAATGC CGTTCAAAAA
CTCCGTGATC AACTCAGCAT CTCCATGCAA TGGAGCCAGG TAAGCACTCC ACTGTTGGAG
TCACTGGAAA TGGACCGGCT TCGATTCCCC GAGGTTTACG AGGAACGAGC AGCCCGCTAT
CGACTCGAGC CTTATCGCCT CAAACTCAGC TACACCCTGG AGAGGTTGAA ACTCACTCAA
GAACGCAACC AGCAATTGGC CGAAGCAGGC TGGCAAACCC CACCAGAAGG CCTCAACCCC
AGTCTCAATC TAATTAATGC GGGAGAAGCC CTTCACTACA AATCAGTAGC AGAATTCCGC
AGCGACCTGG AACTGATCCG CAACAGCCTG GTCAGCACAG ATCTGAGCTG TGAGCCTCTG
GATACCCTCT TAAATCAGGT CCACATCTTC GCTTTCTCAC TAGCCAGCCT CGACATCCGT
CAGGAGAGCA CTCGCCACAG CGATGCTCTC GACGAATTAA CCCGCTACCT AAACCTGCCT
AAGGCCTATG GCGACATGGC CGAAAATGAG CGGGTGCAAT GGTTAATAGA GGAACTACAG
ACTCGCCGCC CCCTGATCCC CTCTGCCGTT ATCTGGTCGC CCAGTACAGC AGAAACGGTG
GCCGTGTTTC GCATGCTGCA CCGACTTCAG GAGGAATTCG GCAGCCGAAT CTGCCGCACA
TATGTGATCT CAATGAGTCA CACAGTTTCC GATCTACTCG AGGTGCTGCT GCTAGCCAAA
GAAGCCGGCC TCGTGGATCC GGCGGCTGGC CATGCCGAAC TGCTTGTGGT GCCCCTATTC
GAAACCGTAG AGGATCTCCA GCGGGCTCCA GCAGTGATGG AGGCACTTTT AAGTTCACCT
GTCTACCGCA ATTTGCTCCC TCGTGTCAGC GAACAGGTCC AACCTCTGCA GGAGCTGATG
CTTGGCTATT CGGACAGCAA CAAAGACTCC GGATTTCTAT CCAGCAATTG GGAGATCCAT
CAAGCCCAAA TTGCCCTGCA AGATCTAGCT AACCGCCAGG GTGTGGCTCT GCGCCTTTTC
CATGGACGCG GTGGATCTGT TGGCAGAGGA GGTGGCCCTG CCTACCAAGC AATCCTTGCC
CAACCAAGCG GAACCGTGCG CGGCCGAATC AAGATCACTG AGCAGGGAGA AGTGCTGGCA
TCGAAATACA GCCTGCCCGA GCTAGCCCTT TACAACCTAG AAACATTCAC TACGGCCGTC
CTGCAAAACA GTCTGGTAAC CAACCAGCTG GATGCCACCC CAAGCTGGAA CCAACTGATG
ACCAGACTGG CTGGTCGTTC ACGTGAGCAC TATCGGGCCC TCGTCCACAA CAACCCTGAT
CTGGTGGCCT TCTTCCAACA GGTCACGCCG ATCGAAGAAA TCAGCAAATT GCAAATCTCC
AGCCGTCCTG CTAGACGCAA AAGCGGCGCC AAGGACCTCT CTAGCCTACG AGCGATCCCC
TGGGTATTTG GCTGGACGCA AAGTCGCTTT CTACTACCGA GCTGGTTTGG CGTGGGAACA
GCACTAGCCG CGGAAGTCGA ATCAGACGCT GACCAGCTTG ACCTTCTGCG CAGACTGCAC
CAACGCTGGC CATTCTTCCG AATGCTGATC TCCAAGGTGG AGATGACCCT TTCAAAAGTA
GACCTAGACC TGGCCCATCA CTACATGACT AGCCTTGGCA GCGAAGATTA CCGTGAAGCC
TTTAACCGTA TCTTCGAGAT CATCGAAACG GAGTACAGCC TCACCCGCCG GTTGGTCTTA
AACATCACCG GACAACCCAG GTTGCTCGGG GCCGATCCCG CCCTACAGCA ATCAGTAGAT
CTTCGCAATC GCACGATCGT GCCACTTGGT TTTCTGCAAG TGGCCCTGCT TCGCAAACTG
CGTGATCAGA ACCGGCAGCC ACCAATGAAC GAGGCTGGCG ATGGTCGCAC ATACAGCCGC
AGCGAACTTC TAAGAGGCGC ACTGCTCACC ATTAACGGCA TTGCTGCAGG CATGCGCAAC
ACCGGTTGA
 
Protein sequence
MAKPESTSAS MQQSSAQKPD CDQPRAIGEG QQAGRLLQNR LELVEDLWQT VLRSECPPDQ 
AERLLRLKQL SEPLALEGAD ENSASTAIVL LIKEMDLAEA ITAARAFSLY FQLVNILEQR
IEEDSYLASM SSGKENNRQD KPYDPFAPPL ATQTDPATFS ELFERLRLLN VPPAQLETLL
QEMDIRLVFT AHPTEIVRHT VRHKQRKVAN LLQQLQSDPT KSSSEKESLR LQLEEEIRLW
WRTDELHQFK PSVLDEVDYA LHYFQQVLFD AMPQLRRRLI TAMAESYPDV HIPQAAFCTF
GSWVGSDRDG NPSVTPEITW RTACYQRQLM LERYVNAVQK LRDQLSISMQ WSQVSTPLLE
SLEMDRLRFP EVYEERAARY RLEPYRLKLS YTLERLKLTQ ERNQQLAEAG WQTPPEGLNP
SLNLINAGEA LHYKSVAEFR SDLELIRNSL VSTDLSCEPL DTLLNQVHIF AFSLASLDIR
QESTRHSDAL DELTRYLNLP KAYGDMAENE RVQWLIEELQ TRRPLIPSAV IWSPSTAETV
AVFRMLHRLQ EEFGSRICRT YVISMSHTVS DLLEVLLLAK EAGLVDPAAG HAELLVVPLF
ETVEDLQRAP AVMEALLSSP VYRNLLPRVS EQVQPLQELM LGYSDSNKDS GFLSSNWEIH
QAQIALQDLA NRQGVALRLF HGRGGSVGRG GGPAYQAILA QPSGTVRGRI KITEQGEVLA
SKYSLPELAL YNLETFTTAV LQNSLVTNQL DATPSWNQLM TRLAGRSREH YRALVHNNPD
LVAFFQQVTP IEEISKLQIS SRPARRKSGA KDLSSLRAIP WVFGWTQSRF LLPSWFGVGT
ALAAEVESDA DQLDLLRRLH QRWPFFRMLI SKVEMTLSKV DLDLAHHYMT SLGSEDYREA
FNRIFEIIET EYSLTRRLVL NITGQPRLLG ADPALQQSVD LRNRTIVPLG FLQVALLRKL
RDQNRQPPMN EAGDGRTYSR SELLRGALLT INGIAAGMRN TG
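
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes translation table 11 (bacterial, archaeal and plant plastid code), whose codon-to-amino-acid assignments match the standard code and which differs only in the set of permitted start codons; since this gene starts with ATG, that difference does not matter here. The `translate` helper is hypothetical and written for illustration only; it ignores the whitespace used to group the sequence into blocks of ten.

```python
from itertools import product

# Codon assignments in TCAG order; shared by the standard code and table 11.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first in-frame stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGGCCAAAC CTGAGTCCAC TAGCGCATCG"))  # MAKPESTSAS
print(translate("ACCGGTTGA"))                         # TG
```

The two outputs correspond to the first ten and last two residues of the protein sequence listed above.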