Gene P9211_16951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_16951 
Symbolppc 
ID5730126 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1519264 
End bp1522287 
Gene Length3024 bp 
Protein Length1007 aa 
Translation table11 
GC content41% 
IMG OID641286077 
Productphosphoenolpyruvate carboxylase 
Protein accessionYP_001551580 
Protein GI159904236 
COG category[C] Energy production and conversion 
COG ID[COG2352] Phosphoenolpyruvate carboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCATGT CAAAATCAGA ATTAGGCAAT ACACCCTTAG AGCAATCTTC AGATTTGCAT 
TCATCTTTCC GCAATAACCA AGCAAGTACT ACGAAAAACT CTGGAAGCTT GTTGCAACAG
AGATTGGCAC TTGTTGAAGA TCTGTGGGAG ACAGTTTTAT GCAGTGAATG CCCATCAGAT
CAAATTAATC GGGTTCTTCG TTTAAAGAAA TTGAGTAACC CGAATAATTT TGCAGGAGAA
GAAGAGCAAA AAAACCCATT TAATGAAATT GTTTTGTTAA TTGAAGAGAT GGACTTGGCT
GAGTCAATTG CAGCCGCGCG TGCGTTCTCT TTATACTTTC AACTTGTCAA TATTTTGGAG
CAGAGAATTG AGGAAGACAC ATATCTAGAA ACTATTGGCA GGATCGAAGA AGATGCGTCT
AATCAATTTG ATCCATTTGC CCCTCCTCTT GCGAGTCAGA CTGCACCAGC AACTTTCAGT
CAATTATTTG AACGTTTAAA ACGTCTTAAT GTCCCTCCTG GGCAGCTAGA GGCATTAATG
CAAGAAATAG ATATACGTCT TGTCTTTACC GCCCATCCTA CAGAAATAGT CCGACATACC
GTTAGGCATA AGCAACGCAG GGTAGCCAAT TTGTTACAGC AATTTCACTC TGAAGCAGCA
ATTTCTTTCT CTCAGAAAGA AAGCTTGCGA CAACAGTTAG AAGAAGAAAT ACGTCTTTGG
TGGCGTACAG ATGAGCTACA TCAATTTAAA CCAACTGTTT TAGACGAGGT AGATTATGCA
CTGCATTATT TCCAACAAGT TCTATTTGAC GCGATGCCCC AATTGCGTCG TCGGATTAAT
GCAGCATTAG TTAAAAGTTA TCCTGATGTT GAGGTGCCTA GAGAATCTTT TTGCACATTT
GGTTCTTGGG TGGGGTCAGA TCGTGATGGC AACCCATCAG TTACTCCGGA AATAACTTGG
AGAGCAGCAT GTTATCAACG TCAATTAATG CTTGAGCGGT ATATCAACTC TGTTCAAGAC
CTTAGGGATC AATTGAGTAT TTCTATGCAA TGGAGTCAGG TTGGCTCACA ATTGTTAGAG
TCTTTAGAAA TGGATAGAGT TCAGTTTCCA GAGGTTTACG AGGAAAGAGC AGCGAGATAC
AGGCTAGAGC CATATCGACT AAAGATTAGC TATATTTTGC AACGACTTAA GCTAACTCAT
CAACGTAATC AAGAATTAGC AGCTGCAGGA TGGAAGATAG TTCCTGAAGC TATGAACTCC
TTGATTACTT CTGGTAAGCC ATTTGAGGAT TTGTATTACT GCTCTATAGC CGAGTTTCGT
AGTGATCTAG AGTTAATTCG CAATAGCCTG GTGGCAACAG ATCTTAGTTG TGAGTCGTTA
GATACGCTAT TGACTCAGGT ACATATTTTT GGCTTTTCTC TTGCTAGTCT GGATATTCGA
CAAGAGAGTA ACCGACATAG TGATGCTATA GACGAGGTTA CAAGATTTAT AGACCTCCCT
GTCATCTACT CAGAAATGGA TGAGAAAGAA AAAGTTCAAT GGTTAATGCA AGAATTAGAA
ACTCGTAGGC CATTAATTCC TTCTTCAGTT GATTGGTCAC CTTCTACTCA GGAAACAATA
TCTGTTTTTC AGATGCTCCA CCGTTTACAG GAAGAGTTTG GTAGTCGGAT TTGCAGATCT
TATGTAATTT CAATGAGTCA TTCAGTTTCT GATCTTTTAG AAGTACTTCT TCTCGCAAAA
GAAGCTGGAC TAGTAGATAT TTCGTCTGGT TCAGCTGATT TACTAGTGGT GCCTTTATTT
GAAACTGTTG AGGATCTGCA GCGAGCTCCC TCAGTGATGG AGGAACTCTT TAGCTCTAGT
TTTTATCTCA ACTTATTACC TCGTGTTGGA GAGAAACTTC AGCCTTTACA AGAACTAATG
CTTGGTTATT CAGATAGCAA TAAAGATTCT GGTTTTTTAT CAAGTAATTG GGAAATTCAT
CAGGCACAAA TTGCACTTCA GAACCTTGCA AGTAGTCATG GTGTAGCTTT ACGTTTATTT
CACGGTCGTG GAGGTTCTGT TGGGCGAGGT GGTGGCCCTG CTTATCAGGC AATCTTGGCT
CAACCTAGTG GCACTTTAAA AGGACGAATT AAGATTACTG AGCAAGGGGA AGTTCTTGCT
TCAAAATATA GCCTACCTGA ACTAGCAATG TATAACCTCG AAACTGTTAC GACAGCTGTG
CTGCAAAATA GTTTGGTTAC TAATCAGTTA GATGCAACCC CTAGTTGGAA TGAGCTGATG
ACTCGATTGG CAGCGCGTTC TAGGCGGCAT TACAGGTCGC TTGTACATGA CAACCCTGAC
TTGGTGCCAT TTTTTCAAGA AGTGACGCCA ATTGAAGAAA TCAGTAAATT GCAAATTTCT
AGTCGACCTA CTCGTAGAAA ATCAGGCAGT AAGGATTTAT CTAGTCTTAG GGCCATACCT
TGGGTCTTTG GTTGGACTCA GAGTAGGTTT CTTTTGCCCA GTTGGTTTGG AGTCGGGCAT
GCTTTGTTTA TAGAACTTGA AGAAGATCCT GCACAAATTG AGCTTTTGAA AATGCTTCAT
CAGCGTTGGC CTTTCTTTCG AATGTTGATT TCTAAGGTAG AGATGACTCT TTCGAAAGTA
GATCTAGAGG TGGCTAATCA CTATGTCACT AGTCTTGGTA GTAGGCAGAA TAAAGAAGCT
TTTAATAAAA TTTTTGAGGT GATTTCTGAT GAATACCATC TCACTAAGAG GTTGGTTTTA
AGAATAACTG GAAAATCAAA ACTTCTTAGC GCTGATCCTG CATTGCAAGC GTCTGTAGAG
CTTAGAAACC GTACGATTGT ACCTTTAGGG TTTCTTCAAG TGGCACTATT ACGTCGATTA
CGTGATCAAA AGCGTCAACC ACCTGTGAGC GAGGAGCTTT TTAATGAACG AGATCTTGCT
CGTACTTATA GTCGCAGTGA ATTATTGAGA GGAGCTTTGT TGACTATTAA TGGCATTGCT
GCGGGAATGC GAAATACAGG TTGA
 
Protein sequence
MIMSKSELGN TPLEQSSDLH SSFRNNQAST TKNSGSLLQQ RLALVEDLWE TVLCSECPSD 
QINRVLRLKK LSNPNNFAGE EEQKNPFNEI VLLIEEMDLA ESIAAARAFS LYFQLVNILE
QRIEEDTYLE TIGRIEEDAS NQFDPFAPPL ASQTAPATFS QLFERLKRLN VPPGQLEALM
QEIDIRLVFT AHPTEIVRHT VRHKQRRVAN LLQQFHSEAA ISFSQKESLR QQLEEEIRLW
WRTDELHQFK PTVLDEVDYA LHYFQQVLFD AMPQLRRRIN AALVKSYPDV EVPRESFCTF
GSWVGSDRDG NPSVTPEITW RAACYQRQLM LERYINSVQD LRDQLSISMQ WSQVGSQLLE
SLEMDRVQFP EVYEERAARY RLEPYRLKIS YILQRLKLTH QRNQELAAAG WKIVPEAMNS
LITSGKPFED LYYCSIAEFR SDLELIRNSL VATDLSCESL DTLLTQVHIF GFSLASLDIR
QESNRHSDAI DEVTRFIDLP VIYSEMDEKE KVQWLMQELE TRRPLIPSSV DWSPSTQETI
SVFQMLHRLQ EEFGSRICRS YVISMSHSVS DLLEVLLLAK EAGLVDISSG SADLLVVPLF
ETVEDLQRAP SVMEELFSSS FYLNLLPRVG EKLQPLQELM LGYSDSNKDS GFLSSNWEIH
QAQIALQNLA SSHGVALRLF HGRGGSVGRG GGPAYQAILA QPSGTLKGRI KITEQGEVLA
SKYSLPELAM YNLETVTTAV LQNSLVTNQL DATPSWNELM TRLAARSRRH YRSLVHDNPD
LVPFFQEVTP IEEISKLQIS SRPTRRKSGS KDLSSLRAIP WVFGWTQSRF LLPSWFGVGH
ALFIELEEDP AQIELLKMLH QRWPFFRMLI SKVEMTLSKV DLEVANHYVT SLGSRQNKEA
FNKIFEVISD EYHLTKRLVL RITGKSKLLS ADPALQASVE LRNRTIVPLG FLQVALLRRL
RDQKRQPPVS EELFNERDLA RTYSRSELLR GALLTINGIA AGMRNTG