Gene NATL1_20211 details

Gene Information

Locus tag: NATL1_20211
Symbol: ppc
ID: 4779666
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1663493
End bp: 1666477
Gene Length: 2985 bp
Protein Length: 994 aa
Translation table: 11
GC content: 39%
IMG OID: 640085314
Product: phosphoenolpyruvate carboxylase
Protein accession: YP_001015841
Protein GI: 124026726
COG category: [C] Energy production and conversion
COG ID: [COG2352] Phosphoenolpyruvate carboxylase
TIGRFAM ID:
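
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The Python sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 1663493
END_BP = 1666477
GENE_LENGTH_BP = 2985
PROTEIN_LENGTH_AA = 994


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Amino acid count, assuming one terminal stop codon is excluded."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions, the listed gene length and protein length agree with the listed coordinates.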


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGAAAA ATCCTTCTAA CGAAAATATC TCTAATCACT CGACAGTATG TGTTGAGGAT 
CAAGATCCTG GATCTTTATT GCAACAAAGG CTTGAACTAG TGGAGGATCT ATGGAAAACC
GTTCTCAAAA GTGAATGTCC ACCTGATCAA ACAGAGAGAT TATTGCGATT AAAACAATTA
AGTGATCCTA GTAAGTCAAA TCAAGACAAT TCCTCAAAGG CAATAGTCCA ATTGATTACA
AAAATGGATT TGGCTGAGGC GATATCTGCT GCTAGGGCTT TTTCTCTTTA TTTTCAGTTG
GTAAACATTC TGGAGCAACG CATAGAAGAG GATAGTTATT TAGAAAGCAT AGAAAAAGGG
AAGTTAGACA CCAGTAATTA TAAAATAGAT CCATTCGCTC CAGCTTTAGC TAGTCAGACT
GCTCCAGCAA CTTTTACACA ATTATTTGAG CGATTACGTC GTTTGAATGT CCCTCCAGCT
CAGCTTGATG GTTTGATGAG AGAAATGGAT ATTCGTCTTG TTTTTACAGC TCATCCAACT
GAAATAGTTA GACACACGGT TCGCCATAAG CAACGCAGGG TTGCCACTCT TTTGCAGCAA
CTTCAATCTA ATAGCTTGAT TTCCGAATCA GAGAAAGAAA TATTTAGGTT GCAGCTAGAG
GAGGAGATAA GACTTTGGTG GAGAACTGAT GAGCTTCATC AGTTTAAACC GACTGTCCTT
GATGAGGTTG ACTATGCCCT TCATTATTTT CAGCAGGTTT TGTTTGATGC AATGCCTCAA
TTAAGAAGGA GACTGACTAC CGCACTTGCT TCAAGTTATC CAGACGTAGA GATCCCTAAT
GAAGCTTTTT GCACATTTGG CTCTTGGGTA GGTTCAGATC GTGATGGGAA TCCGTCAGTT
ACTCCTGAAA TCACATGGAG AACAGCGTGT TACCAAAGAC AATTAATGTT GGATAGGTAT
ATTGCATCAG TTCAAGATCT AAGAGACCAA CTCAGTATAT CTATGCAATG GAGTCAAGTA
AGTTCTCCTT TGTTAGAGTC ATTGGAAATG GACAGAGTTC GTTTCCCTGA GGTTTATGAG
GAAAGGGCTG CAAGATATAG ACTGGAACCT TATCGCTTGA AGCTTAGCTA TACACTTGAG
AGGCTACGAC TTACTCAACT ACGTAATAAG CAGCTAGCGG ATGCTGGATG GCAATTCTCA
CCCGATGGGA AGCCGCAAAT ATCTACTAAT AATAGTTTTG ATGAAGTACT CCACTACAAA
TCTGTAGAAG AATTAAAAAA TGAATTAGAG CTTATTAGAA ATAGTTTGGT TAGTACAGAT
CTTACTTGTG AACCATTAGA TACTTTGCTA AATCAAGTTC ATATTTTTGG GTTCTCGTTG
GCTAGTTTAG ACATTAGACA AGAAAGCACA CGACATAGTG ATGCGTTGGA TGAGCTCACT
CGCTATTTAG ATCTCCCTGA GTCGTATGGA GTGATGAGCG AGGAAAGTCG AGTTCAATGG
TTGATGAAAG AATTAAGAAC TCGGAGACCA CTTATTCCGC CTTCTTTTGA GTGGTCTAAA
AGTACTCAAG AAACTATCTC AGTTTTTCAT ATGCTTCATA GGCTTCAGAA AGAATTTGGT
ACTCGTATAT GTCGCTCGTA TGTAATTTCG ATGAGTCATA CGGCATCAGA TTTATTAGAA
GTTCTTCTTT TAGCTAAAGA GTCGGGTTTG ATTGATCCAA CTTTAGGAGC TTCTGATCTT
CTTGTTGTTC CATTATTTGA AACGGTTGAG GACTTACAAC ATGCTCCTTC TGTAATGGAG
TCGTTACTAC AATCTGATGT TTATCGCGAA TTACTTCCAC GAGTAGGAGA GAAAAAACAA
CCGCTTCAAG AACTTATGCT GGGATATTCC GATAGTAATA AGGATTCTGG TTTTCTCTCA
AGTAATTGGG AAATTCATAA GGCCCAAATA GCACTCCAAG ACCTAGCTAG TAGACAAGGA
ATAGCATTAC GTATTTTTCA TGGTAGAGGT GGGTCCGTAG GAAGAGGCGG TGGACCAGCT
TATCAAGCTA TTTTGGCTCA ACCTAGTGGT ACACTTCAGG GACGTATAAA AATAACAGAG
CAAGGGGAAG TACTTGCTTC AAAATATAGT CTTCCAGAAT TAGCTTTATA TAATCTGGAA
ACTGTAACCA CTGCAGTTAT TCAAAATAGC TTGGTTACCA ATAAATTGGA TGCTACGCCA
AGTTGGAATG AATTGATGAC CAGACTTGCA GCTCGTTCAA GGGAGCATTA CCGAGCTTTA
GTTCATGATA ATCCAGATTT AGTTCAATTT TTTCAGGTAG TTACTCCAAT AGAAGAGATA
AGTAAGTTGC AAATTTCTAG TCGTCCTGCT CGACGAAAGA GTGGTGCAAA AGACTTATCA
AGTCTTCGAG CTATCCCATG GGTCTTTGGT TGGACTCAAA GTCGTTTCCT TTTGCCAAGT
TGGTTTGGTG TTGGTACGGC TTTAGCTACT GAATTAAAGG CTGACCCCGA CCAAATGGAG
ATGTTGCGAA TGTTGAATCA GAGATGGCCA TTCTTTAGAA TGTTGATATC TAAAGTAGAG
ATGACACTTT CAAAAGTTGA TTTAGATGTT GCCCATCATT ATGTGGTTAG TTTGGGTGGA
AGTGATGATC GGGATGCTTT CGCTAGCATT TTCGATATTA TCTCAAGCGA ATACAGCTTG
ACTAAGAAAT TAATTTTAGA AATTACTGGC AAGTCAAAAC TATTAAGTGC AGACCCTGCT
TTGCAGTTGT CTGTCAACCT GAGAAATAGG ACTATTGTCC CTTTAGGATT TTTACAAGTT
GCTCTTCTCA AGCGATTAAG AGATCAGAAT CGTCAACCAC CAATTAGTGA AGATGTAAGT
ATTGACTCTA CTCAAAGTTC TCGTACATAT AGCCGTAGTG AATTATTGCG TGGTGCATTG
TTGACTATCA ATGGTATCGC TGCAGGTATG AGAAACACAG GATGA
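
The gene sequence is displayed in space-separated blocks of ten bases, so it needs cleaning before any computation. The Python sketch below shows one way to strip the display formatting and compute a GC percentage, the quantity reported as "GC content" in the record. The helper names are hypothetical, and the sample input is only the first display row, not the whole gene.

```python
def clean_sequence(raw):
    """Remove display spaces and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First display row of the gene sequence, used only as sample input.
first_row = "ATGTTGAAAA ATCCTTCTAA CGAAAATATC TCTAATCACT CGACAGTATG TGTTGAGGAT"
cleaned = clean_sequence(first_row)
print(len(cleaned))  # 60
print(round(gc_percent(cleaned), 1))
```

Passing the full gene sequence through `clean_sequence` should give a string whose length matches the gene length listed in the record.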
 
Protein sequence
MLKNPSNENI SNHSTVCVED QDPGSLLQQR LELVEDLWKT VLKSECPPDQ TERLLRLKQL 
SDPSKSNQDN SSKAIVQLIT KMDLAEAISA ARAFSLYFQL VNILEQRIEE DSYLESIEKG
KLDTSNYKID PFAPALASQT APATFTQLFE RLRRLNVPPA QLDGLMREMD IRLVFTAHPT
EIVRHTVRHK QRRVATLLQQ LQSNSLISES EKEIFRLQLE EEIRLWWRTD ELHQFKPTVL
DEVDYALHYF QQVLFDAMPQ LRRRLTTALA SSYPDVEIPN EAFCTFGSWV GSDRDGNPSV
TPEITWRTAC YQRQLMLDRY IASVQDLRDQ LSISMQWSQV SSPLLESLEM DRVRFPEVYE
ERAARYRLEP YRLKLSYTLE RLRLTQLRNK QLADAGWQFS PDGKPQISTN NSFDEVLHYK
SVEELKNELE LIRNSLVSTD LTCEPLDTLL NQVHIFGFSL ASLDIRQEST RHSDALDELT
RYLDLPESYG VMSEESRVQW LMKELRTRRP LIPPSFEWSK STQETISVFH MLHRLQKEFG
TRICRSYVIS MSHTASDLLE VLLLAKESGL IDPTLGASDL LVVPLFETVE DLQHAPSVME
SLLQSDVYRE LLPRVGEKKQ PLQELMLGYS DSNKDSGFLS SNWEIHKAQI ALQDLASRQG
IALRIFHGRG GSVGRGGGPA YQAILAQPSG TLQGRIKITE QGEVLASKYS LPELALYNLE
TVTTAVIQNS LVTNKLDATP SWNELMTRLA ARSREHYRAL VHDNPDLVQF FQVVTPIEEI
SKLQISSRPA RRKSGAKDLS SLRAIPWVFG WTQSRFLLPS WFGVGTALAT ELKADPDQME
MLRMLNQRWP FFRMLISKVE MTLSKVDLDV AHHYVVSLGG SDDRDAFASI FDIISSEYSL
TKKLILEITG KSKLLSADPA LQLSVNLRNR TIVPLGFLQV ALLKRLRDQN RQPPISEDVS
IDSTQSSRTY SRSELLRGAL LTINGIAAGM RNTG
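
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The Python sketch below is a minimal illustration, not the database's own method. It builds the codon table from the compact 64-letter amino acid string used for NCBI translation table 11, whose amino acid assignments match the standard code. It does not handle the alternative start codons that table 11 allows, which is harmless here because the gene begins with ATG. The function name `translate` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TTT, TTC, TTA, TTG, TCT, ... order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop display spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-TCAG input
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence.
print(translate("ATGTTGAAAA ATCCTTCTAA CGAAAATATC"))  # MLKNPSNENI
print(translate("AGAAACACAG GATGA"))                  # RNTG
```

The two sample outputs correspond to the first ten and last four residues of the protein sequence listed above.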