Gene Information |
Locus tag | Synpcc7942_2252 |
Symbol | |
ID | 3773908 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | - |
Start bp | 2318927 |
End bp | 2321980 |
Gene Length | 3054 bp |
Protein Length | 1017 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637800699 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_401269 |
Protein GI | 81301061 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
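The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is complete with a single terminating stop codon. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2318927
END_BP = 2321980
REPORTED_GENE_LENGTH = 3054      # bp
REPORTED_PROTEIN_LENGTH = 1017   # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == REPORTED_GENE_LENGTH)
print(computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions both comparisons agree with the table: 2321980 - 2318927 + 1 = 3054 bp, and 3054 / 3 - 1 = 1017 aa.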
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0123033 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAATTACT GCCAGAACGC TCGAACTGCC ATGAGTGCTG CTCTCCAGTC ATCCGACGAT GCTTTCCGAA CCGTTTCGAG TCCCCTCGCC ACGGATTTGG ATCTGTCGTC TCCGCTGGAG TTTTTCCTTC GCCATCGCTT GACGGTGGTT GAAGAACTCT GGGAAGTGGT TTTGCGCCAA GAGTGCGGCC AAGAGCTGGT CGATATTCTG ACTCAGCTGC GTGACTTGAC CTCGCCGGAA GGCCAAGCCC CAGAAGTGGG CGGCGAAGCC TTGGTTCAGG TGATTGAAAC CCTAGAGTTG AGCGATGCGA TTCGGGCTGC CCGTGCCTTT GCGCTCTACT TTCAGCTGAT CAATATTGTT GAGCAGCACT ACGAGCAAAC TCAATATCAA CTCGCCTACG AGCGATCGCG GTTGGAACCC TTGCCAGGAC CAGATGAAAG TCCGGAGGGA TTGCACACCA TTGAAATTCC TCAGCATCAG CTCGATCCCT TTGCTGCGGT GATTCCGCTC AACCAAGATC CGGCAACCTT CCAAACGCTG TTCCCGCGCC TGCGCCAGCT CAATGTGCCG CCGCAAATGA TCCAAGAGCT GACCGATCGC CTCGATATTC GGCTGGTTTT CACCGCTCAC CCGACGGAAA TTGTCCGCCA CACGATTCGC GACAAACAAC GCCGAATTGC CTACCTGCTG CGGCAACTGG ATGAGCTCGA AACAGGCAAA AACCGAGGCT TTCGAGAGCT TGAAGCGCAG AATATTCGTC AGCAGCTGAC CGAGGAGATT CGGCTCTGGT GGCGGACGGA TGAGCTCCAC CAGTTCAAGC CAACGGTGTT GGATGAGGTG GACTACGCGC TCCACTACTT CCAAGAAGTC CTCTTTGAGG CCATTCCTCT GCTCTATCAG CGCTTTCGGC TCGCGCTGCA GGGGACTTTC CCCGACCTAC AACCGCCCCG CTACAACTTC TGCCAGTTCG GCTCTTGGGT CGGCTCCGAT CGCGATGGCA ATCCTTCAGT GACCTCTGCC GTCACTTGGC AAACCGCTTG CTATCAGCGC AGTCTCGTCC TCGATCGCTA CATCACAGCG GTTGAACATC TCCGCAATGT GCTCAGCCTC TCGATGCACT GGAGCGAGGT GCTGCCGGAG TTGCTCAGCT CGTTGGAACA GGAGAGCATG CTCTTCCCGG AGACCTATGA GCAGCTAGCG GTCCGCTATC GCCAAGAGCC CTATCGCCTC AAGCTCTCCT ATATTCTGGA GCGCCTGCAC AACACCCGCG ATCGCAATAC CCGCCTCCAA CAGCAGCAAG AAAAAGATCC CACCACGCCC CTGCCCGAAT ATCGGGATGG CACCCTCTAC CAGGCTGGTA CGGCCTTTCT CGAAGATCTC AAGCTGATTC AGCACAACCT TAAGCAGACG GGACTGAGCT GTTACGAGCT AGAGAAGTTG ATCTGCCAGG TCGAGATCTT TGGTTTCAAC CTGGTCCATC TCGACATTCG CCAAGAAAGC TCGCGCCATT CCGACGCGAT CAACGAAATC TGTGAATACC TCCAAATTCT TCCCCAGCCC TACAACGAGC TGAGCGAAGC AGAACGAACT GCCTGGCTGG TTCAAGAGCT GAAAACCCGT CGGCCGCTGG TACCAGCGCG CATGCCGTTC TCAGAATCGA CCCGCGAGAT CATTGAAACC CTGCGGATGG TCAAGCAGCT ACAGGAAGAA TTTGGGGAGG CGGCTTGCCA AACCTACATC ATCAGCATGA GCCGCGAGCT GAGCGACCTG CTGGAAGTGC TGCTGCTGGC CAAGGAGGTT 
GGTCTCTACG ACCCAGTCAC CGGCAAGAGT TCGCTTCAGG TGATTCCGCT GTTTGAAACT GTGGAGGACT TACAAAATGC CCCGCGGGTG ATGACGGCGC TGTTTGAGCT GCCCTTCTAC ACCCAGCTCA ACCCCACCCA GTCTGAACCG CTGCAGGAAG TGATGCTGGG GTATTCCGAC AGTAACAAGG ACTCGGGCTT CCTCAGCAGT AACTGGGAGA TCCACAAGGC CCAGAAAGCC CTAGGGACGG TAGCCCGCGA CCACCGCGTC AAGCTGCGGA TCTTCCACGG CCGCGGGGGC TCCGTCGGTC GAGGTGGTGG CCCTGCCTAC GAGGCGATCT TGGCCCAGCC GGGTCGCACC ACAGATGGCC GAATCAAGAT TACGGAACAG GGCGAGGTCT TGGCTTCGAA ATACGCCCTG CCCGAACTGG CGCTCTATAA CCTTGAGACG ATCACGACGG CGGTGATTCA GTCCAGCCTG CTGGGTAGCG GCTTTGATGA CATTGAGCCG TGGAACCAAA TTATGGAAGA GTTGGCGGCG CGATCGCGGC GACATTACCG CGCTTTGGTG TACGAGCAGC CCGACCTGGT TGACTTCTTC AATCAGGTAA CGCCGATTGA GGAGATCAGC AAACTGCAAA TCAGCTCGCG ACCGGCTCGA CGCAAAACCG GCAAGCGCGA TCTGGGCAGT CTACGTGCCA TCCCCTGGGT CTTTAGCTGG ACGCAGAGTC GTTTTCTGCT GCCCTCTTGG TATGGCGTCG GCACAGCACT TCAGGAGTTT TTGCAGGAGC GCCCGGAGCA GAACCTCAAC CTGCTGCGCT ACTTCTACGA GAAGTGGCCG TTCTTCCGCA TGGTGATCTC GAAGGTCGAG ATGACCCTAG CGAAGGTCGA TTTGCAGATT GCTCATCACT ACGTGCATGA GCTGGCCAAT CCTGAGGATC AAGAGCGGTT TGAACGAGTG TTCAGCCAAA TCGCTGCAGA GTTTCAGCTG ACTTGTCATC TCGTGTTGAC GATTACCAAC CACGGTCGCT TGCTGGATGG CGACCCCGAA CTGCAGCGAT CGGTGCAGCT GCGCAACGGT ACGATCGTGC CCCTCGGCTT CTTGCAAGTC GCCCTGCTTA AACGCCTGCG GCAGTATCGC CAGCAAACGG AAACGACGGG ATTGATGCGA TCGCGCTATA GCAAAGGGGA ACTGCTGCGC GGAGCATTGC TGACGATCAA CGGCATTGCG GCTGGCATGC GCAATACAGG TTGA
Protein sequence | MNYCQNARTA MSAALQSSDD AFRTVSSPLA TDLDLSSPLE FFLRHRLTVV EELWEVVLRQ ECGQELVDIL TQLRDLTSPE GQAPEVGGEA LVQVIETLEL SDAIRAARAF ALYFQLINIV EQHYEQTQYQ LAYERSRLEP LPGPDESPEG LHTIEIPQHQ LDPFAAVIPL NQDPATFQTL FPRLRQLNVP PQMIQELTDR LDIRLVFTAH PTEIVRHTIR DKQRRIAYLL RQLDELETGK NRGFRELEAQ NIRQQLTEEI RLWWRTDELH QFKPTVLDEV DYALHYFQEV LFEAIPLLYQ RFRLALQGTF PDLQPPRYNF CQFGSWVGSD RDGNPSVTSA VTWQTACYQR SLVLDRYITA VEHLRNVLSL SMHWSEVLPE LLSSLEQESM LFPETYEQLA VRYRQEPYRL KLSYILERLH NTRDRNTRLQ QQQEKDPTTP LPEYRDGTLY QAGTAFLEDL KLIQHNLKQT GLSCYELEKL ICQVEIFGFN LVHLDIRQES SRHSDAINEI CEYLQILPQP YNELSEAERT AWLVQELKTR RPLVPARMPF SESTREIIET LRMVKQLQEE FGEAACQTYI ISMSRELSDL LEVLLLAKEV GLYDPVTGKS SLQVIPLFET VEDLQNAPRV MTALFELPFY TQLNPTQSEP LQEVMLGYSD SNKDSGFLSS NWEIHKAQKA LGTVARDHRV KLRIFHGRGG SVGRGGGPAY EAILAQPGRT TDGRIKITEQ GEVLASKYAL PELALYNLET ITTAVIQSSL LGSGFDDIEP WNQIMEELAA RSRRHYRALV YEQPDLVDFF NQVTPIEEIS KLQISSRPAR RKTGKRDLGS LRAIPWVFSW TQSRFLLPSW YGVGTALQEF LQERPEQNLN LLRYFYEKWP FFRMVISKVE MTLAKVDLQI AHHYVHELAN PEDQERFERV FSQIAAEFQL TCHLVLTITN HGRLLDGDPE LQRSVQLRNG TIVPLGFLQV ALLKRLRQYR QQTETTGLMR SRYSKGELLR GALLTINGIA AGMRNTG
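The relationship between the two sequences above can be illustrated by translating the start of the gene sequence. The sketch below is a minimal example, not the database's own method. It uses only the first 30 bases of the gene sequence, which is listed 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand). The helper name `translate` and the compact codon-table encoding are assumptions made for illustration. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons, so the standard assignments are used here.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order; '*' marks a stop codon.
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence and the first
# 10-residue block of the protein sequence, copied from the record above.
gene_prefix = "ATGAATTACT GCCAGAACGC TCGAACTGCC"
protein_prefix = "MNYCQNARTA"

print(translate(gene_prefix) == protein_prefix)
```

The same function could be applied to the full gene sequence to compare against the full protein sequence. The final codon of the gene, TGA, is a stop codon and does not contribute a residue.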