Gene Synpcc7942_2252 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_2252 
Symbol 
ID3773908 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp2318927 
End bp2321980 
Gene Length3054 bp 
Protein Length1017 aa 
Translation table11 
GC content57% 
IMG OID637800699 
Productphosphoenolpyruvate carboxylase 
Protein accessionYP_401269 
Protein GI81301061 
COG category[C] Energy production and conversion 
COG ID[COG2352] Phosphoenolpyruvate carboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0123033 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTACT GCCAGAACGC TCGAACTGCC ATGAGTGCTG CTCTCCAGTC ATCCGACGAT 
GCTTTCCGAA CCGTTTCGAG TCCCCTCGCC ACGGATTTGG ATCTGTCGTC TCCGCTGGAG
TTTTTCCTTC GCCATCGCTT GACGGTGGTT GAAGAACTCT GGGAAGTGGT TTTGCGCCAA
GAGTGCGGCC AAGAGCTGGT CGATATTCTG ACTCAGCTGC GTGACTTGAC CTCGCCGGAA
GGCCAAGCCC CAGAAGTGGG CGGCGAAGCC TTGGTTCAGG TGATTGAAAC CCTAGAGTTG
AGCGATGCGA TTCGGGCTGC CCGTGCCTTT GCGCTCTACT TTCAGCTGAT CAATATTGTT
GAGCAGCACT ACGAGCAAAC TCAATATCAA CTCGCCTACG AGCGATCGCG GTTGGAACCC
TTGCCAGGAC CAGATGAAAG TCCGGAGGGA TTGCACACCA TTGAAATTCC TCAGCATCAG
CTCGATCCCT TTGCTGCGGT GATTCCGCTC AACCAAGATC CGGCAACCTT CCAAACGCTG
TTCCCGCGCC TGCGCCAGCT CAATGTGCCG CCGCAAATGA TCCAAGAGCT GACCGATCGC
CTCGATATTC GGCTGGTTTT CACCGCTCAC CCGACGGAAA TTGTCCGCCA CACGATTCGC
GACAAACAAC GCCGAATTGC CTACCTGCTG CGGCAACTGG ATGAGCTCGA AACAGGCAAA
AACCGAGGCT TTCGAGAGCT TGAAGCGCAG AATATTCGTC AGCAGCTGAC CGAGGAGATT
CGGCTCTGGT GGCGGACGGA TGAGCTCCAC CAGTTCAAGC CAACGGTGTT GGATGAGGTG
GACTACGCGC TCCACTACTT CCAAGAAGTC CTCTTTGAGG CCATTCCTCT GCTCTATCAG
CGCTTTCGGC TCGCGCTGCA GGGGACTTTC CCCGACCTAC AACCGCCCCG CTACAACTTC
TGCCAGTTCG GCTCTTGGGT CGGCTCCGAT CGCGATGGCA ATCCTTCAGT GACCTCTGCC
GTCACTTGGC AAACCGCTTG CTATCAGCGC AGTCTCGTCC TCGATCGCTA CATCACAGCG
GTTGAACATC TCCGCAATGT GCTCAGCCTC TCGATGCACT GGAGCGAGGT GCTGCCGGAG
TTGCTCAGCT CGTTGGAACA GGAGAGCATG CTCTTCCCGG AGACCTATGA GCAGCTAGCG
GTCCGCTATC GCCAAGAGCC CTATCGCCTC AAGCTCTCCT ATATTCTGGA GCGCCTGCAC
AACACCCGCG ATCGCAATAC CCGCCTCCAA CAGCAGCAAG AAAAAGATCC CACCACGCCC
CTGCCCGAAT ATCGGGATGG CACCCTCTAC CAGGCTGGTA CGGCCTTTCT CGAAGATCTC
AAGCTGATTC AGCACAACCT TAAGCAGACG GGACTGAGCT GTTACGAGCT AGAGAAGTTG
ATCTGCCAGG TCGAGATCTT TGGTTTCAAC CTGGTCCATC TCGACATTCG CCAAGAAAGC
TCGCGCCATT CCGACGCGAT CAACGAAATC TGTGAATACC TCCAAATTCT TCCCCAGCCC
TACAACGAGC TGAGCGAAGC AGAACGAACT GCCTGGCTGG TTCAAGAGCT GAAAACCCGT
CGGCCGCTGG TACCAGCGCG CATGCCGTTC TCAGAATCGA CCCGCGAGAT CATTGAAACC
CTGCGGATGG TCAAGCAGCT ACAGGAAGAA TTTGGGGAGG CGGCTTGCCA AACCTACATC
ATCAGCATGA GCCGCGAGCT GAGCGACCTG CTGGAAGTGC TGCTGCTGGC CAAGGAGGTT
GGTCTCTACG ACCCAGTCAC CGGCAAGAGT TCGCTTCAGG TGATTCCGCT GTTTGAAACT
GTGGAGGACT TACAAAATGC CCCGCGGGTG ATGACGGCGC TGTTTGAGCT GCCCTTCTAC
ACCCAGCTCA ACCCCACCCA GTCTGAACCG CTGCAGGAAG TGATGCTGGG GTATTCCGAC
AGTAACAAGG ACTCGGGCTT CCTCAGCAGT AACTGGGAGA TCCACAAGGC CCAGAAAGCC
CTAGGGACGG TAGCCCGCGA CCACCGCGTC AAGCTGCGGA TCTTCCACGG CCGCGGGGGC
TCCGTCGGTC GAGGTGGTGG CCCTGCCTAC GAGGCGATCT TGGCCCAGCC GGGTCGCACC
ACAGATGGCC GAATCAAGAT TACGGAACAG GGCGAGGTCT TGGCTTCGAA ATACGCCCTG
CCCGAACTGG CGCTCTATAA CCTTGAGACG ATCACGACGG CGGTGATTCA GTCCAGCCTG
CTGGGTAGCG GCTTTGATGA CATTGAGCCG TGGAACCAAA TTATGGAAGA GTTGGCGGCG
CGATCGCGGC GACATTACCG CGCTTTGGTG TACGAGCAGC CCGACCTGGT TGACTTCTTC
AATCAGGTAA CGCCGATTGA GGAGATCAGC AAACTGCAAA TCAGCTCGCG ACCGGCTCGA
CGCAAAACCG GCAAGCGCGA TCTGGGCAGT CTACGTGCCA TCCCCTGGGT CTTTAGCTGG
ACGCAGAGTC GTTTTCTGCT GCCCTCTTGG TATGGCGTCG GCACAGCACT TCAGGAGTTT
TTGCAGGAGC GCCCGGAGCA GAACCTCAAC CTGCTGCGCT ACTTCTACGA GAAGTGGCCG
TTCTTCCGCA TGGTGATCTC GAAGGTCGAG ATGACCCTAG CGAAGGTCGA TTTGCAGATT
GCTCATCACT ACGTGCATGA GCTGGCCAAT CCTGAGGATC AAGAGCGGTT TGAACGAGTG
TTCAGCCAAA TCGCTGCAGA GTTTCAGCTG ACTTGTCATC TCGTGTTGAC GATTACCAAC
CACGGTCGCT TGCTGGATGG CGACCCCGAA CTGCAGCGAT CGGTGCAGCT GCGCAACGGT
ACGATCGTGC CCCTCGGCTT CTTGCAAGTC GCCCTGCTTA AACGCCTGCG GCAGTATCGC
CAGCAAACGG AAACGACGGG ATTGATGCGA TCGCGCTATA GCAAAGGGGA ACTGCTGCGC
GGAGCATTGC TGACGATCAA CGGCATTGCG GCTGGCATGC GCAATACAGG TTGA
 
Protein sequence
MNYCQNARTA MSAALQSSDD AFRTVSSPLA TDLDLSSPLE FFLRHRLTVV EELWEVVLRQ 
ECGQELVDIL TQLRDLTSPE GQAPEVGGEA LVQVIETLEL SDAIRAARAF ALYFQLINIV
EQHYEQTQYQ LAYERSRLEP LPGPDESPEG LHTIEIPQHQ LDPFAAVIPL NQDPATFQTL
FPRLRQLNVP PQMIQELTDR LDIRLVFTAH PTEIVRHTIR DKQRRIAYLL RQLDELETGK
NRGFRELEAQ NIRQQLTEEI RLWWRTDELH QFKPTVLDEV DYALHYFQEV LFEAIPLLYQ
RFRLALQGTF PDLQPPRYNF CQFGSWVGSD RDGNPSVTSA VTWQTACYQR SLVLDRYITA
VEHLRNVLSL SMHWSEVLPE LLSSLEQESM LFPETYEQLA VRYRQEPYRL KLSYILERLH
NTRDRNTRLQ QQQEKDPTTP LPEYRDGTLY QAGTAFLEDL KLIQHNLKQT GLSCYELEKL
ICQVEIFGFN LVHLDIRQES SRHSDAINEI CEYLQILPQP YNELSEAERT AWLVQELKTR
RPLVPARMPF SESTREIIET LRMVKQLQEE FGEAACQTYI ISMSRELSDL LEVLLLAKEV
GLYDPVTGKS SLQVIPLFET VEDLQNAPRV MTALFELPFY TQLNPTQSEP LQEVMLGYSD
SNKDSGFLSS NWEIHKAQKA LGTVARDHRV KLRIFHGRGG SVGRGGGPAY EAILAQPGRT
TDGRIKITEQ GEVLASKYAL PELALYNLET ITTAVIQSSL LGSGFDDIEP WNQIMEELAA
RSRRHYRALV YEQPDLVDFF NQVTPIEEIS KLQISSRPAR RKTGKRDLGS LRAIPWVFSW
TQSRFLLPSW YGVGTALQEF LQERPEQNLN LLRYFYEKWP FFRMVISKVE MTLAKVDLQI
AHHYVHELAN PEDQERFERV FSQIAAEFQL TCHLVLTITN HGRLLDGDPE LQRSVQLRNG
TIVPLGFLQV ALLKRLRQYR QQTETTGLMR SRYSKGELLR GALLTINGIA AGMRNTG