Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC7424_4238 |
Symbol | |
ID | 7108159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 7424 |
Kingdom | Bacteria |
Replicon accession | NC_011729 |
Strand | + |
Start bp | 4701113 |
End bp | 4704178 |
Gene Length | 3066 bp |
Protein Length | 1021 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643482462 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_002379476 |
Protein GI | 218441147 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTCCC TAGTTCAAGT ATCCACAACC CCCTTACAAT TTTTCTCTAC CTCCGATCTA TTTTTACAAG ACAGACTGAA GTTAGTTGAA GAACTATGGG AAGAAGTGCT ACAGGCTGAA TGTGGTCAAG AGTTAGTCGC TCTGCTCAAA AAGTTAAGAG CAATTTGTTG TGAGCAAGGA CAAGCCCAAA AAGCTCTAGA AGATTCCATA ACCCAACTGA TAGAACAACT AGACCTCGTT GAAGCCGTCA GAACATCTCG CGCTTTTGCT CTTTATTTCC AACTGATTAA CATCGTAGAA CAACACTACG AACAACGAGA TCAACAACTC TCCAGACGGG CAACCTATCA AGAAAGCCCC GTAAAAACCA AAAATGGCAA TAATTCCGAG TCTCCGAGTT TATTTTCCTC GGCATTTGGA GCAAATTGGG TAGAAAGAAG TTGGACAGAT GACAACCAAA ACCAACAAAA AGCCGGAACC CTTCACTGGC TCTTTCCCTA TTTAAAACAA CTCAATGTTC CCCCTCAACA AATCCAACGG CTTCTAAATC AACTCGATAT TCGTCTGGTT TTTACAGCCC ATCCCACCGA AATTGTCCGG CACACCATCC GACGCAAACA ACGCCGGATC GCCCATATCT TACAAAAACT CGACCAAGCA GAAGAAACCT TTCGAGGTAT GGGGTTAACC AACTCTTGGG AAGCCGAACA ATTAAAAGAA CAGTTTAAAG AAGAAATTCG TCTCTGGTGG CGGACAGACG AACTACATCA ATTTAAACCC ACCGTTTTAG ACGAAGTGGA TTATGCCCTA CACTACTTTA ATGAAGTTCT GTTTCAAGCT CTGCCTCAAT TAGCCCTCCG TCTCAAACAA ACCTTAAAGG GTGCTTTTCC CCGACTAGAA CCCCCGAAAA ATAATTTCTG TTATTTTGGC TCTTGGGTAG GAGGCGATCG AGATGGAAAC CCGTTTGTCA CTCCGGAAGT CACTTGGGCA ACCGCCTGTT ATCAGCGAAA CGTGGTCATC GAAAAATATC TGCAAACCGT TGATGAGTTA TCGAACATTT TAAGTCCGTC TCTCCATTGG TGTAATGTGT TGCCGGACTT GCTCGACTCT TTAGAAAAAG ACCGGGTACA AATGCCGGAG ATTTATACTC AATTGGCCAT CCGCTACCGT CAAGAACCCT ATCGTCTCAA ATTAGCCTAC ATTAAAAAAC GGCTCGAAAA CACCCAAGAT CGTAACAACC GTCTAGCGAA CCCCAATGAG CGCCTATTAA TCTTCAAAAC CGGCAATCCG AATATTTATC ATAATAAGGA AGAATTGAGC GAAGAACTTC AATTAATCCG CCGAAACTTA GAAGCCAGTG GGTTAACCTG TCAGGAATTG GAAAATTTAA TCTGTCAGGT GGAAATTTAC GGGTTTAACC TCACTCAACT CGATTTCCGT CAAGAATCTA GTTGTCATTC TGACGCAATT AATGAAATTG CCGAGTATTT ACAAATCCTA CCCAAACCCT ATAACCAACT GAGTGAAGCA GAAAGAACCG CTTGGCTGAT AGAAGAATTA AAAACCCGTC GGCCTCTGAT TCCCCAGGAA ATGCCCTTTT CTGAGAAAAC CTGCGAAGTC ATTGAAACCT TAAAAATGTT GCGACTGCTG CAACAGGAAT TTGGCTTAGA GATCTGTCAT ACCTACATTA TCAGTATGAC CAATGATGTG AGTGATGTGT TAGAAGTGAT GCTATTAGCA CAAGAAGCCG GACTCTATGA TCCAGCCACA AGCTCTATCA CCATTCGCAT TGTTCCCCTA TTTGAAACCG TAGAAGACTT GAAACGCGCT CCGGAAATTA TGCGGGCGCT GTTTGAAATG ACCCTTTACC GAGCCGCTTT GGCCGGAGGA TATCAATATT TAGCGAAAGA TCAAGGTCAA GAGATCATTG GGGAACTCCA ACCCCCCCTA CTCCAACCCC CCAACCTTCA AGAAATTATG GTCGGGTATT CCGACAGTAA CAAAGATTCA GGCTTTTTAA GCAGTAATTG GGAAATTCAT AAAGCCCAAA AAGCCCTACA AAAAGTCGCC AAAGGATATG GAATTGACCT ACGACTGTTT CATGGTCGAG GCGGTTCAGT CGGACGAGGA GGCGGGCCAG CTTATGCCGC TATTCTCGCC CAACCCACAG GAACAATTAA CGGACGCATT AAAATTACCG AACAGGGAGA AGTGTTAGCT TCTAAATATT CTTTACCCGA ATTAGCCCTG TATAATTTAG AAACGATCGT CACGGCGGTA ATTCAATCGA GTTTATTGGG GTGTGGGTTT GATGATATTC AAGTCTGGAA TGAAATCATG GAAGACTTAG CCGGTTGCGC TCGTAAAGCT TACCGTTCCT TGATTTATGA AGACCCAGAT TTTATCGATT TCTTCATGTC TGTGACTCCC ATTCCAGAAA TTAGTCAACT GCAAATTAGT TCCCGTCCGG CTCGACGCAA AAGTGGCAAA AAAGATTTGA GTACCTTACG GGCTATTCCT TGGGTATTTA GTTGGACTCA AACCCGTTTT CTGCTGCCGG CTTGGTATGG AGTAGGAACG GCTTTAGAGC AATTTATTAA TCGAGAACCA GAAGAACATT TAAAATTATT GCGCTATTTT TATTTAAAAT GGCCGTTCTT TAGAATGGTA ATTTCTAAAG TCGAGATGAC CTTATCTAAG GTAGATTTAC AAATTGCCCA TCATTATGTA AAAGAACTCT CTAAACCCGA AGATCTAGAG CGGTTTGAAC GAGTCTTTAA TCAAATTTCC GAAGAATATC ATCGGACTTG TCGTTTAGTT TTAGACATTA CAGAAAATGA ACGATTACTT GATGGTGATC CCACATTACA ACGATCTGTC CAGTTAAGAA ATGGCACAAT TGTCCCCTTG GGATTTTTAC AAGTTTCGTT ATTAAAACGG CTACGTCAGT ATAACGCTCA AGCGGAGTCG GGGGTCATTC ATTTCCGTTA TTCTAAGGAA GAATTATTAA GAGGAGCGCT CTTAACCATT AATGGAATTG CAGCCGGAAT GCGGAATACC GGTTGA
|
Protein sequence | MSSLVQVSTT PLQFFSTSDL FLQDRLKLVE ELWEEVLQAE CGQELVALLK KLRAICCEQG QAQKALEDSI TQLIEQLDLV EAVRTSRAFA LYFQLINIVE QHYEQRDQQL SRRATYQESP VKTKNGNNSE SPSLFSSAFG ANWVERSWTD DNQNQQKAGT LHWLFPYLKQ LNVPPQQIQR LLNQLDIRLV FTAHPTEIVR HTIRRKQRRI AHILQKLDQA EETFRGMGLT NSWEAEQLKE QFKEEIRLWW RTDELHQFKP TVLDEVDYAL HYFNEVLFQA LPQLALRLKQ TLKGAFPRLE PPKNNFCYFG SWVGGDRDGN PFVTPEVTWA TACYQRNVVI EKYLQTVDEL SNILSPSLHW CNVLPDLLDS LEKDRVQMPE IYTQLAIRYR QEPYRLKLAY IKKRLENTQD RNNRLANPNE RLLIFKTGNP NIYHNKEELS EELQLIRRNL EASGLTCQEL ENLICQVEIY GFNLTQLDFR QESSCHSDAI NEIAEYLQIL PKPYNQLSEA ERTAWLIEEL KTRRPLIPQE MPFSEKTCEV IETLKMLRLL QQEFGLEICH TYIISMTNDV SDVLEVMLLA QEAGLYDPAT SSITIRIVPL FETVEDLKRA PEIMRALFEM TLYRAALAGG YQYLAKDQGQ EIIGELQPPL LQPPNLQEIM VGYSDSNKDS GFLSSNWEIH KAQKALQKVA KGYGIDLRLF HGRGGSVGRG GGPAYAAILA QPTGTINGRI KITEQGEVLA SKYSLPELAL YNLETIVTAV IQSSLLGCGF DDIQVWNEIM EDLAGCARKA YRSLIYEDPD FIDFFMSVTP IPEISQLQIS SRPARRKSGK KDLSTLRAIP WVFSWTQTRF LLPAWYGVGT ALEQFINREP EEHLKLLRYF YLKWPFFRMV ISKVEMTLSK VDLQIAHHYV KELSKPEDLE RFERVFNQIS EEYHRTCRLV LDITENERLL DGDPTLQRSV QLRNGTIVPL GFLQVSLLKR LRQYNAQAES GVIHFRYSKE ELLRGALLTI NGIAAGMRNT G
|
| |