Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_3469 |
Symbol | |
ID | 7101560 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 3621257 |
End bp | 3624331 |
Gene Length | 3075 bp |
Protein Length | 1024 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 643476481 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_002373590 |
Protein GI | 218248219 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTCGC TTCTTGAATC ATCTTCATCC ATCGTTGAAG AAATTAACAT TTTTTCGACT TCTGATCTCT TCTTACGTCA ACGTCTCAAA TTAGTGGAAG AACTATGGGA GGCGGTGTTA AGGGCGGAAT GTGGTCAAGA ATTGGTGGAT TTGCTCAAAC AACTCCGGAC CATGTGTTCC CCGGAAGGAC AGTTAACCGA TATTACCCAA ACGCCGATTA CCGCAGTGAT TGAGCAATTA GAATTGAATG AATCAATTCG GGCAGCAAGG GCGTTTGCGC TTTATTTTCA ATTAATTAAT ATTGTTGAAC AACATTACGA ACAACGGGAT CAACAACTAA CGCGACGGGC AAATTATTGT GATATCGACA GTAAACCCGC CAATAATCAC GGAGAATCGT CCACTAATCC TTCAATTGGT CATCATTTAG TGGAAAGAAG CTGGATGGAC TCAGAAAATA CCTCGGAAAA AGGGGGAACA TTCCACTGGC TATTCCCTCA TCTCAAAGAA CTAAACGTCC CTCCTCAACA AATTCAACGA CTCCTCGATC AACTCGATAT TCGCCTCGTT TTTACAGCCC ATCCGACGGA AATTGTTCGC CATACCATTC GACGCAAACA ACGGCGCATT GTCAATATTT TACGACGCTT AGATCGCGCT GAAGAAGCCT TTCGGGGCAT GGGACTGAGC AATTCTTGGG AAGCACAGTC GGCTATTGAA CAACTGACCG AAGAAATTCG CCTCTGGTGG CGTACAGATG AACTACATCA GTTTAAACCG AGTGTTCTCG ATGAGGTAGA TTATGCCTTG CACTACTTCG ATGAAGTGTT ATTTGAGGTT TTACCCCAAT TATCCCAACG ATTGCAACAG TCCCTAAAAT CTTCCTTTCC TTGGTTACGT CCTCCCAAAA ATACCTTCTG TCGTTTTGGA TCATGGGTTG GAGGTGATCG CGATGGCAAC CCCTTTGTTA CCCCAGAAGT AACGTGGAAA ACCGCCTGTT ACCAGCGTAA TATGGTGTTA AAGAAATATT TAGAGTCAAT TCGAGACTTA ACCGAGATTT TAAGTGCGTC CTTGCACTGG AGTAACGTTT CCCAAGATTT GCTGGACTCA TTGGAACGCG ATCGCGTTCA AATGCCAGAG ATTTATGATG AGTTAGCCAT CCGCTATCGT CATGAACCCT ATCGCCTCAA ATTGGCTTAT ATTGAGAAAC GGCTACAAAA TACCCGCGAT CGCAACAATC GCTTAGCCAA CCCCGATCAA CGACAACAGC TATTGTATCG GGAAGAAGAG AATATCTATC ATTCAGGGGA AGAATTTTGG CAAGAACTGG AGCTAATTAA GCGAAATTTA GAAGAGACGG GGTTAAATTG CCTAGAGCTT AATAATTTAC TGATTCAAGC CGAAATGTTT GGCTTTAACC TGACTCAATT GGATTTTCGG CAAGATTCTT CCCGTCATGC GGATGCGATC GAAGAAATTG CGGAATACCT GAATATTTTA CCAAAACCCT ACACTCAGCT TTCGGAAGCG GAGAAAACCC AATGGTTGAT CCAAGAACTG AAAACGAGAA GACCCCTTAT TCCCACGGGA ATGCAGTTCA AAAAGCCTGA AAATAGCGAA ACGGTAGAAA CCCTACAAAT GTTGCGGTAT TTGCAACAGG AATTTGGCTT AGAAATCTGT CAAACCTACA TCATCAGCAT GACAAATTAT GTCAGTGATG TCCTAGAAGT GTTGTTATTA GCCAAAGAAG CCGGACTTTA CGACCCTGCT ACAAGTACCA CCACCATTCG CATTGTTCCT CTATTTGAAA CCGTAGACGA CTTAAAACGG GCTCCCGAAG TGATGGAGGA TTTATTTAAG TTACCCCTGT ACCGCGCATC TCTGGCGGGA GGTTACGATC AACTGCAACC ATCGGAAACC CCGAGTCAAG GGGCTGTTAA ATCTCTTAAT CTGCCGGCTT TGCAACCGAC GAACCTACAG GAGATTATGG TGGGATACTC CGATAGTAAC AAAGATTCGG GTTTTTTGAG CAGTAATTGG GAAATTCATA AGGCGCAGAA AGCCCTCCAA AACATGGCTC AACGTTACGG CGTAGACTTA AGGCTGTTTC ATGGTCGTGG CGGCTCGGTG GGACGCGGAG GAGGGCCTGC CTATGCCGCG ATTTTAGCGC AGCCCTCGTC TACCATCAAT GGACGGATTA AGATTACTGA ACAGGGGGAA GTCTTAGCCT CGAAGTATTC CTTAGGAGAT TTGGCGTTAT ATAACCTAGA AACCGTCTCT ACTGCGGTGA TTCAAGCGAG TTTATTAGGG AGTGGATTTG ATGATATTAA CCCCTGGAAT GAGATCATGG AGGACTTAGC TGAACGTGCG CGTAAAGCCT ATCGGGGACT TATTTATGAG CAACCTGATT TTCTCGATTT CTTCCTGTCG GTTACGCCTA TTCCCGAAAT TAGTCAATTA CAGATTAGTT CTCGTCCGGC ACGACGCAAA AGCGGTAAAG CTGATTTAAG CAGTTTACGG GCGATTCCTT GGGTATTTAG CTGGACACAA AGCCGTTTTC TGCTTCCGGC TTGGTATGGG GTAGGAACAG CGTTACAAAG CTTTGTCGAT GAAGAACCGG AGGAAAATTT GAAATTATTG CGTTATTTTT ACCTAAAATG GCCATTTTTT AAAATGGTGG TATCTAAGGT AGAAATGACC CTTTCTAAAG TGGATTTACA AATCGCTCAT CATTATGTGA GGGAATTGTC AAAAGCAGAA GATAAAGAGC GATTTGAGCG AGTTTTTGAG GAAATATCCC AAGAGTATCA CCGTACCCGT GACGTTATTC TCAATATTAC TAATCATCAA CGCTTACTCG ATAGTGATCT GAGTCTCCAG CGTTCGGTTC AGCTACGCAA TGGAACAATT GTTCCCCTTG GCTTTTTACA AGTAGCTCTA TTGAAGCGGT TACGGCAATA TAGTAACCAA GCGCAGTCAG GGGTCATTCA TTTCCGCTAT TCTAAAGAAG AGTTGCTGCG GGGGGCAATG TTAACCATTA ATGGCATTGC TGCAGGGATG CGGAATACGG GTTGA
|
Protein sequence | MSSLLESSSS IVEEINIFST SDLFLRQRLK LVEELWEAVL RAECGQELVD LLKQLRTMCS PEGQLTDITQ TPITAVIEQL ELNESIRAAR AFALYFQLIN IVEQHYEQRD QQLTRRANYC DIDSKPANNH GESSTNPSIG HHLVERSWMD SENTSEKGGT FHWLFPHLKE LNVPPQQIQR LLDQLDIRLV FTAHPTEIVR HTIRRKQRRI VNILRRLDRA EEAFRGMGLS NSWEAQSAIE QLTEEIRLWW RTDELHQFKP SVLDEVDYAL HYFDEVLFEV LPQLSQRLQQ SLKSSFPWLR PPKNTFCRFG SWVGGDRDGN PFVTPEVTWK TACYQRNMVL KKYLESIRDL TEILSASLHW SNVSQDLLDS LERDRVQMPE IYDELAIRYR HEPYRLKLAY IEKRLQNTRD RNNRLANPDQ RQQLLYREEE NIYHSGEEFW QELELIKRNL EETGLNCLEL NNLLIQAEMF GFNLTQLDFR QDSSRHADAI EEIAEYLNIL PKPYTQLSEA EKTQWLIQEL KTRRPLIPTG MQFKKPENSE TVETLQMLRY LQQEFGLEIC QTYIISMTNY VSDVLEVLLL AKEAGLYDPA TSTTTIRIVP LFETVDDLKR APEVMEDLFK LPLYRASLAG GYDQLQPSET PSQGAVKSLN LPALQPTNLQ EIMVGYSDSN KDSGFLSSNW EIHKAQKALQ NMAQRYGVDL RLFHGRGGSV GRGGGPAYAA ILAQPSSTIN GRIKITEQGE VLASKYSLGD LALYNLETVS TAVIQASLLG SGFDDINPWN EIMEDLAERA RKAYRGLIYE QPDFLDFFLS VTPIPEISQL QISSRPARRK SGKADLSSLR AIPWVFSWTQ SRFLLPAWYG VGTALQSFVD EEPEENLKLL RYFYLKWPFF KMVVSKVEMT LSKVDLQIAH HYVRELSKAE DKERFERVFE EISQEYHRTR DVILNITNHQ RLLDSDLSLQ RSVQLRNGTI VPLGFLQVAL LKRLRQYSNQ AQSGVIHFRY SKEELLRGAM LTINGIAAGM RNTG
|
| |