Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_2647 |
Symbol | |
ID | 8391973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | - |
Start bp | 2670294 |
End bp | 2673368 |
Gene Length | 3075 bp |
Protein Length | 1024 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644980608 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_003138344 |
Protein GI | 257060456 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.261414 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0910712 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTCGC TTCTTGAATC ATCTTCATCC ATCGTTGAAG AAATTAACAT TTTTTCGACT TCTGATCTCT TCTTACGTCA ACGTCTCAAA TTAGTGGAAG AACTATGGGA GGCGGTGTTA AGGGCGGAAT GTGGTCAAGA ATTGGTGGAT TTGCTCAAAC AACTCCGGAC CATGTGTTCC CCGGAAGGAC AGTTAACCGA TATTACCCAA ACGCCGATTA CCGCAGTGAT TGAGCAATTA GAATTGAATG AATCAATTCG GGCAGCAAGG GCGTTTGCGC TTTATTTTCA ATTAATTAAT ATTGTTGAAC AACATTACGA ACAACGGGAT CAACAACTAA CGCGACGGGC AAATTATTGT GATATCGACA GTAAACCCGC CAATAATCAC GGAGAATCGT CCACTAATCC TTCAATTGGT CATCATTTAG TGGAAAGAAG CTGGATGGAC TCCGAAAATA CCTCGGAAAA AGGGGGAACA TTCCACTGGC TATTCCCTCA TCTCAAAGAA CTAAACGTCC CTCCTCAACA AATTCAACGA CTCCTCGATC AACTCGATAT TCGCCTCGTT TTTACAGCCC ATCCGACGGA AATTGTTCGC CATACCATTC GACGCAAACA ACGGCGCATT GTCAATATTT TACGACGCTT AGATCGCGCT GAAGAAGCCT TTCGGGGCAT GGGACTGAGC AATTCTTGGG AAGCACAGTC GGCTATTGAA CAACTGACCG AAGAAATTCG CCTCTGGTGG CGTACGGATG AACTGCATCA GTTTAAACCG AGTGTTCTCG ATGAGGTAGA TTATGCCTTG CACTACTTCG ATGAAGTGTT ATTTGAGGTT TTACCCCAAT TATCCCAACG ATTGCAACAG TCCCTAAAAT CTTCCTTTCC TTGGTTACGT CCTCCCAAAA ATACCTTCTG TCGTTTTGGA TCATGGGTTG GAGGTGATCG CGATGGCAAC CCCTTTGTTA CCCCAGAAGT AACGTGGAAA ACCGCCTGTT ACCAGCGTAA TATGGTGTTA AAGAAATATT TAGAGTCAAT TCGAGACTTA ACCGAGATTT TAAGTGCGTC CTTGCACTGG AGTAACGTTT CCCAAGATTT GCTGGACTCA TTGGAACGCG ATCGCGTGCA AATGCCAGAG ATTTATGATG AGTTAGCCAT CCGCTATCGT CATGAACCCT ATCGCCTCAA ATTGGCTTAT ATTGAGAAAC GGCTACAAAA TACCCGCGAT CGCAACAATC GCTTAGCCAA CCCCGATCAA CGACAACAAC TATTGTATCG GGAAGAAGAG AATATCTATC ATTCAGGGGA AGAATTTTGG CAAGAACTGG AGCTAATTAA GCGAAATTTA GAAGAGACGG GGTTAAATTG CCTAGAGCTT AATAATTTAC TCATTCAAGC CGAAATGTTT GGCTTTAACC TGACCCAATT GGATTTTCGG CAAGATTCTT CCCGTCATGC GGATGCGATC GAAGAAATTG CGGAATACCT GAATATTTTA CCAAAACCCT ACACTCAGCT TTCGGAAGCG GAGAAAACCC AATGGTTGAT CCAAGAACTG AAAACGAGAA GACCCCTTAT TCCCACGGGA ATGCAGTTCA AAAAGCCTGA AAATAGCGAA ACGGTAGAAA CCCTACAAAT GTTGCGGTAT TTGCAACAGG AATTTGGCTT AGAAATCTGT CAAACCTACA TCATCAGCAT GACAAATTAT GTCAGTGATG TGCTAGAAGT GTTGTTATTA GCCAAAGAAG CCGGACTTTA CGACCCTGCT ACGAGTACCA CCACCATTCG CATTGTTCCT CTATTTGAAA CCGTAGACGA CTTAAAACGG GCTCCCGAAG TGATGGAGGA TTTATTTAAG TTACCCCTGT ACCGCGCATC TCTGGCGGGA GGTTACGATC AACTGCAACC ATCGGAAACC CCGAGTCAAG GGGCTGTTAA ATCTCTTAAT CTGCCGGCTT TGCAACCGAC GAATCTACAG GAGATTATGG TGGGATACTC CGATAGTAAC AAAGATTCGG GTTTTTTGAG CAGTAATTGG GAAATTCATA AGGCGCAGAA AGCCCTCCAA AACATGGCTC AACGTTACGG CGTAGACTTA AGGCTGTTTC ATGGTCGTGG CGGCTCGGTG GGACGCGGAG GAGGGCCTGC CTATGCCGCG ATTTTAGCGC AGCCCTCGTC TACCATCAAT GGACGGATTA AGATTACTGA ACAGGGGGAA GTCTTAGCCT CGAAGTATTC CTTAGGAGAT TTGGCGTTAT ATAACCTAGA AACCGTCTCT ACTGCGGTGA TTCAAGCGAG TTTATTAGGG AGTGGATTTG ATGATATTAA CCCCTGGAAT GAGATCATGG AGGACTTAGC TGAACGTGCG CGTAAAGCCT ATCGGGGACT TATTTATGAG CAACCTGATT TTCTCGATTT CTTCCTGTCG GTTACTCCTA TTCCCGAAAT TAGTCAATTA CAGATTAGTT CTCGTCCGGC ACGACGCAAA AGCGGTAAAG CTGATTTAAG CAGTTTACGG GCGATTCCTT GGGTATTTAG CTGGACACAA AGCCGTTTTC TGCTTCCGGC TTGGTATGGG GTAGGAACAG CGTTACAAAG CTTTGTCGAT GAAGAACCGG AGGAAAATTT GAAATTATTG CGTTATTTTT ACCTAAAATG GCCATTTTTT AAAATGGTGG TATCTAAGGT AGAAATGACC CTTTCTAAAG TGGATTTACA AATCGCTCAT CATTATGTGA GGGAATTGTC AAAAGCAGAA GATAAAGAGC GATTTGAGCG AGTTTTTGAG GAAATATCCC AAGAGTATCA CCGTACCCGT GACGTTATTC TCAATATTAC TAATCATCAA CGCTTACTCG ATAGTGATCT GAGTCTCCAG CGTTCGGTTC AGCTACGCAA TGGAACAATT GTTCCCCTTG GCTTTTTACA AGTAGCTCTA TTGAAGCGGT TACGGCAATA TAGTAACCAA GCGCAGTCAG GGGTCATTCA TTTCCGCTAT TCTAAAGAAG AGTTGCTGCG GGGGGCAATG TTAACCATTA ATGGCATTGC TGCAGGGATG CGGAATACGG GTTGA
|
Protein sequence | MSSLLESSSS IVEEINIFST SDLFLRQRLK LVEELWEAVL RAECGQELVD LLKQLRTMCS PEGQLTDITQ TPITAVIEQL ELNESIRAAR AFALYFQLIN IVEQHYEQRD QQLTRRANYC DIDSKPANNH GESSTNPSIG HHLVERSWMD SENTSEKGGT FHWLFPHLKE LNVPPQQIQR LLDQLDIRLV FTAHPTEIVR HTIRRKQRRI VNILRRLDRA EEAFRGMGLS NSWEAQSAIE QLTEEIRLWW RTDELHQFKP SVLDEVDYAL HYFDEVLFEV LPQLSQRLQQ SLKSSFPWLR PPKNTFCRFG SWVGGDRDGN PFVTPEVTWK TACYQRNMVL KKYLESIRDL TEILSASLHW SNVSQDLLDS LERDRVQMPE IYDELAIRYR HEPYRLKLAY IEKRLQNTRD RNNRLANPDQ RQQLLYREEE NIYHSGEEFW QELELIKRNL EETGLNCLEL NNLLIQAEMF GFNLTQLDFR QDSSRHADAI EEIAEYLNIL PKPYTQLSEA EKTQWLIQEL KTRRPLIPTG MQFKKPENSE TVETLQMLRY LQQEFGLEIC QTYIISMTNY VSDVLEVLLL AKEAGLYDPA TSTTTIRIVP LFETVDDLKR APEVMEDLFK LPLYRASLAG GYDQLQPSET PSQGAVKSLN LPALQPTNLQ EIMVGYSDSN KDSGFLSSNW EIHKAQKALQ NMAQRYGVDL RLFHGRGGSV GRGGGPAYAA ILAQPSSTIN GRIKITEQGE VLASKYSLGD LALYNLETVS TAVIQASLLG SGFDDINPWN EIMEDLAERA RKAYRGLIYE QPDFLDFFLS VTPIPEISQL QISSRPARRK SGKADLSSLR AIPWVFSWTQ SRFLLPAWYG VGTALQSFVD EEPEENLKLL RYFYLKWPFF KMVVSKVEMT LSKVDLQIAH HYVRELSKAE DKERFERVFE EISQEYHRTR DVILNITNHQ RLLDSDLSLQ RSVQLRNGTI VPLGFLQVAL LKRLRQYSNQ AQSGVIHFRY SKEELLRGAM LTINGIAAGM RNTG
|
| |