Gene PCC8801_3469 details

Gene Information

Locus tag: PCC8801_3469
Symbol:
ID: 7101560
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 3621257
End bp: 3624331
Gene Length: 3075 bp
Protein Length: 1024 aa
Translation table: 11
GC content: 44%
IMG OID: 643476481
Product: phosphoenolpyruvate carboxylase
Protein accession: YP_002373590
Protein GI: 218248219
COG category: [C] Energy production and conversion
COG ID: [COG2352] Phosphoenolpyruvate carboxylase
TIGRFAM ID:
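
The coordinate and length fields in this record are mutually consistent, which a little arithmetic confirms. The sketch below is a minimal Python illustration, assuming the start and end coordinates are 1-based and inclusive and that the CDS length includes one terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a complete CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(3621257, 3624331)
print(length)                  # 3075
print(protein_length(length))  # 1024
```

Both values match the Gene Length and Protein Length fields of this record.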


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTCGC TTCTTGAATC ATCTTCATCC ATCGTTGAAG AAATTAACAT TTTTTCGACT 
TCTGATCTCT TCTTACGTCA ACGTCTCAAA TTAGTGGAAG AACTATGGGA GGCGGTGTTA
AGGGCGGAAT GTGGTCAAGA ATTGGTGGAT TTGCTCAAAC AACTCCGGAC CATGTGTTCC
CCGGAAGGAC AGTTAACCGA TATTACCCAA ACGCCGATTA CCGCAGTGAT TGAGCAATTA
GAATTGAATG AATCAATTCG GGCAGCAAGG GCGTTTGCGC TTTATTTTCA ATTAATTAAT
ATTGTTGAAC AACATTACGA ACAACGGGAT CAACAACTAA CGCGACGGGC AAATTATTGT
GATATCGACA GTAAACCCGC CAATAATCAC GGAGAATCGT CCACTAATCC TTCAATTGGT
CATCATTTAG TGGAAAGAAG CTGGATGGAC TCAGAAAATA CCTCGGAAAA AGGGGGAACA
TTCCACTGGC TATTCCCTCA TCTCAAAGAA CTAAACGTCC CTCCTCAACA AATTCAACGA
CTCCTCGATC AACTCGATAT TCGCCTCGTT TTTACAGCCC ATCCGACGGA AATTGTTCGC
CATACCATTC GACGCAAACA ACGGCGCATT GTCAATATTT TACGACGCTT AGATCGCGCT
GAAGAAGCCT TTCGGGGCAT GGGACTGAGC AATTCTTGGG AAGCACAGTC GGCTATTGAA
CAACTGACCG AAGAAATTCG CCTCTGGTGG CGTACAGATG AACTACATCA GTTTAAACCG
AGTGTTCTCG ATGAGGTAGA TTATGCCTTG CACTACTTCG ATGAAGTGTT ATTTGAGGTT
TTACCCCAAT TATCCCAACG ATTGCAACAG TCCCTAAAAT CTTCCTTTCC TTGGTTACGT
CCTCCCAAAA ATACCTTCTG TCGTTTTGGA TCATGGGTTG GAGGTGATCG CGATGGCAAC
CCCTTTGTTA CCCCAGAAGT AACGTGGAAA ACCGCCTGTT ACCAGCGTAA TATGGTGTTA
AAGAAATATT TAGAGTCAAT TCGAGACTTA ACCGAGATTT TAAGTGCGTC CTTGCACTGG
AGTAACGTTT CCCAAGATTT GCTGGACTCA TTGGAACGCG ATCGCGTTCA AATGCCAGAG
ATTTATGATG AGTTAGCCAT CCGCTATCGT CATGAACCCT ATCGCCTCAA ATTGGCTTAT
ATTGAGAAAC GGCTACAAAA TACCCGCGAT CGCAACAATC GCTTAGCCAA CCCCGATCAA
CGACAACAGC TATTGTATCG GGAAGAAGAG AATATCTATC ATTCAGGGGA AGAATTTTGG
CAAGAACTGG AGCTAATTAA GCGAAATTTA GAAGAGACGG GGTTAAATTG CCTAGAGCTT
AATAATTTAC TGATTCAAGC CGAAATGTTT GGCTTTAACC TGACTCAATT GGATTTTCGG
CAAGATTCTT CCCGTCATGC GGATGCGATC GAAGAAATTG CGGAATACCT GAATATTTTA
CCAAAACCCT ACACTCAGCT TTCGGAAGCG GAGAAAACCC AATGGTTGAT CCAAGAACTG
AAAACGAGAA GACCCCTTAT TCCCACGGGA ATGCAGTTCA AAAAGCCTGA AAATAGCGAA
ACGGTAGAAA CCCTACAAAT GTTGCGGTAT TTGCAACAGG AATTTGGCTT AGAAATCTGT
CAAACCTACA TCATCAGCAT GACAAATTAT GTCAGTGATG TCCTAGAAGT GTTGTTATTA
GCCAAAGAAG CCGGACTTTA CGACCCTGCT ACAAGTACCA CCACCATTCG CATTGTTCCT
CTATTTGAAA CCGTAGACGA CTTAAAACGG GCTCCCGAAG TGATGGAGGA TTTATTTAAG
TTACCCCTGT ACCGCGCATC TCTGGCGGGA GGTTACGATC AACTGCAACC ATCGGAAACC
CCGAGTCAAG GGGCTGTTAA ATCTCTTAAT CTGCCGGCTT TGCAACCGAC GAACCTACAG
GAGATTATGG TGGGATACTC CGATAGTAAC AAAGATTCGG GTTTTTTGAG CAGTAATTGG
GAAATTCATA AGGCGCAGAA AGCCCTCCAA AACATGGCTC AACGTTACGG CGTAGACTTA
AGGCTGTTTC ATGGTCGTGG CGGCTCGGTG GGACGCGGAG GAGGGCCTGC CTATGCCGCG
ATTTTAGCGC AGCCCTCGTC TACCATCAAT GGACGGATTA AGATTACTGA ACAGGGGGAA
GTCTTAGCCT CGAAGTATTC CTTAGGAGAT TTGGCGTTAT ATAACCTAGA AACCGTCTCT
ACTGCGGTGA TTCAAGCGAG TTTATTAGGG AGTGGATTTG ATGATATTAA CCCCTGGAAT
GAGATCATGG AGGACTTAGC TGAACGTGCG CGTAAAGCCT ATCGGGGACT TATTTATGAG
CAACCTGATT TTCTCGATTT CTTCCTGTCG GTTACGCCTA TTCCCGAAAT TAGTCAATTA
CAGATTAGTT CTCGTCCGGC ACGACGCAAA AGCGGTAAAG CTGATTTAAG CAGTTTACGG
GCGATTCCTT GGGTATTTAG CTGGACACAA AGCCGTTTTC TGCTTCCGGC TTGGTATGGG
GTAGGAACAG CGTTACAAAG CTTTGTCGAT GAAGAACCGG AGGAAAATTT GAAATTATTG
CGTTATTTTT ACCTAAAATG GCCATTTTTT AAAATGGTGG TATCTAAGGT AGAAATGACC
CTTTCTAAAG TGGATTTACA AATCGCTCAT CATTATGTGA GGGAATTGTC AAAAGCAGAA
GATAAAGAGC GATTTGAGCG AGTTTTTGAG GAAATATCCC AAGAGTATCA CCGTACCCGT
GACGTTATTC TCAATATTAC TAATCATCAA CGCTTACTCG ATAGTGATCT GAGTCTCCAG
CGTTCGGTTC AGCTACGCAA TGGAACAATT GTTCCCCTTG GCTTTTTACA AGTAGCTCTA
TTGAAGCGGT TACGGCAATA TAGTAACCAA GCGCAGTCAG GGGTCATTCA TTTCCGCTAT
TCTAAAGAAG AGTTGCTGCG GGGGGCAATG TTAACCATTA ATGGCATTGC TGCAGGGATG
CGGAATACGG GTTGA
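
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any computation such as the GC content reported in this record. The following is a minimal sketch, assuming plain A/C/G/T input with no ambiguity codes; `clean_sequence` and `gc_percent` are hypothetical helper names.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked, multi-line sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed in this record.
first_line = "ATGAGTTCGC TTCTTGAATC ATCTTCATCC ATCGTTGAAG AAATTAACAT TTTTTCGACT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Pasting the full gene sequence into `clean_sequence` and rounding the result of `gc_percent` should allow comparison with the GC content field.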
 
Protein sequence
MSSLLESSSS IVEEINIFST SDLFLRQRLK LVEELWEAVL RAECGQELVD LLKQLRTMCS 
PEGQLTDITQ TPITAVIEQL ELNESIRAAR AFALYFQLIN IVEQHYEQRD QQLTRRANYC
DIDSKPANNH GESSTNPSIG HHLVERSWMD SENTSEKGGT FHWLFPHLKE LNVPPQQIQR
LLDQLDIRLV FTAHPTEIVR HTIRRKQRRI VNILRRLDRA EEAFRGMGLS NSWEAQSAIE
QLTEEIRLWW RTDELHQFKP SVLDEVDYAL HYFDEVLFEV LPQLSQRLQQ SLKSSFPWLR
PPKNTFCRFG SWVGGDRDGN PFVTPEVTWK TACYQRNMVL KKYLESIRDL TEILSASLHW
SNVSQDLLDS LERDRVQMPE IYDELAIRYR HEPYRLKLAY IEKRLQNTRD RNNRLANPDQ
RQQLLYREEE NIYHSGEEFW QELELIKRNL EETGLNCLEL NNLLIQAEMF GFNLTQLDFR
QDSSRHADAI EEIAEYLNIL PKPYTQLSEA EKTQWLIQEL KTRRPLIPTG MQFKKPENSE
TVETLQMLRY LQQEFGLEIC QTYIISMTNY VSDVLEVLLL AKEAGLYDPA TSTTTIRIVP
LFETVDDLKR APEVMEDLFK LPLYRASLAG GYDQLQPSET PSQGAVKSLN LPALQPTNLQ
EIMVGYSDSN KDSGFLSSNW EIHKAQKALQ NMAQRYGVDL RLFHGRGGSV GRGGGPAYAA
ILAQPSSTIN GRIKITEQGE VLASKYSLGD LALYNLETVS TAVIQASLLG SGFDDINPWN
EIMEDLAERA RKAYRGLIYE QPDFLDFFLS VTPIPEISQL QISSRPARRK SGKADLSSLR
AIPWVFSWTQ SRFLLPAWYG VGTALQSFVD EEPEENLKLL RYFYLKWPFF KMVVSKVEMT
LSKVDLQIAH HYVRELSKAE DKERFERVFE EISQEYHRTR DVILNITNHQ RLLDSDLSLQ
RSVQLRNGTI VPLGFLQVAL LKRLRQYSNQ AQSGVIHFRY SKEELLRGAM LTINGIAAGM
RNTG
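
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes that the amino-acid assignments of translation table 11 match those of the standard genetic code (the tables differ in their permitted start codons, which this sketch does not handle), and it stops at the first stop codon. `CODON_TABLE` and `translate` are illustrative names, not a library API.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 bases of the gene sequence in this record.
print(translate("ATGAGTTCGC TTCTTGAATC ATCTTCATCC"))  # MSSLLESSSS
print(translate("CGGAATACGG GTTGA"))                  # RNTG
```

The two fragments reproduce the first ten and last four residues of the protein sequence shown above.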