Gene Cyan8802_2647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_2647 
Symbol 
ID8391973 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp2670294 
End bp2673368 
Gene Length3075 bp 
Protein Length1024 aa 
Translation table11 
GC content44% 
IMG OID644980608 
Productphosphoenolpyruvate carboxylase 
Protein accessionYP_003138344 
Protein GI257060456 
COG category[C] Energy production and conversion 
COG ID[COG2352] Phosphoenolpyruvate carboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.261414 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0910712 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCGC TTCTTGAATC ATCTTCATCC ATCGTTGAAG AAATTAACAT TTTTTCGACT 
TCTGATCTCT TCTTACGTCA ACGTCTCAAA TTAGTGGAAG AACTATGGGA GGCGGTGTTA
AGGGCGGAAT GTGGTCAAGA ATTGGTGGAT TTGCTCAAAC AACTCCGGAC CATGTGTTCC
CCGGAAGGAC AGTTAACCGA TATTACCCAA ACGCCGATTA CCGCAGTGAT TGAGCAATTA
GAATTGAATG AATCAATTCG GGCAGCAAGG GCGTTTGCGC TTTATTTTCA ATTAATTAAT
ATTGTTGAAC AACATTACGA ACAACGGGAT CAACAACTAA CGCGACGGGC AAATTATTGT
GATATCGACA GTAAACCCGC CAATAATCAC GGAGAATCGT CCACTAATCC TTCAATTGGT
CATCATTTAG TGGAAAGAAG CTGGATGGAC TCCGAAAATA CCTCGGAAAA AGGGGGAACA
TTCCACTGGC TATTCCCTCA TCTCAAAGAA CTAAACGTCC CTCCTCAACA AATTCAACGA
CTCCTCGATC AACTCGATAT TCGCCTCGTT TTTACAGCCC ATCCGACGGA AATTGTTCGC
CATACCATTC GACGCAAACA ACGGCGCATT GTCAATATTT TACGACGCTT AGATCGCGCT
GAAGAAGCCT TTCGGGGCAT GGGACTGAGC AATTCTTGGG AAGCACAGTC GGCTATTGAA
CAACTGACCG AAGAAATTCG CCTCTGGTGG CGTACGGATG AACTGCATCA GTTTAAACCG
AGTGTTCTCG ATGAGGTAGA TTATGCCTTG CACTACTTCG ATGAAGTGTT ATTTGAGGTT
TTACCCCAAT TATCCCAACG ATTGCAACAG TCCCTAAAAT CTTCCTTTCC TTGGTTACGT
CCTCCCAAAA ATACCTTCTG TCGTTTTGGA TCATGGGTTG GAGGTGATCG CGATGGCAAC
CCCTTTGTTA CCCCAGAAGT AACGTGGAAA ACCGCCTGTT ACCAGCGTAA TATGGTGTTA
AAGAAATATT TAGAGTCAAT TCGAGACTTA ACCGAGATTT TAAGTGCGTC CTTGCACTGG
AGTAACGTTT CCCAAGATTT GCTGGACTCA TTGGAACGCG ATCGCGTGCA AATGCCAGAG
ATTTATGATG AGTTAGCCAT CCGCTATCGT CATGAACCCT ATCGCCTCAA ATTGGCTTAT
ATTGAGAAAC GGCTACAAAA TACCCGCGAT CGCAACAATC GCTTAGCCAA CCCCGATCAA
CGACAACAAC TATTGTATCG GGAAGAAGAG AATATCTATC ATTCAGGGGA AGAATTTTGG
CAAGAACTGG AGCTAATTAA GCGAAATTTA GAAGAGACGG GGTTAAATTG CCTAGAGCTT
AATAATTTAC TCATTCAAGC CGAAATGTTT GGCTTTAACC TGACCCAATT GGATTTTCGG
CAAGATTCTT CCCGTCATGC GGATGCGATC GAAGAAATTG CGGAATACCT GAATATTTTA
CCAAAACCCT ACACTCAGCT TTCGGAAGCG GAGAAAACCC AATGGTTGAT CCAAGAACTG
AAAACGAGAA GACCCCTTAT TCCCACGGGA ATGCAGTTCA AAAAGCCTGA AAATAGCGAA
ACGGTAGAAA CCCTACAAAT GTTGCGGTAT TTGCAACAGG AATTTGGCTT AGAAATCTGT
CAAACCTACA TCATCAGCAT GACAAATTAT GTCAGTGATG TGCTAGAAGT GTTGTTATTA
GCCAAAGAAG CCGGACTTTA CGACCCTGCT ACGAGTACCA CCACCATTCG CATTGTTCCT
CTATTTGAAA CCGTAGACGA CTTAAAACGG GCTCCCGAAG TGATGGAGGA TTTATTTAAG
TTACCCCTGT ACCGCGCATC TCTGGCGGGA GGTTACGATC AACTGCAACC ATCGGAAACC
CCGAGTCAAG GGGCTGTTAA ATCTCTTAAT CTGCCGGCTT TGCAACCGAC GAATCTACAG
GAGATTATGG TGGGATACTC CGATAGTAAC AAAGATTCGG GTTTTTTGAG CAGTAATTGG
GAAATTCATA AGGCGCAGAA AGCCCTCCAA AACATGGCTC AACGTTACGG CGTAGACTTA
AGGCTGTTTC ATGGTCGTGG CGGCTCGGTG GGACGCGGAG GAGGGCCTGC CTATGCCGCG
ATTTTAGCGC AGCCCTCGTC TACCATCAAT GGACGGATTA AGATTACTGA ACAGGGGGAA
GTCTTAGCCT CGAAGTATTC CTTAGGAGAT TTGGCGTTAT ATAACCTAGA AACCGTCTCT
ACTGCGGTGA TTCAAGCGAG TTTATTAGGG AGTGGATTTG ATGATATTAA CCCCTGGAAT
GAGATCATGG AGGACTTAGC TGAACGTGCG CGTAAAGCCT ATCGGGGACT TATTTATGAG
CAACCTGATT TTCTCGATTT CTTCCTGTCG GTTACTCCTA TTCCCGAAAT TAGTCAATTA
CAGATTAGTT CTCGTCCGGC ACGACGCAAA AGCGGTAAAG CTGATTTAAG CAGTTTACGG
GCGATTCCTT GGGTATTTAG CTGGACACAA AGCCGTTTTC TGCTTCCGGC TTGGTATGGG
GTAGGAACAG CGTTACAAAG CTTTGTCGAT GAAGAACCGG AGGAAAATTT GAAATTATTG
CGTTATTTTT ACCTAAAATG GCCATTTTTT AAAATGGTGG TATCTAAGGT AGAAATGACC
CTTTCTAAAG TGGATTTACA AATCGCTCAT CATTATGTGA GGGAATTGTC AAAAGCAGAA
GATAAAGAGC GATTTGAGCG AGTTTTTGAG GAAATATCCC AAGAGTATCA CCGTACCCGT
GACGTTATTC TCAATATTAC TAATCATCAA CGCTTACTCG ATAGTGATCT GAGTCTCCAG
CGTTCGGTTC AGCTACGCAA TGGAACAATT GTTCCCCTTG GCTTTTTACA AGTAGCTCTA
TTGAAGCGGT TACGGCAATA TAGTAACCAA GCGCAGTCAG GGGTCATTCA TTTCCGCTAT
TCTAAAGAAG AGTTGCTGCG GGGGGCAATG TTAACCATTA ATGGCATTGC TGCAGGGATG
CGGAATACGG GTTGA
 
Protein sequence
MSSLLESSSS IVEEINIFST SDLFLRQRLK LVEELWEAVL RAECGQELVD LLKQLRTMCS 
PEGQLTDITQ TPITAVIEQL ELNESIRAAR AFALYFQLIN IVEQHYEQRD QQLTRRANYC
DIDSKPANNH GESSTNPSIG HHLVERSWMD SENTSEKGGT FHWLFPHLKE LNVPPQQIQR
LLDQLDIRLV FTAHPTEIVR HTIRRKQRRI VNILRRLDRA EEAFRGMGLS NSWEAQSAIE
QLTEEIRLWW RTDELHQFKP SVLDEVDYAL HYFDEVLFEV LPQLSQRLQQ SLKSSFPWLR
PPKNTFCRFG SWVGGDRDGN PFVTPEVTWK TACYQRNMVL KKYLESIRDL TEILSASLHW
SNVSQDLLDS LERDRVQMPE IYDELAIRYR HEPYRLKLAY IEKRLQNTRD RNNRLANPDQ
RQQLLYREEE NIYHSGEEFW QELELIKRNL EETGLNCLEL NNLLIQAEMF GFNLTQLDFR
QDSSRHADAI EEIAEYLNIL PKPYTQLSEA EKTQWLIQEL KTRRPLIPTG MQFKKPENSE
TVETLQMLRY LQQEFGLEIC QTYIISMTNY VSDVLEVLLL AKEAGLYDPA TSTTTIRIVP
LFETVDDLKR APEVMEDLFK LPLYRASLAG GYDQLQPSET PSQGAVKSLN LPALQPTNLQ
EIMVGYSDSN KDSGFLSSNW EIHKAQKALQ NMAQRYGVDL RLFHGRGGSV GRGGGPAYAA
ILAQPSSTIN GRIKITEQGE VLASKYSLGD LALYNLETVS TAVIQASLLG SGFDDINPWN
EIMEDLAERA RKAYRGLIYE QPDFLDFFLS VTPIPEISQL QISSRPARRK SGKADLSSLR
AIPWVFSWTQ SRFLLPAWYG VGTALQSFVD EEPEENLKLL RYFYLKWPFF KMVVSKVEMT
LSKVDLQIAH HYVRELSKAE DKERFERVFE EISQEYHRTR DVILNITNHQ RLLDSDLSLQ
RSVQLRNGTI VPLGFLQVAL LKRLRQYSNQ AQSGVIHFRY SKEELLRGAM LTINGIAAGM
RNTG