Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan7425_2220 |
Symbol | |
ID | 7288147 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 7425 |
Kingdom | Bacteria |
Replicon accession | NC_011884 |
Strand | - |
Start bp | 2135997 |
End bp | 2139053 |
Gene Length | 3057 bp |
Protein Length | 1018 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643585215 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_002482942 |
Protein GI | 220907631 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0767995 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTCTA CCGTTCCCCT CTCTGAACAG GATCAAATTC TTTCCAACCG GCGTGAGCAA GCTCTCCGCA ATCGGATGCA GGTGATTGAA GACCTGTTAG AAGCGGTTCT GCAAACGGAA TGTGGTCAGA CCCTTGTAGA TCTGCTCCGG CAATTACGGG CCCTCTGTTC TCCCGAAGGT CAGGTGAGAC GGGCCCTGGA TGAAGAAGTG CTCCGGTTGA TCGAAAAGCT GGACTTCAAT GAAGCGATTC GGGCGGCCCG TTCCTTTGCC CTCTACTTCC AACTGATCAA TATTGTTGAG CAGCACTACG AGCAACAGGA AAGCCTGTAC CGTACCCATC CAGAATCGGG ACATCCCCTC CCAGATTTGA TTTCCCATAT CAGTGGCGAG AGCTTCCCAG ACAAAAGTTT TTCTGGACAG AACTTTTCGG CACACAATAA TACAGTCGAT GTCCGGGGTG GTCTAACAGA GCGGGTCGAA CATAGTCTTT ACGAAGCCAC CTCTTTGGAG CGTCATCAGG GTACCTTTGC CTGGCTGTTC CCCCATCTCT GGCAACAGAA TGTTCCCCCC CGTTATCTCC AAAATCTGAT CGATCAACTG GATATCAAGC TGGTCTTTAC GGCTCACCCA ACCGAAATTG TGCGGCAGAC GATCCGGGAT AAACAGCGGC GGATTGCTCG AATCCTGGAA CAACTGGATA GGGTGGAGGC AACCCTGGGC AGTCGCAGTG GGGTGGAGAT GCAAAACCTG CATGAGCAGC TCATCGAAGA AATTCGGCTC TGGTGGCGAA CCGATGAATT ACACCAGTTC AAACCCCAGG TTTTAGATGA GGTGGAATAC AGCCTGCACT ACTTTAAGGA AGTGGTGTTC GAGATTATTC CCCAGCTTTA CCGCCGCATC ACCCAATCTC TCCAACAGTC TTTCCCTAAT TTGCGGCCAC CCCGGCATAA TTTCTGTAAA TTTGGTTCCT GGGTGGGGGC CGATCGCGAC GGCAACCCGT CTGTGACCCC AGAGGTGACC TGGGAAACCG CCTGCTACCA ACGGGATCTG GTCCTGGAGA AGTATACTAA GTCGGTTGAA AAACTGATCA ATGTCCTCAG CTTGTCGCTG CACTGGTGTG ATGTGCTGCC GGATCTGCTG GATTCATTGG AACAGGATCA ACAACAATTC CCCGAACTCT ATGAGCAGCT ATCTGCGCGC TTTCGCCATG AGCCCTATCG CTTCAAGCTT TCTTATGTTT TAAGGAGACT AGAGAATACC CGCGATCGCA ACCGGCAACT GCAGAATGGC TACTATCCCT TTTCGACGGA TAGTTCAGGG TATGGGGACC TACAGATTTA CCGTAAGGGC GAGGAATTTC TGGCGGAATT GTTGCTGATT CAACGCAATC TGCAGGCCAC TAATCTCAGT TGTCGGCAAC TGGAAGATTT AATTTGCCAG GTTGAGATCT TTGGCTTTAA TTTAGCCCAG CTCGATATCC GTCAGGAAAG TTCCTGTCAT TCTGAAGCAC TGAATGAAAT CATTACCTAT TTGCAAATTC TCCCTCGCCC CTACAATGAG CTGAGTGAGG GGGAGCGGAC CGAGTGGCTG GTGAAGGAAC TTCAGACCCG GCGGCCCTTG ATTCCGGCAG AGTTGCCCTT TAGTGAGCGC ACCCGTGAGA TCATTGAAAC CCTGCGCATG GTGCGGCGGC TCCAGCAAGA GTTTGGCCCG GACATCTGCA AAACCTACGT GATCAGCATG AGCCATGAGG CCAGTGATTT GCTGGAAGTG CTGCTGCTGG CGAAGGAGGC TGGTCTATAC GATCCGGTTA CAGGAACAGG GAATCTGCAG GTGGTGCCGC TGTTTGAAAC CGTTGAAGAC TTGAAAAAAG CCCCCACTGT GCTGACCCAA CTCCTGGAGT TGCCCTTTTA TCGCGCTTAC CTGGATAGTG ATTGCCCCAG TGCCGATGTT CCACCTTTAC AGGAAGTGAT GTTGGGTTAT TCCGACAGCA ATAAGGATTC TGGCTTCTTG AGCAGCAATT GGGAAATTTA TAAAGCCCAG CAATCCCTGC AGCACACAGC GGAACGGTAT GGTGTGGCCC TGCGGATCTT CCACGGTCGG GGCGGCTCTG TGGGACGGGG AGGGGGCCCA GCCTATGCAG CTATCCTGGC CCAACCAGGA CGAACGATCG ATGGGCGGAT CAAGATTACT GAACAGGGGG AAGTGCTGGC CTCTAAGTAT TCTCTGCCAG AACTGGCCCT CTATAATCTG GAAACGATTA CGACTGCAGT GATTCAAGCC AGTTTACTCC GCACCAGTAT TGATGAAATT CAGCCCTGGC ACGAGATCAT GGAGGAGCTG GCCCTGCGAT CGCGGCAGCA CTACCGGCAG CTGATCTACG AACAACCGGA TTTTGTCGAT TTCTTTCATC AGGTCACCCC GATCGAAGAG ATCAGCCAGT TAGAAATCAG TTCTCGCCCC AGCCGGCGGG GAGGCAAGCG AACCCTGACC AGCCTGCGGG CCATTCCCTG GGTGTTTAGC TGGACACAGA GTCGTTTTCT TCTCCCGGCC TGGTACGGGG TAGGCACCGC CATCCAGGAC TTTTTGGCCG AAAAGCCGGT CGAGCATCTT TCTCTGCTGC AGTATTTCTA TTTCAAATGG CCCTTTTTCC GCATGGTGAT TTCTAAGGCA GAAATGACCC TGGCCAAGGT AGACCTGCAA ATTGCTCAGC ATTACGTGCG GGAATTGACC CACCCAGAAG ATCGAGAACG CTTTGCCCCC CTCTATGACC AAATTGCCCA GGAATATTAC CGCACCTGCG AGTTAATTCT GACCATTACC GGGCATAAGC GACTCCTGGA TGGAGATCCC GATTTACAGC GATCGGTACA GCTCCGCAAT GGTAACATTG TCCCCCTCGG CTTTCTCCAG GTTTTATTAC TCAAACGACT GCGGCAACAC CAGAGCCAAA CCAGTTCTGG GGCGTTATTG CGATCGCGCT ACAGTAAGGG CGAACTGCGT CGAGGTGCGT TGTTAACGAT TAATGGCATT GCCGCCGGGA TGCGAAACAC AGGTTAA
|
Protein sequence | MTSTVPLSEQ DQILSNRREQ ALRNRMQVIE DLLEAVLQTE CGQTLVDLLR QLRALCSPEG QVRRALDEEV LRLIEKLDFN EAIRAARSFA LYFQLINIVE QHYEQQESLY RTHPESGHPL PDLISHISGE SFPDKSFSGQ NFSAHNNTVD VRGGLTERVE HSLYEATSLE RHQGTFAWLF PHLWQQNVPP RYLQNLIDQL DIKLVFTAHP TEIVRQTIRD KQRRIARILE QLDRVEATLG SRSGVEMQNL HEQLIEEIRL WWRTDELHQF KPQVLDEVEY SLHYFKEVVF EIIPQLYRRI TQSLQQSFPN LRPPRHNFCK FGSWVGADRD GNPSVTPEVT WETACYQRDL VLEKYTKSVE KLINVLSLSL HWCDVLPDLL DSLEQDQQQF PELYEQLSAR FRHEPYRFKL SYVLRRLENT RDRNRQLQNG YYPFSTDSSG YGDLQIYRKG EEFLAELLLI QRNLQATNLS CRQLEDLICQ VEIFGFNLAQ LDIRQESSCH SEALNEIITY LQILPRPYNE LSEGERTEWL VKELQTRRPL IPAELPFSER TREIIETLRM VRRLQQEFGP DICKTYVISM SHEASDLLEV LLLAKEAGLY DPVTGTGNLQ VVPLFETVED LKKAPTVLTQ LLELPFYRAY LDSDCPSADV PPLQEVMLGY SDSNKDSGFL SSNWEIYKAQ QSLQHTAERY GVALRIFHGR GGSVGRGGGP AYAAILAQPG RTIDGRIKIT EQGEVLASKY SLPELALYNL ETITTAVIQA SLLRTSIDEI QPWHEIMEEL ALRSRQHYRQ LIYEQPDFVD FFHQVTPIEE ISQLEISSRP SRRGGKRTLT SLRAIPWVFS WTQSRFLLPA WYGVGTAIQD FLAEKPVEHL SLLQYFYFKW PFFRMVISKA EMTLAKVDLQ IAQHYVRELT HPEDRERFAP LYDQIAQEYY RTCELILTIT GHKRLLDGDP DLQRSVQLRN GNIVPLGFLQ VLLLKRLRQH QSQTSSGALL RSRYSKGELR RGALLTINGI AAGMRNTG
|
| |