Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Syncc9902_1933 |
Symbol | |
ID | 3743813 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus sp. CC9902 |
Kingdom | Bacteria |
Replicon accession | NC_007513 |
Strand | + |
Start bp | 1847488 |
End bp | 1850475 |
Gene Length | 2988 bp |
Protein Length | 995 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637772128 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_377934 |
Protein GI | 78185499 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGAGT CCACCGCCCA TGCACTCCAA GGCGAACAGC CCAGGGAGAG CGGTGTGACT GCTGGAGCCG GCCGATTGCT GCAGAACCGC TTGGTGTTGG TCGAAGACCT TTGGCAAACG GTTCTCCGCA GCGAGTGTCC CCCAGAACAA AGTGCTCGCC TCCTCCGACT AAAACAACTG AGTGATCCCG TGGCCCTTGA GGGGCGAGAC GGTGACAGCA CAAGCGAGGC GATTGTTGAA TTGATCCGTG CCATGGATCT TTCAGAAGCC ATTGCCGCGG CCAGAGCGTT CTCGCTGTAT TTCCAACTCA TCAACATCCT CGAACAACGC ATCGAGGAAG ACAGTTATCT CGACAGCCTG GCCCCCCGGA AATCAGCAGC AGACGACGGC CGCGACGCCT TTGACCCTTT TGCTCCTCCT TTGGCGAGTC AGACCGATCC CGCCACATTT GGAGAAGTTT TTGAACGGCT CCGGCGGATG AATGTGCCGC CAGCACAAGT CGAAGCCCTA CTCCAGGAGT TGGATATCCG GTTGGTCTTT ACCGCTCATC CCACAGAGAT TGTTCGGCAC ACCGTCCGTC ACAAGCAACG ACGGGTCGCG AATCTTCTGC AACAACTGCA GTCCGACTCG CCTATGGCGC TTCAAGTCAA AGACGACTTG CGTCAGCAAC TGGAAGAGGA GATTCGACTC TGGTGGAGAA CCGATGAGCT GCATCAGTTC AAGCCAACGG TTCTCGATGA AGTGGATTCA ACCCTTCATT ACTTCCAACA AGTGTTGTTC GATGCCATGC CCCAACTACG GCGACGGCTC ACATCATCAC TTCACCGGCA CTATCCCGAT GTCCAAGTTC CCCAGGCATC GTTCTGCACC TTTGGATCTT GGGTTGGTTC CGACCGGGAT GGCAATCCCT CCGTCACAAC GGACATCACC TGGCGGACCG CCTGCTATCA GCGTCAGCTC ATGCTCGAGC TGTACATCAG CTCGGTCCAG GCGCTTCGTA ATCAGCTGAG CATTTCAATG CAGTGGAGCC AGGTTGCTCC TGCGTTGCTG GAGTCCTTGG AAATGGATCG GCTGCGCTTC CCAGAGATCT ATGAGCGGCG CGCGGCGCGT TACCGACTAG AGCCTTACCG ACTAAAGCTG AGCTACATCC TCGAACGCCT GGAACGCACG CTTAAGCGCA ACGATCAGCT CTCAGATGCG GGGTGGAAGA GTCCTAAAGA TGCCATCCCA ACCGATGGAC CGCCTGGATC GGAAGCGCTC CACTACACCT CGGTCGATCA ATTCCGAAAC GACCTGGAAC TGATCCGCAA CAGCTTGATC GGCACTGAAC TCACCTGCGA GCAGCTCGAT ACCTTGCTCC ATCAAGTCCA TATCTTTGGC TTCTCACTCG CCAGTCTCGA TATTCGTCAG GAGAGCACCC GCCATAGCGA TGCGATTGAC GAACTCACGC GTTTTCTCGA ATTACCACAG CCCTATGGCG AGATGGAAGA GTCCACGCGT GTGGCCTGGT TGATTGAAGA GCTCCGTACC CGGCGTCCGT TGATTCCAAC GGCAGTTCGA TGGTCGGACA CCACCGCGGA GACGATGGCC GTCTTCCGCA TGTTGCATCG ACTCCAAGAG GAGTTCGGCC AGCGGATCTG CCACTCGTAT GTGATTTCGA TGAGCCACAC GGCCTCCGAT CTATTGGAGG TGTTACTGCT CGCGAAGGAA GCAGGACTCG TGGATCCAGC CGCCCGCAAA GCGTCGCTGC TGGTGGTGCC ACTGTTCGAA ACGGTGGAAG ATCTACAACG CGCACCAGCC GTCATGGACG AGCTGTTCAA TACACCTCTT TATCGCGACC TACTCCCGAT GGTGGGGATC CAAGGCCAGC CCCTACAGGA GCTAATGCTT GGCTACTCCG ACAGCAATAA AGACTCAGGT TTCCTCTCGA GCAACTGGGA GATTCACCAA GCTCAACTTG CCCTCCAAGA GCTGTCCAGT CGACAGGGCG TTGCCCTTCG ACTGTTTCAT GGTCGCGGCG GTTCCGTGAG CCGTGGCGGT GGCCCGGCGT ATCAAGCCAT CCTTGCCCAG CCCAGTGGCA CCCTTCAGGG CCGCATCAAA ATCACCGAAC AAGGGGAGGT GCTGGCTTCG AAGTACAGCC TGCCGGAGCT CGCCCTCTAC AACCTGGAAA CAGTCACCAC CGCGGTCGTT CAAAACAGCC TGGTAACCAA TCAGCTCGAT GCGACACCGA GCTGGAACCA ACTCATGAGT CGTCTCTCTG CACGCTCCCG TGAGCACTAC CGCGCGTTGG TACATGACAA CCCCGACTTG GTGGCATTTT TCCAACAGGT GACACCAATC GAAGAGATCA GCAAACTGCA GATTTCCAGT CGACCTGCTC GACGTAAAAC CGGAGCCAAA GATCTTTCAA GCTTGCGGGC GATTCCTTGG GTGTTTGGCT GGACGCAAAG CCGTTTTTTA CTTCCCAGTT GGTTTGGCTT TGGCACAGCT TTAGCTGAAG AGGTGAAAGC AGACCCCGAT CAACTCGATC TGCTCCGGCG GCTTCACCAA CGCTGGCCCT TTTTCCGCAC CCTGATTTCC AAGGTGGAGA TGACCCTCTC CAAAGTGGAT CTCGACTTAG CGCATCACTA CATGAACAGC CTGGGGAAAC CCGAGCAACG TGAGGCCTTC GAGGCGATTT TTGCGGTGAT TGCAACGGAA TACGCGTTAA CGCGGAAATT GGTTTTAGAG ATCACAGGAC AACCGCGCCT ACTCGGGGCT GATCAAGGGC TTCAGCTCTC AGTCGACCTG AGGAATCGCA CGATTGTGCC TTTGGGCTTT CTCCAGGTTG CGCTGCTCAA ACGGCTGCGG GATCAAAACC GTCAGCCACC AATGAGTGAA AGTCCAGGAG CACCCGAAGA CACCCGTACC TACAGCCGCA GTGAGTTACT CCGTGGGGCC CTGCTCACGC TGAATGGGAT TGCTGCTGGC ATGCGCAACA CAGGCTGA
|
Protein sequence | MPESTAHALQ GEQPRESGVT AGAGRLLQNR LVLVEDLWQT VLRSECPPEQ SARLLRLKQL SDPVALEGRD GDSTSEAIVE LIRAMDLSEA IAAARAFSLY FQLINILEQR IEEDSYLDSL APRKSAADDG RDAFDPFAPP LASQTDPATF GEVFERLRRM NVPPAQVEAL LQELDIRLVF TAHPTEIVRH TVRHKQRRVA NLLQQLQSDS PMALQVKDDL RQQLEEEIRL WWRTDELHQF KPTVLDEVDS TLHYFQQVLF DAMPQLRRRL TSSLHRHYPD VQVPQASFCT FGSWVGSDRD GNPSVTTDIT WRTACYQRQL MLELYISSVQ ALRNQLSISM QWSQVAPALL ESLEMDRLRF PEIYERRAAR YRLEPYRLKL SYILERLERT LKRNDQLSDA GWKSPKDAIP TDGPPGSEAL HYTSVDQFRN DLELIRNSLI GTELTCEQLD TLLHQVHIFG FSLASLDIRQ ESTRHSDAID ELTRFLELPQ PYGEMEESTR VAWLIEELRT RRPLIPTAVR WSDTTAETMA VFRMLHRLQE EFGQRICHSY VISMSHTASD LLEVLLLAKE AGLVDPAARK ASLLVVPLFE TVEDLQRAPA VMDELFNTPL YRDLLPMVGI QGQPLQELML GYSDSNKDSG FLSSNWEIHQ AQLALQELSS RQGVALRLFH GRGGSVSRGG GPAYQAILAQ PSGTLQGRIK ITEQGEVLAS KYSLPELALY NLETVTTAVV QNSLVTNQLD ATPSWNQLMS RLSARSREHY RALVHDNPDL VAFFQQVTPI EEISKLQISS RPARRKTGAK DLSSLRAIPW VFGWTQSRFL LPSWFGFGTA LAEEVKADPD QLDLLRRLHQ RWPFFRTLIS KVEMTLSKVD LDLAHHYMNS LGKPEQREAF EAIFAVIATE YALTRKLVLE ITGQPRLLGA DQGLQLSVDL RNRTIVPLGF LQVALLKRLR DQNRQPPMSE SPGAPEDTRT YSRSELLRGA LLTLNGIAAG MRNTG
|
| |