Gene Information |
Locus tag | Rpal_1972 |
Symbol | |
ID | 6409632 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 2131054 |
End bp | 2133864 |
Gene Length | 2811 bp |
Protein Length | 936 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642711858 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_001990970 |
Protein GI | 192290365 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
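
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the stated gene length matches under that assumption) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2131054, 2133864)
print(length)                     # 2811
print(protein_length_aa(length))  # 936
```

Both values agree with the Gene Length (2811 bp) and Protein Length (936 aa) fields of this record.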
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTCGC TGAATCTGTC CGCCGGCCCC GAGCCGGTCT CCGAACGTCC TGACGACGCT GCCGCGATCG AGGCCGAGAC CCGGCTGCGC AACGACATCC GGCTGCTCGG CCGCATTCTC GGCGATACCG TGCGCGAGCA GGAAGGCCAG AGCGTGTTCG ATCTGGTCGA GAACATCCGC CAGACCTCGA TCCGCTTCCA TCGCGATGAC GACAAGACCG CGCGCGCCGA GCTCGCCGCC ATTCTCGACG GCATGTCGAT CCAGGACACG ATGCGGATCG TTCGCGCCTT CAGCTATTTC TCGCACCTCG CCAACATCGC CGAGGACCAG AACAACATCC GCCAGATGCG CGCCGGCTCG ACCGCCGGCT CGGCGCCGCG CGCCGGCCTG CTCGCCAAGA CTCTGGCGCA CGCGCGGCAG GAGGGCATCA GCGCCGCGGA GCTGCGCAAG TTCTTTGCGA CCGCGCTGGT CAGTCCGGTG CTGACCGCGC ATCCGACCGA GGTGCGCCGC AAGAGCACGA TGGACCGCGA GATGCAGATC GCAGGATTGC TCGATCAGCG CGACCGCGTT CAGCTCACCG CCGACGAATG GGCCGACAAC GAGGAGTGGC TGCGCCGCGC CGTCGAGACG CTGTGGAAGA CCAACCTGCT GCGCCGGACC AAGCTGACGG TGCTGGACGA AGTCACCAAT GGGCTGTCGT TCTACGACTA CACCTTCCTG CGCGAGGTGC CACGGCTGCA CAGCGCGCTG GAGGACCGGC TCGCCGATGC GGCGAAAGCC GAAGGCGCCA ACGCCGAGGG CGAACTCGCC AGCTTCCTGC GGATGGGAAG CTGGATCGGC GGCGACCGCG ATGGCAATCC GTTCGTCACC GCCGAGGTGC TGCACGGCAC CTTGAAGCTG CAGAGCACGC GGGTGCTGCG CTATTATCTC GAGGAGCTGC ACGAGCTCGG CTCGGAATTG TCACTGGCCT CGCATCTCGC CGGCACCACC GACACCGTCA AGGCGCTGGC CGAAACTTCG CCCGACACTT CGCCGCATCG CAAATACGAG CCGTATCGGC TCGCGGTGTC CGGCATCTAT GCGCGGCTGG CTGCCACGGC GCTCAAGCTC GAGGTCGAGA ACCTCCGCAC ACCGGTCGGC GAGGCCGAGC CCTATGCCAG CGCGCAAGAC TTCAAGACCG ATCTTGACGC CATCCATCTT TCCCTGACCA CGCATCATTC CGGCGTGATC GCACGCGGTC GGCTGCGCCA GCTCCGTCGT GCAATCGACT GTTTCGGATT CCACCTCGCC AGCCTCGACA TGCGGCAGAA CTCGGCGGTG CATGAGCGCA CCGTCGGCGA GCTGATGGAC GCGGCCCGGC CGGGTACGTC CTACGCGGTG CTCGACGAGG AAGCGCGGAT CGCGCTGCTG ATCAGCGAGT TGCGCAGTAC CCGGCCGCTG ACCTCGATGT TCGTCAAATA CAGCGACGAG ACGGTCGGCG AGCTTGCGGT GTTCCGCGAA GCGGCGAAGG CCCATGCGAC CTACGGGGCG GCGGCGATCC CCCAATGCAT CATTTCGATG ACCAAGGGCG TCTCGGATCT CTTGGAGGTC GCGGTGCTGC TCAAGGAAGT CGGGTTGATC GATCCGTCCG GGCGCAGCGC CATCAACGTC GTGCCGCTGT TCGAGACCAT CGAGGATCTG CAGGCCTGCG CCAAGATCAT GGACCGGCTG CTGTCGATCC CCGAATATCG CCGCCTGGTC GACAGCCGCG GCTCGGTGCA GGAGGTGATG CTCGGCTACT CCGACAGCAA TAAGGACGGC 
GGCTTCGTCA CCTCAGGCTG GGAGCTGTAC AAGGCCGAGA TCGGTCTGAT CGAGATCTTC GAGCACCACG GCGTTCGGCT GCGGCTGTTC CACGGCCGTG GCGGCTCGGT CGGCCGCGGC GGCGGCCCGA GCTACGATGC GATCGTGGCG CAGCCGGGTG GTGCGGTGAA CGGCCAGATC CGCATCACCG AGCAGGGCGA GATCATCACC AGTAAATATT CCAACGTCGA AGTCGGCCGC AACAATCTCG AGATCCTCGC CGCCGCGACG CTGGAAGCGA GCCTGCTGCA GCCGAAGCGC GTGGCGCCGC ACCGCGATTA TCTCGAAGCG ATGGAGCAGC TCTCGGCATT GGCCTTCAAG GCGTATCGCG GCCTGGTGTA CGAGACCGAC GGCTTCGTCG ATTACTTCTG GGCCTCGACG GTGATCAATG AGATTTCGAC GCTGAACATC GGCAGCCGCC CGGCCTCGCG CAAGAAGACC CGCGCGATCG AGGACCTGCG CGCGATCCCA TGGGTGTTCT CGTGGGCGCA GTGCCGGCTG ATGCTGCCGG GCTGGTATGG CTTCGGCAGC GCGGTGTCGG CCTGGGTCAC TGAGCACCCC GACAAGGGCA TCGCCTTTCT GCAGGCGATG TATCAGGAGT GGCCGTTCTT CCGTACGCTC TTGTCGAATA TGGACATGGT GCTGTCGAAG AGCTCGATCG GCATCGCCTC GCGCTATGCG GAGCTGGTCG AAGACACCGC GATCCGCGAC CGCATCTTCG GCCGCATCCG CGCCGAATGG CATTCGTCGA TCGACTATCT CCTGGCGATC ATGCAGCAGG ACCATTTGCT GCAGAGCAAC CCATTGCTCG AACGCTCGAT CCGCCACCGC TTCCCGTATC TCGATCCGCT GAACCACGTC CAGGTCCAGC TGCTGCGCGA ACATCGTACC CATGATCCGG ACGAGCAGGT GCTGCGCGGC GTGCAACTGA CGATCAACGG GATTTCCGCG GGGCTGCGGA ATAGCGGGTG A
|
Protein sequence | MSSLNLSAGP EPVSERPDDA AAIEAETRLR NDIRLLGRIL GDTVREQEGQ SVFDLVENIR QTSIRFHRDD DKTARAELAA ILDGMSIQDT MRIVRAFSYF SHLANIAEDQ NNIRQMRAGS TAGSAPRAGL LAKTLAHARQ EGISAAELRK FFATALVSPV LTAHPTEVRR KSTMDREMQI AGLLDQRDRV QLTADEWADN EEWLRRAVET LWKTNLLRRT KLTVLDEVTN GLSFYDYTFL REVPRLHSAL EDRLADAAKA EGANAEGELA SFLRMGSWIG GDRDGNPFVT AEVLHGTLKL QSTRVLRYYL EELHELGSEL SLASHLAGTT DTVKALAETS PDTSPHRKYE PYRLAVSGIY ARLAATALKL EVENLRTPVG EAEPYASAQD FKTDLDAIHL SLTTHHSGVI ARGRLRQLRR AIDCFGFHLA SLDMRQNSAV HERTVGELMD AARPGTSYAV LDEEARIALL ISELRSTRPL TSMFVKYSDE TVGELAVFRE AAKAHATYGA AAIPQCIISM TKGVSDLLEV AVLLKEVGLI DPSGRSAINV VPLFETIEDL QACAKIMDRL LSIPEYRRLV DSRGSVQEVM LGYSDSNKDG GFVTSGWELY KAEIGLIEIF EHHGVRLRLF HGRGGSVGRG GGPSYDAIVA QPGGAVNGQI RITEQGEIIT SKYSNVEVGR NNLEILAAAT LEASLLQPKR VAPHRDYLEA MEQLSALAFK AYRGLVYETD GFVDYFWAST VINEISTLNI GSRPASRKKT RAIEDLRAIP WVFSWAQCRL MLPGWYGFGS AVSAWVTEHP DKGIAFLQAM YQEWPFFRTL LSNMDMVLSK SSIGIASRYA ELVEDTAIRD RIFGRIRAEW HSSIDYLLAI MQQDHLLQSN PLLERSIRHR FPYLDPLNHV QVQLLREHRT HDPDEQVLRG VQLTINGISA GLRNSG
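
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any analysis. The following is a minimal sketch of how one might strip the grouping, compute GC content, and check that a CDS begins with a start codon and ends with a stop codon. The helper names are hypothetical, and the start-codon set is only the commonly used subset of initiators for translation table 11 (an assumption, not an exhaustive list). The demo uses a short toy string rather than the full gene sequence.

```python
# Commonly used initiators for bacterial translation table 11 (assumed subset).
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace used to group residues and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def has_start_and_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the terminal codons look like a CDS."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


toy = clean_sequence("ATGTCGTCG TGA")  # toy input, not the real gene
print(toy)                      # ATGTCGTCGTGA
print(gc_fraction(toy))         # 0.5
print(has_start_and_stop(toy))  # True
```

Applied to the full gene sequence of this record, `gc_fraction` would be expected to land near the listed GC content of 65%, although that figure has not been recomputed here.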