Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3592 |
Symbol | |
ID | 3911394 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4117754 |
End bp | 4120555 |
Gene Length | 2802 bp |
Protein Length | 933 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637885494 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_487198 |
Protein GI | 86750702 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.795293 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTCCA TGATCCTGCC GACCGAGCCC GAGGCGCTGC CGAACCGCGC CGACGACAGT GCCGCGCTGG AGGCCGAGAC GCGGCTGCGC AACGACATCC GGCTGCTCGG CCGGATTCTC GGCGACACCG TGCGCGACCA GGAAGGCGCC GCGGTGTTCG ATCTGGTCGA GGGCATCCGC CAGACCTCGA TCCGGTTTCA CCGCGACGAC GACACCACCG CGCGGCGCGA GCTCGAAGCC ATCTTGGACG GCATGTCGGC CTCCGACACC GTGAAGATCG TTCGTGCCTT CAGCTATTTC TCGCACCTCG CCAACATCGC CGAAGACCAG AACAACATCC GCCAGATGCG GGTCGGCTCG ACCGCGGGCT CGGCGCCGCG TGCCGGCATG CTGGCCAAGA CGCTCGCGCA TGCGCGCGCG GACGGGATCG GCGCAGCCGA GCTGCGCGAT TTCTTCAAGA CCGCTCTGGT GAGCCCGGTG CTGACCGCGC ATCCGACCGA GGTGCGGCGC AAGAGCACGA TGGACCGCGA GATGCAGATC GCGGCCTTGC TCGACGAGCG CGAGCGCGTC CAGCTCACGC CCGAGGAATG GGAGCAGAAC GAGGAGCAGC TCCGCCGCGC GGTGGTGACG TTGTGGAAGA CCAATCTGTT GCGCCGGACC AAGCTGACGG TGCTCGACGA AGTCACCAAC GGCCTGTCGT TCTACGACTA CACCTTCCTG CGCGAGGTGC CGCGGCTGCA TTGCGCGCTG GAGGATCAGC TCGGCGGCGG CGAGGGCGCC GAGGCCGATG CGGAACTCGC GAGCTTCCTG CGGATGGGAA GCTGGATCGG CGGCGACCGC GATGGCAATC CGTTCGTCAC CGCCGAGGTG CTGCACGGCA CGTTGCAATT GCAGAGCGCC CGCGTGCTGC GGTTCTATCT CGACGAACTG CACGAGCTCG GCTCGGAATT GTCGCTGGCG TCGCATCTGG TGGCGATCAG CGACGAGGTC CGCGCGCTGG CCGAGCGCTC ACCGGATCAT TCGCCGCATC GCCGCCACGA GCCGTATCGG CTGGCGGTGT CGGGAATTTA TGCGCGGCTG GCCGCGACCG CGGCGAAGCT GAGGATCGAC AGCATCCGCG CCCCGGTCGG CGAAGCCGAG GCTTACGCGA GCGTGCACGA CTTCAAGGCC GATCTCGACG CGATCCATCG TTCGCTGGTC GCCCACAATG CCGGCGTGAT CGCGCGGGGC CGGCTGCGGC AGCTGCGCCG CGCCGCCGAT TGCTTCGGCT TCCACCTCGC CAGCCTCGAC ATGCGGCAGA ATTCCGCGGT GCACGAGCGC ACCATCGCCG AACTGATGAA CGCGGCGCAT CCGGCCAGCG CCTATCTGGA GATCGGCGAA GACGCACGGA TCGCGCTACT CACTGCCGAG CTGCGCAGCG CCCGGCCGCT GACCTCGATC TTCGTCAAAT ACAGCGACGA GACCGTCGGC GAACTCGCGG TGCTGCACGA GGCGGCGCAG GCGCACGCCA CTTACGGCGC GGCGGCGATC CCGCAATGCA TCATCTCGAT GACCAAGGGC GTCTCCGACC TGCTCGAAGT CGCGGTGCTG CTCAAGGAGG TCGGACTGAT CGATCCGTCG GGGCGCAGCG CGATCAACAT CGTGCCGCTG TTCGAGACCA TCGAGGATCT GCAGGCCTCC TCGGCGATCA TGGACCGGCT GCTCGGCATC CCGGAATATC GCCGGCTGGT CGACAGCCGC GGCGGCGTGC AGGAAGTGAT GCTCGGCTAT TCAGACAGCA ACAAGGACGG CGGCTTCGTC ACCTCCGGCT GGGAACTGTA CAAGGCCGAG ATCGGGTTGA TCGAGATCTT CGAACATCAC GGCATCCGGC TGCGGCTGTT CCACGGCCGC GGCGGCTCGG TCGGCCGCGG CGGCGGGCCG AGCTACGACG CCATCGTGGC GCAGCCCGGC GGCGCGGTGA ACGGCCAGAT CCGCATCACC GAGCAGGGCG AGATCATCAC CAGCAAATAT TCCAACCGCG AGGTCGGCCG CAACAATCTG GAGATCCTCA CCGCGGCGAC GCTGGAGGCG AGCCTGCTGC AGCCCAAGCG CGTCGCGCCG CAGCGCGACT ATCTCGACGC GATGGAGCAG CTCTCGGCGA TGGCCTTCAA GGCCTATCGC GGCCTGGTCT ACGAGACCGA CGGCTTCGTC GATTACTTCT GGGCCTCGAC GGTGATCACG GAGATCTCGA CGCTGAACAT CGGCAGCCGG CCGGCGTCGC GCAAGAAGAC CCGCGCGATC GAGGATCTGC GCGCGATCCC CTGGGTGTTC TCATGGGCGC AATGCCGGCT GATGCTGCCC GGCTGGTACG GCTTCGGCAG CGCGGTCGAG GCCTGGATCG CCGCGCATCC CGACAAGGGC GTCCCGTTCC TGCGATCGAT GTATCAGGAA TGGCCGTTCT TCCGCACGCT GCTGTCGAAC ATGGACATGG TGCTTTCGAA GAGCTCGCTC GGCATCGCCT CGCGCTACGC CGAACTGGTT CCCGACGAGA CGCTGCGGCG GGAGATCTTC GGCCGCATCC GCGCCGAATG GCACGCTTCG GTCGACGGCC TGCTGGCGAT CATGGGCCAC GACAAGCTGC TGCAGGGCAA CCCGCTACTC GACCGCTCGA TCCGCCACCG CTTCCCGTAT CTCGACCCGC TCAACCACGT CCAGGTGCAG TTACTCCGCG AGCACCGCAC GCATGATCCC GACGAGCAGA TTCTGCGCGG CATTCAGCTG ACGATCAACG GGATCTCGGC GGGGCTGCGG AACAGCGGCT GA
|
Protein sequence | MSSMILPTEP EALPNRADDS AALEAETRLR NDIRLLGRIL GDTVRDQEGA AVFDLVEGIR QTSIRFHRDD DTTARRELEA ILDGMSASDT VKIVRAFSYF SHLANIAEDQ NNIRQMRVGS TAGSAPRAGM LAKTLAHARA DGIGAAELRD FFKTALVSPV LTAHPTEVRR KSTMDREMQI AALLDERERV QLTPEEWEQN EEQLRRAVVT LWKTNLLRRT KLTVLDEVTN GLSFYDYTFL REVPRLHCAL EDQLGGGEGA EADAELASFL RMGSWIGGDR DGNPFVTAEV LHGTLQLQSA RVLRFYLDEL HELGSELSLA SHLVAISDEV RALAERSPDH SPHRRHEPYR LAVSGIYARL AATAAKLRID SIRAPVGEAE AYASVHDFKA DLDAIHRSLV AHNAGVIARG RLRQLRRAAD CFGFHLASLD MRQNSAVHER TIAELMNAAH PASAYLEIGE DARIALLTAE LRSARPLTSI FVKYSDETVG ELAVLHEAAQ AHATYGAAAI PQCIISMTKG VSDLLEVAVL LKEVGLIDPS GRSAINIVPL FETIEDLQAS SAIMDRLLGI PEYRRLVDSR GGVQEVMLGY SDSNKDGGFV TSGWELYKAE IGLIEIFEHH GIRLRLFHGR GGSVGRGGGP SYDAIVAQPG GAVNGQIRIT EQGEIITSKY SNREVGRNNL EILTAATLEA SLLQPKRVAP QRDYLDAMEQ LSAMAFKAYR GLVYETDGFV DYFWASTVIT EISTLNIGSR PASRKKTRAI EDLRAIPWVF SWAQCRLMLP GWYGFGSAVE AWIAAHPDKG VPFLRSMYQE WPFFRTLLSN MDMVLSKSSL GIASRYAELV PDETLRREIF GRIRAEWHAS VDGLLAIMGH DKLLQGNPLL DRSIRHRFPY LDPLNHVQVQ LLREHRTHDP DEQILRGIQL TINGISAGLR NSG
|
| |