Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1876 |
Symbol | |
ID | 4022358 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 2102526 |
End bp | 2105327 |
Gene Length | 2802 bp |
Protein Length | 933 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637962069 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_569012 |
Protein GI | 91976353 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.209652 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTCCA CGATCCTGCC GACCGACCCC GACGTTCTGC CGAACCGGGC GGACGACAGC GCTTTTATCG AAGAGGACAC CAGGCTTCGC AATGACATAC GGCTGTTGGG CCGGATACTT GGCGATACCG TGCGCGACCA GGAGGGCTCT GCGGTGTTCG ACTTGGTCGA ACGCATCCGG CAGACCTCCA TCCGGTTTCA CCGGGACGAG GACAAGCCGG CGCGACGCGA ACTCGAAGCG ATCCTCGACG ACATGTCGGC CTCGGACACG GTGAAGATCG TCCGGGCATT CAGCTACTTT TCGCATCTCG CCAACATCGC CGAAGATCAA AACAACATCC GCCAGATGCG CGCCGGCTCG ACGGCAGGCT CGGCGCCTCG CGCGGGGATG CTGGCCAAGA CCCTGGCGCA TGCGCGCGAG GAAGGGATCG GCGCGCGCGA GCTGCGCGAA TTCTTCAAGA CCGCGCTGGT CAGCCCGGTG CTGACGGCGC ATCCGACCGA GGTGCGCCGC AAGAGCACGA TGGACCGCGA GATGGAGGTC GCAGCGCTGC TCGATCAACG CGAGCGCCTC CAGCTCACCG CTGACGAGTG GGCGCAGAAC GAGGAGCAGC TTCGTCGCGC GGTGGTGACG CTGTGGAAAA CCAACCTGCT GCGCCGGACC AAGCTGACGG TGCTCGACGA AGTCGCCAAC GGCCTGTCGT TCTATGACTA CACCTTCTTG CGCGAAGTGC CGCGGCTGCA CAGCGCGTTG GAGGATCAGC TCGGCGGCGG CGAGGGCGGC GAAGCGGAAG AGGAGCTGGC GAGTTTCCTG CGGATGGGAA GCTGGATCGG CGGCGATCGC GACGGCAATC CGTTCGTCAC AGCCGAGGTG CTGCAGGGGA CGCTGCGGTT GCAGAGCGCG CGCGTGCTGC GGTTCTATCT CGACGAGCTG CACGAGCTCG GCTCGGAGCT GTCGCTGGCG TCGCATCTCG CCCCGATCAC CGAAGACGTC CGTTTGCTAG CTGAACGTTC GCCCGATCAT TCGCCGCATC GCCGCCACGA GCCCTATCGG CTGGCGGTGT CAGGCATCTA TGCGAGGCTC GCCGCCACCG CGGCGAAGCT GAAGATCGAC AGCGTCCGCG CCCCGGTCGG CGAGGCCGAG ATTTACGCCA ACGTGCAGGA GTTCAAAGCC GACCTCGATG CGATCCATTA CTCGCTGACC AAATATAATG CCGGCGTGAT CGCGCGCGGC AGACTCCGGC AGCTGCGCCG CGCCGCCGAC TGTTTCGGCT TTCATCTCGC CAGCCTCGAC ATGCGGCAAA ACTCAGCGGT GCACGAGCGC ACTATGGGCG AGCTGATGGA CGCGGCGCGG CCGACCAGCT CCTATCTGGC GCTCGATGAG GACGAACGCA TCGCGCTGCT CACCGGCGAA CTGCGCAGCG CCCGGCCGCT GACCTCGATT TTCATCAAGT ATAGCGACGA GACCGTCGGC GAACTCGCGG TGCTGCACGA GGCGGCGCGG GCGCACTCCA TTTATGGCGA GGCGGCGATT CCCCAGTGCA TCATCTCGAT GACCAAGGGC GTGTCCGATC TGCTCGAGGT CGCGGTGCTG CTCAAGGAAG TCGGGCTGAT CGATCCATCG GGCCGTTGCG CGATCAACAT CGTGCCGCTG TTCGAGACCA TCGAAGACCT GCAGGCCTGC GCCGCGATCA TGGACCGGCT GCTGGCGATC CCGGAATATC GCCGCCTGGT CGACAGCCGC GGCGGCGTGC AGGAGGTGAT GCTCGGTTAC TCCGACAGCA ACAAGGACGG CGGCTTCGTC ACATCCGGTT GGGAGCTCTA CAAGGCCGAG ATCGGCTTGC TCGACGTGTT CGAGCACCAT GGCGTTCGGC TGCGGCTGTT CCACGGCCGC GGCGGTTCGG TCGGCCGCGG CGGCGGCCCG AGCTACGACG CGATCGTAGC GCAGCCGGGC GGCGCGGTGA ACGGCCAGAT TCGCATCACC GAGCAGGGCG AGATCATCAC CAGTAAATAT TCCAACCGCG AGGTCGGTCG CAACAATCTG GAAATCCTCA CCGCGGCGAC GCTGGAGGCG AGCCTGCTGC AGCCGCGCCG CAGCGCGCCA CACCACGACT ATCTGGAAGC GATGGAGCAG CTCTCGGCGC TGGCCTTCAA GGCGTATCGC GGACTGGTCT ACGAGACCGA CGGCTTCGTC GACTACTTCT GGTCGTCGAC TGTGATCAAC GAGATCTCGA CGCTGAACAT CGGCAGCCGC CCGGCATCGC GCAAGAAGAC CCGCGCGATC GAGGATCTGC GCGCCATCCC CTGGGTGTTC TCTTGGGCGC AATGCCGGCT GATGCTGCCG GGCTGGTACG GCTTCGGCAG CGCGGTCGAG GCCTGGGTCG CCGAGCATCC CGACAAGGGC ACTGCGTTCC TGCAGTCGAT GTATCAGGAG TGGCCGTTCT TCCGCATGCT GCTCTCCAAC ATGGACATGG TGCTGTCGAA GAGTTCGATC GCGATCGCCT CGCGCTACGC CGATCTGGTG CCGGATGAGG AGTTGCGGCA CAAGATCTTC GGCCGCATCC GCATCGAATG GCATGCCTCG GTCGATAGCC TGCTCGCGAT CATGGGCCAC GAACGGCTGC TGCAGGGCAA CCCGCTGCTG GAGCGCTCGA TCCGCCACCG CTTCCCGTAT CTCGACCCGC TTAACCACGT CCAGGTGCAG CTCCTGCGCG AGCACCGCAC CCACGATCCC GACGAGCAGG TGCTGCGCGG GATTCAGCTG ACCATCAACG GAATTTCGGC GGGGCTGCGG AATAGCGGGT GA
|
Protein sequence | MSSTILPTDP DVLPNRADDS AFIEEDTRLR NDIRLLGRIL GDTVRDQEGS AVFDLVERIR QTSIRFHRDE DKPARRELEA ILDDMSASDT VKIVRAFSYF SHLANIAEDQ NNIRQMRAGS TAGSAPRAGM LAKTLAHARE EGIGARELRE FFKTALVSPV LTAHPTEVRR KSTMDREMEV AALLDQRERL QLTADEWAQN EEQLRRAVVT LWKTNLLRRT KLTVLDEVAN GLSFYDYTFL REVPRLHSAL EDQLGGGEGG EAEEELASFL RMGSWIGGDR DGNPFVTAEV LQGTLRLQSA RVLRFYLDEL HELGSELSLA SHLAPITEDV RLLAERSPDH SPHRRHEPYR LAVSGIYARL AATAAKLKID SVRAPVGEAE IYANVQEFKA DLDAIHYSLT KYNAGVIARG RLRQLRRAAD CFGFHLASLD MRQNSAVHER TMGELMDAAR PTSSYLALDE DERIALLTGE LRSARPLTSI FIKYSDETVG ELAVLHEAAR AHSIYGEAAI PQCIISMTKG VSDLLEVAVL LKEVGLIDPS GRCAINIVPL FETIEDLQAC AAIMDRLLAI PEYRRLVDSR GGVQEVMLGY SDSNKDGGFV TSGWELYKAE IGLLDVFEHH GVRLRLFHGR GGSVGRGGGP SYDAIVAQPG GAVNGQIRIT EQGEIITSKY SNREVGRNNL EILTAATLEA SLLQPRRSAP HHDYLEAMEQ LSALAFKAYR GLVYETDGFV DYFWSSTVIN EISTLNIGSR PASRKKTRAI EDLRAIPWVF SWAQCRLMLP GWYGFGSAVE AWVAEHPDKG TAFLQSMYQE WPFFRMLLSN MDMVLSKSSI AIASRYADLV PDEELRHKIF GRIRIEWHAS VDSLLAIMGH ERLLQGNPLL ERSIRHRFPY LDPLNHVQVQ LLREHRTHDP DEQVLRGIQL TINGISAGLR NSG
|
| |