Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_1075 |
Symbol | ppc |
ID | 4900860 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 1055347 |
End bp | 1058421 |
Gene Length | 3075 bp |
Protein Length | 1024 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640134305 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_001065355 |
Protein GI | 126451534 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGGCATG CGCCGCCGCG CCGCCTGATC GCGCCCGCTG TTGCTGGACC GCAGTTCCCG TTCACTCGTG CTTTCCCAAG GAAATCGATC GTGAAGTCTT CCGGATCGGC GCGCGCGACG CGCCGCAATG CTGTCTCGTC CTCTTCCGCC CCGGCGCACG CCGAGCCCCC CGCCCGCCGC GCCGCGAAAC CCGCACGCAA GCTCGACGGC GCCGCCGCGC GCCCGCTCGC GCCGACGAAC GCCGCATCCG CGAAACCGCA AGGCCGCACG CGCGAAGACA AGGACCGCCC GCTCTTCGAG GACATTCGCT ATCTCGGCCG CCTGCTCGGC GACGTCGTTC GCGAACAGGA AGGCGACGCC GTGTTCGACG TCGTCGAGAC GATTCGCCAG ACCGCGGTCA AGTTCCGCCG CGAGGACGAC AAGGCCGCCG CGCAGACGCT CGAGAAAATG CTGCGCAAGC TCACGCCCGA GCAGACGGTG AGCGTCGTGC GCGCGTTCAG CTATTTCTCG CACCTCGCGA ACATCGCCGA GGACCGCCAT CACAACCGCC GCCGCCGCAT CCACGCGCTC GCGGGCTCCG CGGCGCAGGC GGGCACCGTC GCGTACGCGC TCGACAAGCT CAAGCAGGCG GGCGACGCGT CGTCGAAGAC GATCAAGCAG TTCTTCGAAG GCGCGCTGAT CGTGCCCGTG CTCACCGCGC ACCCGACCGA GGTGCAGCGC AAGAGCATTC TCGACGCGCA GCACGACATC GCGCGGCTGC TCGCCGAGCG CGACCAGCCG CTGACCGCGC GCGAGCTCGC GCACAACGAG GCGCTGCTGC GCGCGCGCGT GACGACGCTC TGGCAGACCC GGATGCTGCG CGACGCGCGC CTGACCGTCG CCGATGAGAT CGAGAACGCG CTGTCGTACT ACCGCGCGAC GTTCCTCGAC GAGCTGCCCG CGCTCTACGC GGACATCGAG GAGGCGCTCG CCGAGCACGG CCTGCGCGCG CGCGTGCCGG CGTTCTTCCA GATGGGCAGT TGGATCGGCG GCGACCGCGA CGGCAACCCG AACGTCACCG CCGCGACGCT CGACGAGGCG ATCAGCCGCC AGGCGGCGGT GATCTTCGAG CATTACCTCG AACAGGTGCA CAAGCTCGGC GCGGAGCTGT CCGTGTCGAA CCTGCTCGTC GGCGCGAGCG ACGCGCTCAA GGCGCTCGCC GCCGCGTCGC CGGACCAGTC GCCGCACCGC GTCGACGAGC CGTACCGCCG CGCGCTGATC GGCGTCTACA CGCGGCTCGC GGCCAGCGCG CGCGTGCGGC TCGGCGAGGG CACGGTGCCC GTGCGCAGCG CGGGCCGCGG CGCCGCGCCC GTGCGCGCGA CGCCGTACGC GGACGCGGAG GAGTTCGCCG CCGATCTGCG CGTGCTGACC GATTCGCTCG CGCTGCATCA CGGCGAATCG CTCGCGACGC CGCGCCTCGC GCCGCTCATG CGCGCGGCCG AGGTGTTCGG CTTCCATCTC GCGAGCATCG ATTTGCGGCA GAGCTCGGAC ATCCATGAAG CGGTGGTCGC CGAACTGCTC GCGCGCGGCG GCGTCGAGGC CGACTACGCG GCGCTGCCCG AAGCGGACAA GCTGCGCGTG CTGCTCGCGG CGCTCGCGGA CCCGCGGCCG CTGCGCTCGC CGTATCTCGA CTACTCGGAC CTCGCGAAGA GCGAGCTCGG CGTGCTCGAG CGCGCGCACG CGATCCGCGC GCAGTTCGGC GCGCGCGCGG TGCGCAACTA CATCATTTCG CATACCGAGA CAGTGAGCGA TCTCGTCGAG GTGCTGCTGC TGCAGAAGGA AACGGGCCTC TTCGAGGGCA CGCTCGGCAC GCCGCACGCG AACGCGCGCA ACGGCCTGAT GGTGATTCCG CTCTTCGAGA CGATCGCCGA CCTGCGCAAC GCGTCCGACA TCATGCGCGC GTTCTTCGCG CTGCCGGGCG TGGGCGAGCT GCTCGCGCAC CAGGGCCACG AGCAGGAAGT GATGCTCGGC TATTCGGACA GCAACAAGGA CGGCGGCTTC CTCACGTCGA ACTGGGAGCT CTATCGCGCG GAACTGGCGC TCGTCGATCT GTTCGACGAG CGCGGGATCA AGCTGCGCCT GTTCCACGGC CGCGGCGGCA CGGTGGGACG CGGCGGCGGC CCGACCTATC AGGCGATCCT GTCGCAGCCG CCCGGCACGG TAAACGGCCA GATCCGGCTC ACCGAGCAGG GCGAGGTGAT CGCGAGCAAG TTCGCGAACC CGGAGATCGG CCGGCGCAAT CTGGAGACGG TCGTCGCCGC GACGCTCGAG GCGACGCTCG CGCCGCACAG CAACGCGCCG AAGCAGTTGC CCGCGTTCGA GGCGGCGATG CAGACGCTGT CGGACGCGGC GATGGCGTCG TACCGCGCGC TCGTCTACGA GACGCCCGGC TTCACCGACT ACTTCTTCTC GTCGACGCCG ATCACCGAGA TCGCCGAGCT GAACATCGGC AGCCGGCCCG CGTCGCGCAA GCTGCAGGAT CCGAAGAACC GCAAGATCGA GGACCTGCGC GCGATTCCGT GGGGCTTCTC ATGGGGCCAG TGCCGGCTGC TGCTCACCGG CTGGTACGGC TTCGGCAGCG CGGTCGCCGC GTATCTCGAC GGCGCGCCGG ACGCGGCCGA GCGCGGCAAG CGCGTCGCGC TGCTGAAGAA AATGAACAAG ACCTGGCCGT TCTTCGCGAA CCTGCTGTCG AACATGGACA TGGTGCTCGC GAAGACCGAT CTCGCGGTTG CGTCGCGCTA CGCGCAGCTC GTCGCCGACA AGAAGCTGCG CAAGCACGTG TTCGAGCGGA TCGTCGCCGA ATGGCATCGC ACGGCGGATG CGCTCGCCGA GATCACCGGC GCGCACGCGC GGCTCGCCGC GAATCCGCTT CTCGCGCGCT CGATCAAGAA CCGCTTCCCG TACCTCGATC CGCTGAACCA CCTGCAAGTC GAGCTGATCA AGCGGCACCG CGCGGGCGAC ACGAACGCGC GGCTGCGGCG CGGGATCCAT CTGACGATCA ACGGGATCGC GGCCGGCCTG CGCAATACGG GCTGA
|
Protein sequence | MRHAPPRRLI APAVAGPQFP FTRAFPRKSI VKSSGSARAT RRNAVSSSSA PAHAEPPARR AAKPARKLDG AAARPLAPTN AASAKPQGRT REDKDRPLFE DIRYLGRLLG DVVREQEGDA VFDVVETIRQ TAVKFRREDD KAAAQTLEKM LRKLTPEQTV SVVRAFSYFS HLANIAEDRH HNRRRRIHAL AGSAAQAGTV AYALDKLKQA GDASSKTIKQ FFEGALIVPV LTAHPTEVQR KSILDAQHDI ARLLAERDQP LTARELAHNE ALLRARVTTL WQTRMLRDAR LTVADEIENA LSYYRATFLD ELPALYADIE EALAEHGLRA RVPAFFQMGS WIGGDRDGNP NVTAATLDEA ISRQAAVIFE HYLEQVHKLG AELSVSNLLV GASDALKALA AASPDQSPHR VDEPYRRALI GVYTRLAASA RVRLGEGTVP VRSAGRGAAP VRATPYADAE EFAADLRVLT DSLALHHGES LATPRLAPLM RAAEVFGFHL ASIDLRQSSD IHEAVVAELL ARGGVEADYA ALPEADKLRV LLAALADPRP LRSPYLDYSD LAKSELGVLE RAHAIRAQFG ARAVRNYIIS HTETVSDLVE VLLLQKETGL FEGTLGTPHA NARNGLMVIP LFETIADLRN ASDIMRAFFA LPGVGELLAH QGHEQEVMLG YSDSNKDGGF LTSNWELYRA ELALVDLFDE RGIKLRLFHG RGGTVGRGGG PTYQAILSQP PGTVNGQIRL TEQGEVIASK FANPEIGRRN LETVVAATLE ATLAPHSNAP KQLPAFEAAM QTLSDAAMAS YRALVYETPG FTDYFFSSTP ITEIAELNIG SRPASRKLQD PKNRKIEDLR AIPWGFSWGQ CRLLLTGWYG FGSAVAAYLD GAPDAAERGK RVALLKKMNK TWPFFANLLS NMDMVLAKTD LAVASRYAQL VADKKLRKHV FERIVAEWHR TADALAEITG AHARLAANPL LARSIKNRFP YLDPLNHLQV ELIKRHRAGD TNARLRRGIH LTINGIAAGL RNTG
|
| |