Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bphy_5925 |
Symbol | |
ID | 6247497 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phymatum STM815 |
Kingdom | Bacteria |
Replicon accession | NC_010625 |
Strand | - |
Start bp | 445490 |
End bp | 448330 |
Gene Length | 2841 bp |
Protein Length | 946 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642597633 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_001862035 |
Protein GI | 186470717 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.613814 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.197386 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCCCTC TCCATGCGGG CAGCCACACC GCCGTCGACG GGCTGCACAA GCTCGCCGCC GCGTCGGCGT CACTGCCCGC GCTCAGTGCG GATGAATACG AACACGCTGT GATCGAGCTG CTGTCGGAAC TGCTGCGCGA CATCGCACGC GCCCGCCAGC CCGAAGTGGA ACGCACGCTG CGCGGCGAGG CCGCGCACGA ATCGATGAGC GAGCTGATGC GCGAGCGGAT GGATGACCGC ACCGCGCGCG TCGTGCTGCG GCGCATGCTG CAGGCGCAGG GCATGTGGTT CCAGCTGCTC AGCATTGCGG AACAGAGCAC CGCGATGCGC CGGCGCCGCG AGATCGAGAT CGAAGGCGGC TACGACAGGC TGCCCGATTC GTTCGCGCGG GTGATCGCCG AAGTCGCGCA AGCGGGCGTC CCCGCCAGCG AAGTGCGCGA CGTGCTGGGC CATATGAAAG TGCGGCCCGT GCTGACGGCG CACCCGACGG AAGCGAAACG CGTCACCGTG CTCGAAATCC ACCGGCGCAT TTACCGGCGT CTGATGGAAC TCGAATCGCC TCGCTGGACG CCGCGCGAAC GGCACACGCT CGTGCTCGCG TTGAGAAACG ACATCGAGCT CTTGTGGATG AGCGGCGAAC TGCGGCTCGA AAAGCCCACG GTCGCGCAGG AGGTAGCGTG GGGGCTGCAC TTTTTCGGCG AGACGCTGTT CGAGGCCGTG CCGCTGCTAT TCGACAAGCT GGAGAGCGCG CTCGAGCGCC ACTATCCGGG CGAGCGCTTC GACATGCCGC GCTTTTTCCA GTTCGGGTCG TGGATAGGCG GCGACCGCGA CGGCAATCCA TTCGTCGACG ACAGCGTCAC GCGCGCCACT CTGCATGAGA ACCGGCTTGC GTGCCTGAAG CGTTACCGGC TGCGTCTGGT CGAACTGGCG CAGACGCTAA GCATCACATC CGAGGCGCTG CCCGTGCCGG ACAGTTTCCA CGAGGCGCTC GCGCGCGCGC TGATGGCGAG CGGCGAACCG GCGTCGATTG CGTCGCGCAA TCCGGGCGAA CTATTTCGCC AATACCTGAC CTGCATCCTG CGCCGGCTCG ACGCGTCGCT CGCCAACGCG AGCCGTCCCG GCGACGGCGC ACCCGTGCAG GGCGGCTACA CCAGTGCCGA CGAACTGGCT GCCGACCTGC TCGTCATCGA ACAGACGCTG CTCGCGACGG AAAGCGGCCA GCTCGCGCGC ATGTTGATCC GCCCGTTACG GCATGAAGTG GAGACGTTTC GCTTCAGCAC CGTGCAGCTC GATCTGCGTC AGAACACGAC CGTCATCGAG CAGGCGCTGC ACGGCTTGTG GCGCGCGACC TGCGGCACGT CGGGAGCGCC GCCCGCGAGC GACTCGCCCG AATGGAAGGC GTGGCTGCTG GGCGAACTGG CCCAGCCTTC GGACAGCGAA GCCGAACGCG AGCGCCGGTT CCAGTCTTTG CCGCCCGACG AAGCCCAGAC GCTGCAGATT TTCCGCACTG TTCGGGCGAT GCGTCAGCAG GTGGATCGCA ACGCATTCGG CGCGTTTATT CTCAGCATGA CGCATCGCGC GAGCGACGTG CTCGGCGTCT ATCTGCTCGC GAAGGAAGCC GGTCTCTTTT CCGACGCGGC GGGCACGGAA AGCTGCACGC TGCCCGTCGT GCCGCTGCTC GAAACCATCG ACGACCTGCG CCGCGCGCCC GACATCCTGC GCGAACTGCT GGCCGTGCCG ATGGTCCGCC GCAGCATCCG GGCGCAAGGC GGCGTGCAGG AAATCATGAT CGGCTATTCC GATTCGAACA AGGACGGCGG ATTCTTCGCG AGCAACTGGG AGCTGTCGAA GGCGCAGACG AAGATCCGGC GGCTCGGCGA CGAACTGGGC GTCGCCATCG CGTTCTTTCA CGGACGCGGC GGCTCCGTGA GCCGCGGCGG CATCGCTGCC GGCCGCGCCA TCGCCGCATT GCCCGCCGGT TCCGTCAACG GACGTTTTCG CGTGACCGAA CAGGGCGAAG TGGTGTCGTT CAAGTACGCG AATCGCGGCA CGGCGCAGTA TCACGTCGAG TTGCTGGCGT CGAGCGTGCT CGAACATACG CTCAAGTCCG AACGCGAAGA CGCATTACAG CCCAAGGGCG AATTCGACGA GGCGATGGAA GCGTTGTCGG GCGCGTCGCG GGCCGCGTAC GCAAAATTCA TCGAACAGCC GGGCATGCTC GCGTATTTCC AGGCGGCGAG CCCGCTCGAA GAACTGTCGA TGCTCAACAT GGGCTCGCGC CCTGCCCGCC GGTTCGGCGC GAAGAGTCTG CAGGACCTGC GCGCGATTCC GTGGGTTTTC GCATGGGCGC AGAACCGTCA TGCGCTGACC GGCTGGTATG GCGTCGGCAG CGCGATCGAC GGATTTCTCT GCGTGCGGCA GGAACGCGGA CTCGATCTGC TGCGGCGCAT GTTCCAGGAA TCGCGGCTCT TCAGGCTCGT CATCGACGAA GTCGAGAAGA CGCTCGCGCA GGTCAACCTG GAGATCGCGC GCGAATACGC GAACCTCGTT CCCGACGAGC AGATACGCGA CACCATCTTT ACGCAGATCG AAGCCGAATA CAGGCTGACC TTGAAGATGG TGCAGGCCGT CACGGGTTCG CCTGGGCCGG GCACGCGCTT TCCGAAGTTC AGTGCCCGTC TGCAGCGCCG GTTGCCAGCC ATCGACCTGA TCAGCCGCCA GCAGATCGAA CTGCTTCGCC TCTACCGTTC GGCGCAAACG GAACGGCAGC GGCGCGCGTA CCAGGTGCCG CTGCTGCTGT CCATCAACTG CATTGCATCG GGATTCGGCG CGACGGGCTG A
|
Protein sequence | MRPLHAGSHT AVDGLHKLAA ASASLPALSA DEYEHAVIEL LSELLRDIAR ARQPEVERTL RGEAAHESMS ELMRERMDDR TARVVLRRML QAQGMWFQLL SIAEQSTAMR RRREIEIEGG YDRLPDSFAR VIAEVAQAGV PASEVRDVLG HMKVRPVLTA HPTEAKRVTV LEIHRRIYRR LMELESPRWT PRERHTLVLA LRNDIELLWM SGELRLEKPT VAQEVAWGLH FFGETLFEAV PLLFDKLESA LERHYPGERF DMPRFFQFGS WIGGDRDGNP FVDDSVTRAT LHENRLACLK RYRLRLVELA QTLSITSEAL PVPDSFHEAL ARALMASGEP ASIASRNPGE LFRQYLTCIL RRLDASLANA SRPGDGAPVQ GGYTSADELA ADLLVIEQTL LATESGQLAR MLIRPLRHEV ETFRFSTVQL DLRQNTTVIE QALHGLWRAT CGTSGAPPAS DSPEWKAWLL GELAQPSDSE AERERRFQSL PPDEAQTLQI FRTVRAMRQQ VDRNAFGAFI LSMTHRASDV LGVYLLAKEA GLFSDAAGTE SCTLPVVPLL ETIDDLRRAP DILRELLAVP MVRRSIRAQG GVQEIMIGYS DSNKDGGFFA SNWELSKAQT KIRRLGDELG VAIAFFHGRG GSVSRGGIAA GRAIAALPAG SVNGRFRVTE QGEVVSFKYA NRGTAQYHVE LLASSVLEHT LKSEREDALQ PKGEFDEAME ALSGASRAAY AKFIEQPGML AYFQAASPLE ELSMLNMGSR PARRFGAKSL QDLRAIPWVF AWAQNRHALT GWYGVGSAID GFLCVRQERG LDLLRRMFQE SRLFRLVIDE VEKTLAQVNL EIAREYANLV PDEQIRDTIF TQIEAEYRLT LKMVQAVTGS PGPGTRFPKF SARLQRRLPA IDLISRQQIE LLRLYRSAQT ERQRRAYQVP LLLSINCIAS GFGATG
|
| |