Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_A5753 |
Symbol | |
ID | 3750983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007510 |
Strand | + |
Start bp | 2860526 |
End bp | 2863552 |
Gene Length | 3027 bp |
Protein Length | 1008 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637764071 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_369991 |
Protein GI | 78067222 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.61686 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGTCTT CCGGATCGGC GCGTACGGCG CGCCGCAATG CTGCCCTGTC CTCCTCCGAC GCCTCGACGG ACACCGTCGC CACCGCCGCG AACGGCCGTG CGAAAACGGC AACGAAACCG AAAGACCCGA TACGTCAGAC AAAACGCACG GCGAAAGCCG CCGGCCCGGC TGCCCGCACC GCGGCCCGCA CGGCCGCCGC GCCGAAGTCC GGCACGCGCA CGCGCGAAGA CAAGGACGGC CCGCTGTTCG ACGACATCCG CTTTCTCGGC CGCCTGCTCG GCGACGTCGT GCGTGAGCAG GAAGGCGACA CCGTGTTCGA CGTCGTCGAA ACGATCCGCC AGACCGCGGT CAAGTTCCGC CGCGAGGACG ACAGCGAAGC CGCGCAGACG CTCGAGAAGA AGCTGCGCAA GCTGACGCCG GAGCAGACGG TGAGCGTCGT GCGCGCGTTC AGCTATTTCT CGCATCTCGC GAATATCGCG GAAGACCGCC ACCACAATCG CCGCCGCCGC ATCCACGCGC TGGCCGGCTC CGCGTCGCAG CCCGGCACGG TCGCGTACGC GCTCGAACAA CTGAAGACGA CCGGCAACGC GTCGAAGCGC CTGCTGCAGC GCTTTTTCGA CGATGCGCTG ATCGTGCCGG TGCTGACCGC GCACCCGACC GAAGTGCAGC GCAAGAGCAT CCTCGACGCA CAGCACGACA TCGCGCGCCT GCTCGCCGAA CGCGACCAGG AACTGACCGG CCGCGAGCGC CAGTACAACG AATCGATGCT GCGCGCCCGC GTCACCGCGC TGTGGCAGAC CCGCATGCTG CGCGACGCGC GCCTGACGGT GGGCGACGAA ATCGAGAACG CGCTGTCGTA CTACCGCGCG ACGTTCCTCG ACGAGCTGCC CGCGCTGTAC GGCGACATCG AGGCCGCACT CGCCGAGCAC GGCCTGTCGG CGCGCGTACC CGCGTTCTTC CAGATGGGCA GCTGGATCGG CGGCGACCGC GACGGCAACC CGAACGTGAC CGCACCGACG CTCGAAGAAG CAATCAACCG CCAGGCCGCG GTGATCCTCG AGCACTATCT GGAACAGGTG CACAAGCTCG GCGCCGAGCT GTCGGTGTCG AACCTGCTCG TCGGCGCGAA CGACGCCGTG AAAGCGCTCG CAGCAGCGTC CCCCGACCAG TCGCCGCATC GCGTCGACGA GCCGTATCGC CGTGCGCTGA TCGGCATCTA CACGCGCCTC GCCGCAAGCG CGCGCGTGCG TCTCGGCGAA GGCACGGTGC CGGTGCGCAG CGCAGGCCGC GGCGCGGCGC CCGTGCGCGC GACCCCGTAT GCGGATTCCG AAGCGTTCGT CGCCGACCTG AAGGTGCTGA CCGCGTCGCT CGACGAACAC CACGGCACGT CGCTCGCCGC GCCGCGCCTC GCGCCGCTCG TACGCGCGGC CGAAGTGTTC GGCTTCCATC TCGCGAGCAT CGACCTGCGC CAGAGCTCCG ACATCCACGA AGCCGTAGTC GCCGAACTGT TCGCACGCGC GGGCGTCGAG GCCGACTACG CGGCGCTCGC CGAGGAAGAC AAGCTGCGCG TGCTGCTCGC CGCGCTCGCC GATCCGCGTC CGCTGCGCTC GCCGTACTTC GAATACTCGG CGCTCGCGCA GAGCGAACTC GGCGTGTTCG AGAAGGCGCG CGAAGTCCGC GCGCAATTCG GCGCACGCGC GGTGCGCAAC TACATCATTT CGCATACGGA AACCGTCAGC GACCTCGTCG AGGTGCTGCT GCTGCAGAAG GAGACGGGCC TGCTCGACGG CGCGCTCGGC GTGCCGGGCG GCGATGCGAA GAACAGCCTG ATGGTGATCC CGCTGTTCGA GACGATTCCC GACCTGCGCG ACGCCGCGCG CATCATGCGC GAATACTTCG CACTGCCGGG CATCGACGCG CTGATCGCGC ACCAGGGCGC CGAACAGGAA GTGATGCTCG GCTATTCGGA CAGCAACAAG GACGGCGGCT TCCTCACGTC GAACTGGGAG CTGTATCGCG CGGAACTCGC GCTCGTCGAC CTGTTCCGCG ACCGCAAGAT CACGCTGCGC CTGTTCCACG GCCGAGGCGG CACGGTCGGC CGTGGCGGCG GCCCGACCTA CCAGGCGATC CTGTCGCAGC CGCCGGGCAC CGTGAACGGC CAGATCCGCC TGACCGAACA GGGCGAGGTG ATCGCGAGCA AGTTCGCGAA CCCGGAGATC GGCCGCCGCA ACCTCGAGAC GGTCGTCGCC GCGACGCTCG AGGCGACGCT CCTGCCGCAG AACAACGCGC CCGCGCAGTT GCCGGCGTTC GAGGCCGCGA TGCAGACGCT GTCCGATTCG GCGATGGCCG CGTACCGCGC GCTCGTCTAT GAAACCCCGG GCTTCACCGA CTACTTCTTC TCGTCGACGC CGATCACCGA GATCGCCGAG CTGAACATCG GCAGCCGCCC GGCTTCGCGC AAGCTGCAGG ATCCGAAGCA GCGCAAGATC GAAGACCTGC GCGCGATTCC GTGGGGCTTC TCGTGGGGCC AGTGCCGGCT GCTGCTGACC GGCTGGTACG GCTTCGGCAG CGCGGTGAGT GCGTATCTCG ACGGCGCGCA GGACGACGCC GAGCGCACGA AGCGCGTCGC GCTGCTGAAG AAGATGAACA AGACCTGGCC GTTCTTCGCG AACCTGCTGT CGAACATGGA CATGGTGCTC GCGAAAACCG ACCTCGCGGT CGCGTCGCGC TACGCGCAGC TCGTTTCCGA TCGCAAGCTG CGCAAGCACG TGTTCGAGCG GATCGTCGCG GAATGGGAGC GCACGTCGCA GGCGCTGGCG GAAATCACCG GGCACGAAGG CCGCCTCGCG ACCAACCCGC TGCTCGCGCG CTCGATCAAG AACCGCTTCC CGTATCTCGA TCCGCTGAAC CACCTGCAAG TCGAGCTGAT CAAGCGTCAC CGCGCGGGCG ATACGAACGC GCGGCTGCGC CGCGGGATTC ACCTGACGAT CAACGGGATC GCGGCCGGCC TGCGCAACAC GGGCTGA
|
Protein sequence | MKSSGSARTA RRNAALSSSD ASTDTVATAA NGRAKTATKP KDPIRQTKRT AKAAGPAART AARTAAAPKS GTRTREDKDG PLFDDIRFLG RLLGDVVREQ EGDTVFDVVE TIRQTAVKFR REDDSEAAQT LEKKLRKLTP EQTVSVVRAF SYFSHLANIA EDRHHNRRRR IHALAGSASQ PGTVAYALEQ LKTTGNASKR LLQRFFDDAL IVPVLTAHPT EVQRKSILDA QHDIARLLAE RDQELTGRER QYNESMLRAR VTALWQTRML RDARLTVGDE IENALSYYRA TFLDELPALY GDIEAALAEH GLSARVPAFF QMGSWIGGDR DGNPNVTAPT LEEAINRQAA VILEHYLEQV HKLGAELSVS NLLVGANDAV KALAAASPDQ SPHRVDEPYR RALIGIYTRL AASARVRLGE GTVPVRSAGR GAAPVRATPY ADSEAFVADL KVLTASLDEH HGTSLAAPRL APLVRAAEVF GFHLASIDLR QSSDIHEAVV AELFARAGVE ADYAALAEED KLRVLLAALA DPRPLRSPYF EYSALAQSEL GVFEKAREVR AQFGARAVRN YIISHTETVS DLVEVLLLQK ETGLLDGALG VPGGDAKNSL MVIPLFETIP DLRDAARIMR EYFALPGIDA LIAHQGAEQE VMLGYSDSNK DGGFLTSNWE LYRAELALVD LFRDRKITLR LFHGRGGTVG RGGGPTYQAI LSQPPGTVNG QIRLTEQGEV IASKFANPEI GRRNLETVVA ATLEATLLPQ NNAPAQLPAF EAAMQTLSDS AMAAYRALVY ETPGFTDYFF SSTPITEIAE LNIGSRPASR KLQDPKQRKI EDLRAIPWGF SWGQCRLLLT GWYGFGSAVS AYLDGAQDDA ERTKRVALLK KMNKTWPFFA NLLSNMDMVL AKTDLAVASR YAQLVSDRKL RKHVFERIVA EWERTSQALA EITGHEGRLA TNPLLARSIK NRFPYLDPLN HLQVELIKRH RAGDTNARLR RGIHLTINGI AAGLRNTG
|
| |