Gene Information |
Locus tag | Bcep18194_A6152 |
Symbol | |
ID | 3751385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007510 |
Strand | - |
Start bp | 3297979 |
End bp | 3300561 |
Gene Length | 2583 bp |
Protein Length | 860 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637764473 |
Product | phosphoenolpyruvate--protein phosphotransferase |
Protein accession | YP_370390 |
Protein GI | 78067621 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria); [COG1925] Phosphotransferase system, HPr-related proteins; [COG2190] Phosphotransferase system IIA components |
TIGRFAM ID | [TIGR00830] PTS system, glucose subfamily, IIA component; [TIGR01003] Phosphotransferase System HPr (HPr) Family; [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
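The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the Gene Length counts the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(length_bp: int, includes_stop: bool = True) -> int:
    """Residue count for a CDS, assuming the length is a whole number of codons."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    # Assumption: the stop codon is part of the CDS but yields no residue.
    return codons - 1 if includes_stop else codons


length = gene_length(3297979, 3300561)
print(length)                  # 2583
print(protein_length(length))  # 860
```

Under these assumptions the values reproduce the Gene Length (2583 bp) and Protein Length (860 aa) fields of the record.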
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.32892 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACGCG CCGAGGAGTC CCAGTTGAAG AGCCCCACCC ACGATCAGAT CGTCCTGCTT GCCCCGTTGA CGGGGCCGAT CGTCGCGCTG GCCGACGTGC CCGATCCGGT GTTCTCCGGC GGGATGTTCG GCGACGGCAT CGGCATCGAC CCGCTCGCGG GCCGGCTCGT CGCGCCGTGC GCGGGCGTCG TGTCGCATCT CGCGCGCACG GGCCATGCCG TGACGATCAC GACGCCGCAC GGCGCGGAAG TGCTGCTGCA CATCGGCATC GACACGGTCG AGCTGAACGG GCAGGGCTTC ATCGCGCACG TCGAGGCTGG TGCACGCGTC GAGGCCGGCA CGCTGCTGAT CGAGTTCGAC CAGGACGCGG TGGCGCGCAG CGCGCATTCG CTCGTGTCGG TGATCGCGAT CGCGAACTCG GATGCGTTCG AGGTTGTCGA TCGCGCGAGC GGCTTCGCGA CGGCCGGCGA GACGCCGCTG CTCGTGTTGC GCGGCAAGGG CGAAGCGGCC GTGCAGGCGG CACAAACCGG CGCGGCGGCG CAGAACGAAG TGCGCCGTGA AATCGTGCTG GCACAACCGG GCGGCCTGCA TGCGCGGCCG GCCGCGCGGG CACGTGAAGC CGTACGCGGG TTCGACGCGA CCGTCGACGT GCTGTTCGAC GGCCGCAAGG CGTCGATCGC GAGCGTCGTC GGCCTGCTCG GCCTCGGCGC CGGCGAAGGC GCGACGGTCG AACTGGTCGG CCGTGGCGCG CACGCGCAGC AGGCCGTCGA TGCAGTCGAG CACGAGCTGC TGCGCGAAGC GCACGGTGAA GTCGAAGAAA AGCCGGCACG GCTGAAGTCG CCCGCGCCGC AGATGGTCGC GCGCAACGTC GGCGTGCCGA TCGACCCGAA CACGCTCGCG GGCGTGTGCG CGGCGCCCGG CATCGCGGTC GGCACGCTGG TGCGTCTCGA CGATGCGGAA ATCGTGCCGC CCGAGCAGGC CTCCGGCACG CCGGCTGCGG AAAGCCGCCA GCTCGACCAG GCGCTGAAGG CCGTCGACGG CGAGCTCGAC GAAACGGTGC GCAACGCATC GGCGCGTGGC GCGGTCGGCG AAGCAGGCAT TTTCGCGGTG CACCGCGTGC TGCTGGAAGA CCCGACGCTG ATCGACGCGG CGCGCGACCT GATCAGCCTC GGCAAGAGCG CGGGCTTCGC ATGGCGCGCG ACGATCCGCA CGCAGATCGA CACGCTGTCG AAGCTTGACG ATGCGCTGCT CGCCGAACGT GCGGCCGACC TGCGCGACAT CGAGAAGCGC GTGCTGCGCG CGCTCGGCCA CACGAACGGT GCCGCGCGTG CGCTGCCCGA CGAAGCGGTG CTCGCGGCCG AGGAATTCAC GCCGTCCGAC CTGTCGTCGC TCGATCGCCA GCGCGTGACC GCGCTCGTGA TGGCGCGCGG CGGCGCGACG TCGCACGCGG CGATCATCGC GCGGCAGCTC GGCATTCCGG CGCTGGTTGC GGTCGGCGAT GCGCTGTACG CGATTCCGGA CGGCACGCAG GTTGTCGTCG ATGCGAGCGC GGGCCGGCTC GAACACGCGC CGACCGCGCT CGACGTCGAG CGCGCGCGGC ACGAGCGCCA GCGCCTGGAC GGCGTGCGCG AGGCGAACCG GCAACTGGCC GGTGAAGCCG CCTCGACCGT CGACGGCCGC GCGATCGAGG TCGCCGCGAA CATCGCGACG CTCGACGATG CGAACACCGC GGTCGACAAC GGCGCCGATG CAGTCGGCCT GCTGCGCACC GAGCTGATGT TCATCCATCG CCAGGCCGCG 
CCGACGGTCG TCGAACACCA GCAGAGCTAC CAGTCGATCG TCGACGCGTT GCAGGGCCGC ACGGCGATCA TCCGCACGCT CGACGTCGGC GCCGACAAGG AAGTCGACTA CCTGACGCTG CCGCCCGAAC CGAACCCGGC GCTCGGCCTG CGCGGCATCC GTCTCGCGCA GGTGCGCCCC GATCTGCTCG ACGATCAATT GCAGGGCCTG CTCGCGGTGA AGCCGCTCGG CGCGGTGCGC ATCCTGCTGC CGATGGTCAC CGATGCGGGT GAGCTCGTGC GGCTGCGCAA GCGCATCGAC GAATTCGCCC GCGCGCAGGG CCGCACCGAG CCGATCGAAG TCGGCGTGAT GATCGAGGTG CCGTCGGCCG CGCTGCTGGC CGACCAGCTC GCGCAGCACG CGGACTTCCT GTCGATCGGC ACCAACGACC TGACGCAATA CACGCTCGCG ATGGACCGCT GCCAGGCCGA TCTCGCTGCG CAATCCGACG GCCTGCATCC GGCCGTGCTG CGCCTGATCG ACATCGCGGT GCGCGGCGCC GCGAAGCACG GCAAGTGGGT GGGCGTGTGC GGCGCGCTCG GCGGCGATCC GCTCGCGGTG CCGGTGCTGG TGGGCCTCGG CGTGACCGAG CTGTCGGTCG ACCCCGTATC GGTGCCGGGC ATCAAGGCGC GTGTGCGCCG TCTCGATTAC CAGTTGTGCC GTCAGCGCGC GCAGGATCTG CTCGCGCTCG ATTCGGCACA GGCGGTAAGG GCAGCAAGCC GCGAGGTCTG GCCGCTCGAC TGA
|
Protein sequence | MRRAEESQLK SPTHDQIVLL APLTGPIVAL ADVPDPVFSG GMFGDGIGID PLAGRLVAPC AGVVSHLART GHAVTITTPH GAEVLLHIGI DTVELNGQGF IAHVEAGARV EAGTLLIEFD QDAVARSAHS LVSVIAIANS DAFEVVDRAS GFATAGETPL LVLRGKGEAA VQAAQTGAAA QNEVRREIVL AQPGGLHARP AARAREAVRG FDATVDVLFD GRKASIASVV GLLGLGAGEG ATVELVGRGA HAQQAVDAVE HELLREAHGE VEEKPARLKS PAPQMVARNV GVPIDPNTLA GVCAAPGIAV GTLVRLDDAE IVPPEQASGT PAAESRQLDQ ALKAVDGELD ETVRNASARG AVGEAGIFAV HRVLLEDPTL IDAARDLISL GKSAGFAWRA TIRTQIDTLS KLDDALLAER AADLRDIEKR VLRALGHTNG AARALPDEAV LAAEEFTPSD LSSLDRQRVT ALVMARGGAT SHAAIIARQL GIPALVAVGD ALYAIPDGTQ VVVDASAGRL EHAPTALDVE RARHERQRLD GVREANRQLA GEAASTVDGR AIEVAANIAT LDDANTAVDN GADAVGLLRT ELMFIHRQAA PTVVEHQQSY QSIVDALQGR TAIIRTLDVG ADKEVDYLTL PPEPNPALGL RGIRLAQVRP DLLDDQLQGL LAVKPLGAVR ILLPMVTDAG ELVRLRKRID EFARAQGRTE PIEVGVMIEV PSAALLADQL AQHADFLSIG TNDLTQYTLA MDRCQADLAA QSDGLHPAVL RLIDIAVRGA AKHGKWVGVC GALGGDPLAV PVLVGLGVTE LSVDPVSVPG IKARVRRLDY QLCRQRAQDL LALDSAQAVR AASREVWPLD
|
| |