Gene Bcep18194_A6152 details


Gene Information

Locus tag: Bcep18194_A6152
Symbol:
ID: 3751385
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia sp. 383
Kingdom: Bacteria
Replicon accession: NC_007510
Strand:
Start bp: 3297979
End bp: 3300561
Gene Length: 2583 bp
Protein Length: 860 aa
Translation table: 11
GC content: 71%
IMG OID: 637764473
Product: phosphoenolpyruvate--protein phosphotransferase
Protein accession: YP_370390
Protein GI: 78067621
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria)
        [COG1925] Phosphotransferase system, HPr-related proteins
        [COG2190] Phosphotransferase system IIA components
TIGRFAM ID: [TIGR00830] PTS system, glucose subfamily, IIA component
            [TIGR01003] Phosphotransferase System HPr (HPr) Family
            [TIGR01417] phosphoenolpyruvate-protein phosphotransferase
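
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record: the function names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the gene length includes a single terminal stop codon.

```python
# Values copied from the Gene Information fields above.
START_BP = 3297979
END_BP = 3300561
GENE_LENGTH_BP = 2583
PROTEIN_LENGTH_AA = 860


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the gene length
    includes exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the span from start to end covers 2583 bp, and 2583 bp corresponds to 861 codons, one of which is the stop codon.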


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.32892
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGACGCG CCGAGGAGTC CCAGTTGAAG AGCCCCACCC ACGATCAGAT CGTCCTGCTT 
GCCCCGTTGA CGGGGCCGAT CGTCGCGCTG GCCGACGTGC CCGATCCGGT GTTCTCCGGC
GGGATGTTCG GCGACGGCAT CGGCATCGAC CCGCTCGCGG GCCGGCTCGT CGCGCCGTGC
GCGGGCGTCG TGTCGCATCT CGCGCGCACG GGCCATGCCG TGACGATCAC GACGCCGCAC
GGCGCGGAAG TGCTGCTGCA CATCGGCATC GACACGGTCG AGCTGAACGG GCAGGGCTTC
ATCGCGCACG TCGAGGCTGG TGCACGCGTC GAGGCCGGCA CGCTGCTGAT CGAGTTCGAC
CAGGACGCGG TGGCGCGCAG CGCGCATTCG CTCGTGTCGG TGATCGCGAT CGCGAACTCG
GATGCGTTCG AGGTTGTCGA TCGCGCGAGC GGCTTCGCGA CGGCCGGCGA GACGCCGCTG
CTCGTGTTGC GCGGCAAGGG CGAAGCGGCC GTGCAGGCGG CACAAACCGG CGCGGCGGCG
CAGAACGAAG TGCGCCGTGA AATCGTGCTG GCACAACCGG GCGGCCTGCA TGCGCGGCCG
GCCGCGCGGG CACGTGAAGC CGTACGCGGG TTCGACGCGA CCGTCGACGT GCTGTTCGAC
GGCCGCAAGG CGTCGATCGC GAGCGTCGTC GGCCTGCTCG GCCTCGGCGC CGGCGAAGGC
GCGACGGTCG AACTGGTCGG CCGTGGCGCG CACGCGCAGC AGGCCGTCGA TGCAGTCGAG
CACGAGCTGC TGCGCGAAGC GCACGGTGAA GTCGAAGAAA AGCCGGCACG GCTGAAGTCG
CCCGCGCCGC AGATGGTCGC GCGCAACGTC GGCGTGCCGA TCGACCCGAA CACGCTCGCG
GGCGTGTGCG CGGCGCCCGG CATCGCGGTC GGCACGCTGG TGCGTCTCGA CGATGCGGAA
ATCGTGCCGC CCGAGCAGGC CTCCGGCACG CCGGCTGCGG AAAGCCGCCA GCTCGACCAG
GCGCTGAAGG CCGTCGACGG CGAGCTCGAC GAAACGGTGC GCAACGCATC GGCGCGTGGC
GCGGTCGGCG AAGCAGGCAT TTTCGCGGTG CACCGCGTGC TGCTGGAAGA CCCGACGCTG
ATCGACGCGG CGCGCGACCT GATCAGCCTC GGCAAGAGCG CGGGCTTCGC ATGGCGCGCG
ACGATCCGCA CGCAGATCGA CACGCTGTCG AAGCTTGACG ATGCGCTGCT CGCCGAACGT
GCGGCCGACC TGCGCGACAT CGAGAAGCGC GTGCTGCGCG CGCTCGGCCA CACGAACGGT
GCCGCGCGTG CGCTGCCCGA CGAAGCGGTG CTCGCGGCCG AGGAATTCAC GCCGTCCGAC
CTGTCGTCGC TCGATCGCCA GCGCGTGACC GCGCTCGTGA TGGCGCGCGG CGGCGCGACG
TCGCACGCGG CGATCATCGC GCGGCAGCTC GGCATTCCGG CGCTGGTTGC GGTCGGCGAT
GCGCTGTACG CGATTCCGGA CGGCACGCAG GTTGTCGTCG ATGCGAGCGC GGGCCGGCTC
GAACACGCGC CGACCGCGCT CGACGTCGAG CGCGCGCGGC ACGAGCGCCA GCGCCTGGAC
GGCGTGCGCG AGGCGAACCG GCAACTGGCC GGTGAAGCCG CCTCGACCGT CGACGGCCGC
GCGATCGAGG TCGCCGCGAA CATCGCGACG CTCGACGATG CGAACACCGC GGTCGACAAC
GGCGCCGATG CAGTCGGCCT GCTGCGCACC GAGCTGATGT TCATCCATCG CCAGGCCGCG
CCGACGGTCG TCGAACACCA GCAGAGCTAC CAGTCGATCG TCGACGCGTT GCAGGGCCGC
ACGGCGATCA TCCGCACGCT CGACGTCGGC GCCGACAAGG AAGTCGACTA CCTGACGCTG
CCGCCCGAAC CGAACCCGGC GCTCGGCCTG CGCGGCATCC GTCTCGCGCA GGTGCGCCCC
GATCTGCTCG ACGATCAATT GCAGGGCCTG CTCGCGGTGA AGCCGCTCGG CGCGGTGCGC
ATCCTGCTGC CGATGGTCAC CGATGCGGGT GAGCTCGTGC GGCTGCGCAA GCGCATCGAC
GAATTCGCCC GCGCGCAGGG CCGCACCGAG CCGATCGAAG TCGGCGTGAT GATCGAGGTG
CCGTCGGCCG CGCTGCTGGC CGACCAGCTC GCGCAGCACG CGGACTTCCT GTCGATCGGC
ACCAACGACC TGACGCAATA CACGCTCGCG ATGGACCGCT GCCAGGCCGA TCTCGCTGCG
CAATCCGACG GCCTGCATCC GGCCGTGCTG CGCCTGATCG ACATCGCGGT GCGCGGCGCC
GCGAAGCACG GCAAGTGGGT GGGCGTGTGC GGCGCGCTCG GCGGCGATCC GCTCGCGGTG
CCGGTGCTGG TGGGCCTCGG CGTGACCGAG CTGTCGGTCG ACCCCGTATC GGTGCCGGGC
ATCAAGGCGC GTGTGCGCCG TCTCGATTAC CAGTTGTGCC GTCAGCGCGC GCAGGATCTG
CTCGCGCTCG ATTCGGCACA GGCGGTAAGG GCAGCAAGCC GCGAGGTCTG GCCGCTCGAC
TGA
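
The GC content field reported for this gene can be recomputed from the sequence block above. The sketch below is illustrative only: `gc_content` is a hypothetical helper, and it assumes the sequence is pasted as shown, in space-separated blocks of 10 bases, so whitespace is stripped before counting.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces, newlines) is removed first, since the record
    prints the sequence in blocks of 10 bases.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First row of the gene sequence, copied from the record.
    first_row = ("ATGAGACGCG CCGAGGAGTC CCAGTTGAAG "
                 "AGCCCCACCC ACGATCAGAT CGTCCTGCTT")
    print(f"{gc_content(first_row):.1f}% GC in the first 60 bases")
```

Passing the full gene sequence instead of the first row should give a value comparable to the GC content field, which the record appears to report rounded to a whole percent.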
 
Protein sequence
MRRAEESQLK SPTHDQIVLL APLTGPIVAL ADVPDPVFSG GMFGDGIGID PLAGRLVAPC 
AGVVSHLART GHAVTITTPH GAEVLLHIGI DTVELNGQGF IAHVEAGARV EAGTLLIEFD
QDAVARSAHS LVSVIAIANS DAFEVVDRAS GFATAGETPL LVLRGKGEAA VQAAQTGAAA
QNEVRREIVL AQPGGLHARP AARAREAVRG FDATVDVLFD GRKASIASVV GLLGLGAGEG
ATVELVGRGA HAQQAVDAVE HELLREAHGE VEEKPARLKS PAPQMVARNV GVPIDPNTLA
GVCAAPGIAV GTLVRLDDAE IVPPEQASGT PAAESRQLDQ ALKAVDGELD ETVRNASARG
AVGEAGIFAV HRVLLEDPTL IDAARDLISL GKSAGFAWRA TIRTQIDTLS KLDDALLAER
AADLRDIEKR VLRALGHTNG AARALPDEAV LAAEEFTPSD LSSLDRQRVT ALVMARGGAT
SHAAIIARQL GIPALVAVGD ALYAIPDGTQ VVVDASAGRL EHAPTALDVE RARHERQRLD
GVREANRQLA GEAASTVDGR AIEVAANIAT LDDANTAVDN GADAVGLLRT ELMFIHRQAA
PTVVEHQQSY QSIVDALQGR TAIIRTLDVG ADKEVDYLTL PPEPNPALGL RGIRLAQVRP
DLLDDQLQGL LAVKPLGAVR ILLPMVTDAG ELVRLRKRID EFARAQGRTE PIEVGVMIEV
PSAALLADQL AQHADFLSIG TNDLTQYTLA MDRCQADLAA QSDGLHPAVL RLIDIAVRGA
AKHGKWVGVC GALGGDPLAV PVLVGLGVTE LSVDPVSVPG IKARVRRLDY QLCRQRAQDL
LALDSAQAVR AASREVWPLD