Gene Bphyt_7040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBphyt_7040 
Symbol 
ID6280288 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia phytofirmans PsJN 
KingdomBacteria 
Replicon accessionNC_010676 
Strand
Start bp3422448 
End bp3423458 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content64% 
IMG OID642618063 
Producthopanoid-associated sugar epimerase 
Protein accessionYP_001890699 
Protein GI187921667 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID[TIGR03466] hopanoid-associated sugar epimerase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0000519767 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGAAC AAAATCGCGA TCTCGTACTC GTGACCGGCG CCTCCGGCTT TGTCGGCTCG 
TCGGTGGCGC GCATCGCGCA ACAGAAGGGT TTCAGGGTGC GCGTGCTGGT GCGCGCCACG
AGCCCGCGTC AGAACGTCGA GTCGCTGGAC GCGGAAATCG TGGTCGGCGA TATGCGCGAC
GAAGCGTCGA TGCGCAACGC ACTGCGCGGC GTGCGCTATC TGCTGCACGT GGCCGCCGAC
TATCGGCTGT GGGCGCCTGA CCCGAGCGAG ATCGAGCGCT CGAACCTCGA AGGCACCGAG
GCGACCATGC GCGCGGCGCT GAAGGAAGGC GTCGAGCGCA TGGTCTACAC GAGCAGTGTC
GCCACGTTGA AAGTGACCAG TTCCGGCCAG TCCGCCGACG AAACCTCGCC GCTCAAAGCC
GATCAGGCGA TCGGCGTGTA CAAGCGCAGC AAGGTGCTGG CCGAGCGCGC GGTGGAGCGG
ATGATCGCCG AAGACGGCCT GCCGGCGGTG ATCGTCAATC CGTCCACGCC AATCGGACCG
CGCGATGTCA AGCCGACGCC GACGGGACGC ATTATCGTGG AAGCGGCGCT CGGCAAGATT
CCGGCGTTCG TCGACACGGG CCTGAATCTC GTGCACGTGG ATGACGTCGC GACCGGCCAT
TTCCTCGCGC TCGAACGCGG CAAGATCGGC GAGCGTTATA TTCTCGGCGG CGAAAATCTG
CCGCTTCAAC AGATGCTCGC CGATATCGCG GCGCTGACCG GCCGCAAGGC GCCGACGCTG
AGCCTGCCGC GCTGGCCGCT GTATCCGCTG GCCATGGGCG CCGAAGCCGT CGCCAAGATC
ACGAAACGCG AACCGTTCGT CACCGTCGAC GGCTTGAAAA TGTCGAAGAA CAAGATGTAT
TTCTCTTCGG CAAAAGCGGA ACGCGAGCTC GGTTATCGCT CGCGGCCCTA TCGCGAGGGC
TTGAGCGACG CACTCGACTG GTTCAGACAA GCTGGCTATC TGAAGCCGTG A
 
Protein sequence
MTEQNRDLVL VTGASGFVGS SVARIAQQKG FRVRVLVRAT SPRQNVESLD AEIVVGDMRD 
EASMRNALRG VRYLLHVAAD YRLWAPDPSE IERSNLEGTE ATMRAALKEG VERMVYTSSV
ATLKVTSSGQ SADETSPLKA DQAIGVYKRS KVLAERAVER MIAEDGLPAV IVNPSTPIGP
RDVKPTPTGR IIVEAALGKI PAFVDTGLNL VHVDDVATGH FLALERGKIG ERYILGGENL
PLQQMLADIA ALTGRKAPTL SLPRWPLYPL AMGAEAVAKI TKREPFVTVD GLKMSKNKMY
FSSAKAEREL GYRSRPYREG LSDALDWFRQ AGYLKP