Gene BURPS1710b_A1747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A1747 
Symbol 
ID3694134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp2132304 
End bp2133515 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content71% 
IMG OID637732001 
ProductN-acylglucosamine 2-epimerase (GlcNAc 2-epimerase) family protein 
Protein accessionYP_336904 
Protein GI76817426 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2942] N-acyl-D-glucosamine 2-epimerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTCTCCC ATGTTCGACG GCGGCGGCCC GCATGCTTTA TCCTTGCCGC TCGGCGCCGC 
GCGTGCGGCG CCCGCGTTCT TCCCCCGTCA ACCGCGCCGC CATCCGTCAT GTCCGCCCCC
GTTTCCGTTT CCGACCAGGC CGCCCGATTG CGCCGCCACT TCGCGCAGAT CGTCTTGCCG
ATCTGGCGCG GGCCCGGCTT CAATCCGGCG CTGCAACTGC CGTTCGAGGC CGTCGCGCCG
GACACGCACG CGCCGCTGCC CGTCACCCGC TATCGCGCGA TGGCGTGCGC GCGCCAGTTG
TTCATATTCT CGCAGGCGGG CGACGCACAG CACGCGCACG CGCTCTTTGC CGCATTGTGC
CGTCACTTTC GCGATCCTCG CCACGACGGC TGGTTTTACA GTGTCGACGC GCAGGGCGCG
CCGCTCGACC GCACGAAGGA CCTGTACACG CATGCGTTCG TCGTGTTCGC ATGCGCCGAG
TATTTCGCGG CGTTCGGCAA CCGCGACGCG CGCGAGCTCA CGCAACGCAC GGCGGCGCTG
ATCGTCGATC GCTTCGCGCC TCGGCCGGGC AGCGCGCTGC TCGATTCCGC ACGCGGCGAG
GACTTCGCCG CGGCGGCGGG CGGCCCGTTG CAGAATCCGC TGATGCACCT GACCGAAGGC
TGGCTCGCCG CCGGCCGCGC GTTCGGCGAC ACCGCGTTCG ACGACGCGCT GCTGCGCACC
GCACAGGCGG TCGAGCGCAC GTTCGTCGAT CCGCACACCG GCTGCGTCGC GGAATTGCCG
ATCGGCTGCG CGGACAACCG CTTCGAGCCC GGCCATCAGT TCGAGTGGTT CTATCTCGTC
GCCTCGGCGG GCGCGCGGCT CGCGGCGACC GGCCTGCCCG ACGCGCTCGC GCGCGCGTAC
GCGTTCGCGC AACGGCACGG CGTCGATCTG GACACGGGCG GCGTCAGCGC GGCGACCGAC
GAGCGCGGCG CATGCGTCGA CGGCACGCAG CGGATCTGGG CGCAAACCGA ATATCTGCGC
GCGCTCGCGA CGCATGGCGG CGAGCCGGAC GCGCTCGCGC GCCAGATCGC GCGCTTTGCC
GAGCGGTTCC TGCATCCGCG CGGCTGGTAC GAATGCAAGA CTGCGCAGGG CGAGGTATCG
CGCGCGGACA TGCCGTCGAC GACGCCGTAT CACCTCGCGA CCGCGTACGC TTCGTTGCCG
GCGGGGACGT GA
 
Protein sequence
MLSHVRRRRP ACFILAARRR ACGARVLPPS TAPPSVMSAP VSVSDQAARL RRHFAQIVLP 
IWRGPGFNPA LQLPFEAVAP DTHAPLPVTR YRAMACARQL FIFSQAGDAQ HAHALFAALC
RHFRDPRHDG WFYSVDAQGA PLDRTKDLYT HAFVVFACAE YFAAFGNRDA RELTQRTAAL
IVDRFAPRPG SALLDSARGE DFAAAAGGPL QNPLMHLTEG WLAAGRAFGD TAFDDALLRT
AQAVERTFVD PHTGCVAELP IGCADNRFEP GHQFEWFYLV ASAGARLAAT GLPDALARAY
AFAQRHGVDL DTGGVSAATD ERGACVDGTQ RIWAQTEYLR ALATHGGEPD ALARQIARFA
ERFLHPRGWY ECKTAQGEVS RADMPSTTPY HLATAYASLP AGT