Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_A1747 |
Symbol | |
ID | 3694134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007435 |
Strand | - |
Start bp | 2132304 |
End bp | 2133515 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637732001 |
Product | N-acylglucosamine 2-epimerase (GlcNAc 2-epimerase) family protein |
Protein accession | YP_336904 |
Protein GI | 76817426 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2942] N-acyl-D-glucosamine 2-epimerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTCTCCC ATGTTCGACG GCGGCGGCCC GCATGCTTTA TCCTTGCCGC TCGGCGCCGC GCGTGCGGCG CCCGCGTTCT TCCCCCGTCA ACCGCGCCGC CATCCGTCAT GTCCGCCCCC GTTTCCGTTT CCGACCAGGC CGCCCGATTG CGCCGCCACT TCGCGCAGAT CGTCTTGCCG ATCTGGCGCG GGCCCGGCTT CAATCCGGCG CTGCAACTGC CGTTCGAGGC CGTCGCGCCG GACACGCACG CGCCGCTGCC CGTCACCCGC TATCGCGCGA TGGCGTGCGC GCGCCAGTTG TTCATATTCT CGCAGGCGGG CGACGCACAG CACGCGCACG CGCTCTTTGC CGCATTGTGC CGTCACTTTC GCGATCCTCG CCACGACGGC TGGTTTTACA GTGTCGACGC GCAGGGCGCG CCGCTCGACC GCACGAAGGA CCTGTACACG CATGCGTTCG TCGTGTTCGC ATGCGCCGAG TATTTCGCGG CGTTCGGCAA CCGCGACGCG CGCGAGCTCA CGCAACGCAC GGCGGCGCTG ATCGTCGATC GCTTCGCGCC TCGGCCGGGC AGCGCGCTGC TCGATTCCGC ACGCGGCGAG GACTTCGCCG CGGCGGCGGG CGGCCCGTTG CAGAATCCGC TGATGCACCT GACCGAAGGC TGGCTCGCCG CCGGCCGCGC GTTCGGCGAC ACCGCGTTCG ACGACGCGCT GCTGCGCACC GCACAGGCGG TCGAGCGCAC GTTCGTCGAT CCGCACACCG GCTGCGTCGC GGAATTGCCG ATCGGCTGCG CGGACAACCG CTTCGAGCCC GGCCATCAGT TCGAGTGGTT CTATCTCGTC GCCTCGGCGG GCGCGCGGCT CGCGGCGACC GGCCTGCCCG ACGCGCTCGC GCGCGCGTAC GCGTTCGCGC AACGGCACGG CGTCGATCTG GACACGGGCG GCGTCAGCGC GGCGACCGAC GAGCGCGGCG CATGCGTCGA CGGCACGCAG CGGATCTGGG CGCAAACCGA ATATCTGCGC GCGCTCGCGA CGCATGGCGG CGAGCCGGAC GCGCTCGCGC GCCAGATCGC GCGCTTTGCC GAGCGGTTCC TGCATCCGCG CGGCTGGTAC GAATGCAAGA CTGCGCAGGG CGAGGTATCG CGCGCGGACA TGCCGTCGAC GACGCCGTAT CACCTCGCGA CCGCGTACGC TTCGTTGCCG GCGGGGACGT GA
|
Protein sequence | MLSHVRRRRP ACFILAARRR ACGARVLPPS TAPPSVMSAP VSVSDQAARL RRHFAQIVLP IWRGPGFNPA LQLPFEAVAP DTHAPLPVTR YRAMACARQL FIFSQAGDAQ HAHALFAALC RHFRDPRHDG WFYSVDAQGA PLDRTKDLYT HAFVVFACAE YFAAFGNRDA RELTQRTAAL IVDRFAPRPG SALLDSARGE DFAAAAGGPL QNPLMHLTEG WLAAGRAFGD TAFDDALLRT AQAVERTFVD PHTGCVAELP IGCADNRFEP GHQFEWFYLV ASAGARLAAT GLPDALARAY AFAQRHGVDL DTGGVSAATD ERGACVDGTQ RIWAQTEYLR ALATHGGEPD ALARQIARFA ERFLHPRGWY ECKTAQGEVS RADMPSTTPY HLATAYASLP AGT
|
| |