Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A0305 |
Symbol | |
ID | 4904484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 293773 |
End bp | 294876 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640143412 |
Product | isomerase |
Protein accession | YP_001074348 |
Protein GI | 126457190 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2942] N-acyl-D-glucosamine 2-epimerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.882389 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGCCC CCGTTTCCGT TTCCGACCAG GCCGCCCGAT TGCGCCGCCA CTTCGCGCAG ATCGTCTTGC CGATCTGGCG CGGGCCCGGC TTCAATCCGG CGCTGCAACT GCCGTTCGAG GCCGTCGCGC CGGACACGCA CGTGCCGCTG CCCGTCACCC GCTATCGCGC GATGGCGTGC GCGCGCCAGT TGTTCATATT CTCGCAGGCG GGCGACGCGC AGCACGCGCA CGCGCTCTTT GCCGCATTGT GCCGTCACTT TCGCGATCCT CGCCACGACG GCTGGTTTTA CAGTGTCGAC GCGCAGGGCG CGCCGCTCGA CCGCACGAAG GACCTGTACA CGCATGCGTT CGTCGTGTTC GCATGCGCCG AGTATTTCGC GGCGTTCGGC AACCGCGACG CGCGCGAGCT CGCGCAACGC ACGGCGGCGC TGATCGTCGA TCGCTTCGCG CCTCGGCCGG GCAGCGCGCT GCTCGATTCC GCACGCGGCG AGGACTTCGC CGCGGCGGCG GGCGGCCCGT TGCAGAATCC GCTGATGCAC CTGACCGAAG GCTGGCTCGC CGCCGGCCGC GCGTTCGGCG ACACCGCGTT CGACGACGCG CTGCTGCGCA CCGCGCAGGC GGTCGAGCGC ACGTTCGTCG ATCCGCACAC CGGCTGCGTC GCGGAATTGC CGATCGGCTG CGCGGACAAC CGCTTCGAGC CCGGCCACCA GTTCGAGTGG TTCTATCTCG TCGCCTCGGC GGGCGCGCGG CTCGCGGCGA CCGGCCTGCC CGACGCGCTC GCGCGCGCAT ACGCGTTCGC GCAACGGCAC GGCGTCGATC CGGACACGGG CGGCGTCAGC GCGGCGACCG ACGAGCGCGG CGCATGCGTC GACGGCACGC AGCGGATCTG GGCGCAAACC GAATATCTGC GCGCGCTCGC GACGCATGGC GGCGAGCCGG ACGCGCTCGC GCGCCAGATC GCGCGCTTTG CCGAGCGGTT CCTGCATCCG CGCGGCTGGT ACGAATGCAA GACTGCGCAG GGCGAGGTAT CGCGCGCGGA CATGCCGTCG ACGACGCCGT ATCACCTCGC GACCGCGTAC GCTTCGTTGC CGGCGGGGAC GTGA
|
Protein sequence | MSAPVSVSDQ AARLRRHFAQ IVLPIWRGPG FNPALQLPFE AVAPDTHVPL PVTRYRAMAC ARQLFIFSQA GDAQHAHALF AALCRHFRDP RHDGWFYSVD AQGAPLDRTK DLYTHAFVVF ACAEYFAAFG NRDARELAQR TAALIVDRFA PRPGSALLDS ARGEDFAAAA GGPLQNPLMH LTEGWLAAGR AFGDTAFDDA LLRTAQAVER TFVDPHTGCV AELPIGCADN RFEPGHQFEW FYLVASAGAR LAATGLPDAL ARAYAFAQRH GVDPDTGGVS AATDERGACV DGTQRIWAQT EYLRALATHG GEPDALARQI ARFAERFLHP RGWYECKTAQ GEVSRADMPS TTPYHLATAY ASLPAGT
|
| |