Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A3049 |
Symbol | |
ID | 4888917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 2895826 |
End bp | 2896833 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640132984 |
Product | NAD-dependent epimerase/dehydratase family protein |
Protein accession | YP_001064039 |
Protein GI | 126445581 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | [TIGR03466] hopanoid-associated sugar epimerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.138998 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGACA CACAACGCGA TCTCGTTCTG GTGACCGGCG CATCCGGTTT TGTCGGCTCC GCCGTCGCGC GCGCCGCGCG GCGGCAGGGC TATCGGGTGC GCGTGCTCGT GCGGCCGACG AGCCCGCGCA CGAACGTCGC GGATCTCGAC GCCGAGATCG CCACCGGCGA CATGCGCGAC GAGGCATCGA TGCGCGCCGC GCTGCGCGGC GTGCGCCATC TGCTGCACGT CGCCGCCGAC TACCGGCTGT GGGCGCCCGA TCCGCTCGAG ATCGAGCGCG CGAATCTCGA AGGCGCGGTC GCGACGATGC GCGCGGCGCT CGCCGAGGGC GTCGAGCGGA TCGTCTACAC GAGCAGCGTC GCCACGCTGA AGGTGACGCC GTCGGGCGCC TCGGCCGACG AATCGTCGCC GCTCGCCGCC GGGCAGGCGA TCGGCGTATA CAAGCGCAGC AAGGTGCTCG CCGAGCGCGC GGTCGAGCGG ATGATCGCCG AGGACGGGCT GCCCGCGGTG ATCGTCAATC CGTCGACGCC GATCGGCCCG CGCGACGTGA AGCCGACGCC GACCGGCCGG ATCATCGTCG AGGCGGCGCT CGGCAAGATC CCGGCGTTCG TCGACACGGG GCTGAACCTC GTGCACGTCG ACGACGTCGC GCAGGGCCAC CTGCTCGCGC TCGAGCGCGG GCGCATCGGC GAGCGCTACA TCCTCGGCGG CGAGAACCTG CCGCTGCAGG CGATGCTCGC CGACATCGCG CAATCGATGG GACGCAAGCC GCCGACGATC GCGCTGCCGC GCTGGCCGCT GTATCCGATC GCGCTCGGCG CGGAGGCGGT GGCGAAGCTG ACGAAGCGCG AGCCGTTCGT GACGGTCGAC GGGCTCAGGA TGTCGAAGAA CAAGATGTAT TTCACGTCCG CGAAAGCCGA GCGCGAGCTC GGCTACCGCG CGCGGCCGTA CCGCGAAGGC ATTCGCGACG CGCTCGACTG GTTCAGGCAG GCGGGCTATC TGCGCTGA
|
Protein sequence | MTDTQRDLVL VTGASGFVGS AVARAARRQG YRVRVLVRPT SPRTNVADLD AEIATGDMRD EASMRAALRG VRHLLHVAAD YRLWAPDPLE IERANLEGAV ATMRAALAEG VERIVYTSSV ATLKVTPSGA SADESSPLAA GQAIGVYKRS KVLAERAVER MIAEDGLPAV IVNPSTPIGP RDVKPTPTGR IIVEAALGKI PAFVDTGLNL VHVDDVAQGH LLALERGRIG ERYILGGENL PLQAMLADIA QSMGRKPPTI ALPRWPLYPI ALGAEAVAKL TKREPFVTVD GLRMSKNKMY FTSAKAEREL GYRARPYREG IRDALDWFRQ AGYLR
|
| |