Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A2873 |
Symbol | |
ID | 4905188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 2813944 |
End bp | 2814792 |
Gene Length | 849 bp |
Protein Length | 282 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640145976 |
Product | fumarylacetoacetate hydrolase family protein |
Protein accession | YP_001076902 |
Protein GI | 126456940 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.148755 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTGC TTCGTTACGG CCCCGTGGGC CAGGAAAAGC CCGGCTTGCT CGACGCCGAC GGCCGCATTC GCGATCTGTC GTCGTTGATC GACGACGTCG CGGGCGGCGC GCTGTCGGAC GTGAGCCTTG CACGGCTTGC CGCGGCCGAT CCCGCGTCGC TGCCGCTCGT CGCCGGCGAG CCGCGGATCG GCGCGTGCGT CGGCCGGATC GGCAAGTTCG TCTGCATCGG CCTGAACTAC GCGGACCACG CGGCGGAGGC GGGCCTCGCG GTGCCCGCCG AGCCCGTCGT GTTCGGCAAG TGGACGAGCG CGGTGAGCGG CCCGCACGAC GGCATCGAGA TTCCGCGCGG CTCGGTGAAG ACGGACTGGG AGGTCGAACT GGGCGTCGTG ATCGGCCGCG CGTGCAAGAA CGTCGACGAG GCCGACGCGC TGTCGCACGT CGCCGGCTAT TGCGTCGTCA ACGACGTGTC CGAGCGCGAA TGGCAGATCG AGCGCGGCGG CCAGTGGGAC AAGGGCAAGG GCTTCGACAC GTTCGGGCCG ATCGGCCCGT GGCTCGTCAC GCGCGACGAA ATCGCCGATC CGCAGGCGCT CGACCTGTGG CTCGACGTGG ACGGCCGGCG CTATCAGAGC GGCAACACGC GCACGATGGT GTTCACCGTC GCGCGGTTGA TCGCGTATCT GTCGCGCTGC ATGAGCCTGC AGCCCGGCGA CGTGATCTCG ACGGGCACGC CGCCCGGCGT CGGGATGGGC GTGAAGCCGT CCCCCGTCTA TCTGAAGCCG GGGCAGACGG TGCGCTGCGG GGTGGCGGGC CTGGGCGAGC AGCAACAGCG AACGCGCGCG GCGGCGTGA
|
Protein sequence | MKLLRYGPVG QEKPGLLDAD GRIRDLSSLI DDVAGGALSD VSLARLAAAD PASLPLVAGE PRIGACVGRI GKFVCIGLNY ADHAAEAGLA VPAEPVVFGK WTSAVSGPHD GIEIPRGSVK TDWEVELGVV IGRACKNVDE ADALSHVAGY CVVNDVSERE WQIERGGQWD KGKGFDTFGP IGPWLVTRDE IADPQALDLW LDVDGRRYQS GNTRTMVFTV ARLIAYLSRC MSLQPGDVIS TGTPPGVGMG VKPSPVYLKP GQTVRCGVAG LGEQQQRTRA AA
|
| |