Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A1024 |
Symbol | |
ID | 4888281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 989428 |
End bp | 990192 |
Gene Length | 765 bp |
Protein Length | 254 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640130964 |
Product | 4-hydroxyphenylacetate degradation bifunctional isomerase/decarboxylase, C-terminal subunit |
Protein accession | YP_001062023 |
Protein GI | 126443773 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | [TIGR02303] 4-hydroxyphenylacetate degradation bifunctional isomerase/decarboxylase, C-terminal subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGACCG CACGCGTAGT GTTCAACGGC GCGCTGCATG CGGCCGAGCC GGCGGGCGAC GCGCTCGTGC GGCTCGATAC CGGGCGCGTG CTCGCCGAGA CGGAAGTCTC GTGGCTGCCG CCCGTCGTGC CGCGCACGGC GTTCGCGCTC GGCCTGAACT ACGCCGATCA CGCGAAGGAG CTCGCGTTCA GCGCGCCGGC CGAGCCGCTC ATCTTCCTGA AAGGTCCGAA CACGTTCATC GGCCACCGCG CGCAAACGGT GCGGCCCGCC GACGCGACGC ACATGCACTA TGAGTGCGAG CTCGCGGTCG TGATCGGCCG CGAGGCGCGG CGCGTCACGC GCGCGCGGGC GCTCGACCAC GTGCTCGGCT ATACGATCGG CAACGATTAC GCGATCCGCG ATTATCTGGA GAATTTTTAC CGACCGAACC TGCGCGTGAA GAACCGCGAC ACGTGCACGC CGCTCGGCCC GTGGCTCGTG AGCCGCGACG AGGTCGGCGA CGCGTCGGAT CTCGCGCTGC GCACGACGGT CAACGGCGTC GAGACGCAGC GCGGCAACAC GCGCGACATG ATCTTCGACG TCGCGTCGCT GATCGAATAC ATCAGCGGAT TCATGACGCT CTCGCCGGGC GACCTGATCC TGACCGGCAC GCCCGAGGGG CTCGCCGACA CGCAGCCGGG CGACGAGGTC GTGACCGAGA TCGAGGGAAT CGGCCGGCTC GTCAACACGA TCGTCGGCGA AGCGGATTAC TATCGCGCCG GCTGA
|
Protein sequence | MRTARVVFNG ALHAAEPAGD ALVRLDTGRV LAETEVSWLP PVVPRTAFAL GLNYADHAKE LAFSAPAEPL IFLKGPNTFI GHRAQTVRPA DATHMHYECE LAVVIGREAR RVTRARALDH VLGYTIGNDY AIRDYLENFY RPNLRVKNRD TCTPLGPWLV SRDEVGDASD LALRTTVNGV ETQRGNTRDM IFDVASLIEY ISGFMTLSPG DLILTGTPEG LADTQPGDEV VTEIEGIGRL VNTIVGEADY YRAG
|
| |