Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A1160 |
Symbol | |
ID | 4888290 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 1103713 |
End bp | 1104555 |
Gene Length | 843 bp |
Protein Length | 280 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640131099 |
Product | 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase |
Protein accession | YP_001062157 |
Protein GI | 126442313 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTGC TGCGATACGG GGCGAAGTCC CGAGAAAAAC CCGGCCTGCT CGATGCGCAG GGGCGCATTC GCGATCTGTC CGGCGTCATC GACGACGTCG CGGGCGACGC GCTCGGCCCC GATGCGCTCG CGCGGCTGCG CGCGATCGAT CCGGCGAGCC TGCCGCTCGT CGACGGCGCG CCGCGCCTCG GCGCGTGCGT GGGCTGCGTC GGCAAGTTCG TCTGCATCGG GCTCAACTAT TCGGATCACG CGGCCGAATC CGGCATGGAC GTGCCGAGCG AGCCCGTCGT CTTCGGCAAG TGGACGAGCG CGATCTGCGG GCCCGACGAC GACGTCGAAC TCCCGCCCGG CTCGACGAAG ACCGATTGGG AAGTGGAGCT CGGCGTCGTG ATCGGCACGG GCGGGCGCGA CATCGACGAA GCGCGTGCGC TTGCGCACGT GGCCGGCTAT TGCATCGTCA ACGACGTATC CGAGCGCGCG TACCAGCTCG AGCGCGGCGG CACGTGGGAC AAGGGCAAGG GATGCGACAC GTTCGGCCCG CTCGGGCCCT GGCTCGTGAC GGCCGACGAA GTGCCGGACC CGCACCGGCT GAAGCTGTGG CTCGACGTCG ACGGCCGCCG CTATCAGCAT GGCTCGACCG CGACGATGAT CTTTCGCGTG CCGTTCCTGA TCAGCTACTT GAGCCGCTTC ATGAGCCTGC AGCCGGGCGA CGTGATCTCG ACCGGCACGC CGCCGGGCGT CGGGCTTGGT CAAAAGCCGC CCGTCTATCT GCGCGCGGGG CAGGTGATGA CAGTCGGCAT CGAAGGGCTC GGCGAGCAGC GGCAGCGGGT CGTGCAGCGA TGA
|
Protein sequence | MKLLRYGAKS REKPGLLDAQ GRIRDLSGVI DDVAGDALGP DALARLRAID PASLPLVDGA PRLGACVGCV GKFVCIGLNY SDHAAESGMD VPSEPVVFGK WTSAICGPDD DVELPPGSTK TDWEVELGVV IGTGGRDIDE ARALAHVAGY CIVNDVSERA YQLERGGTWD KGKGCDTFGP LGPWLVTADE VPDPHRLKLW LDVDGRRYQH GSTATMIFRV PFLISYLSRF MSLQPGDVIS TGTPPGVGLG QKPPVYLRAG QVMTVGIEGL GEQRQRVVQR
|
| |