Gene Information |
Locus tag | BURPS1106A_A1083 |
Symbol | |
ID | 4904029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 1041212 |
End bp | 1042054 |
Gene Length | 843 bp |
Protein Length | 280 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640144189 |
Product | fumarylacetoacetate hydrolase family protein |
Protein accession | YP_001075118 |
Protein GI | 126456513 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
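The coordinate and length fields in the table above can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS length counts the terminal stop codon. Both assumptions are consistent with the reported values but are not stated in the record. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 1041212
end_bp = 1042054
reported_gene_length = 843      # bp
reported_protein_length = 280   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(start_bp, end_bp)
print(length == reported_gene_length)                      # True
print(protein_length(length) == reported_protein_length)   # True
```

Under these assumptions, 843 bp corresponds to 281 codons, of which the last is the stop codon, leaving the 280 residues reported.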
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTGC TGCGATACGG GGCGAAGTCC CGAGAAAAAC CCGGCCTGCT CGATGCGCAG GGGCGCATTC GCGATCTGTC CGGCGTCATC GACGACGTCG CGGGCGACGC GCTCGGCCCC GATGCGCTCG CGCGGCTGCG CGCGATCGAT CCGGCGAGCC TGCCGCTCGT CGACGGCGCG CCGCGCCTCG GCGCGTGCGT GGGCCGCGTC GGCAAGTTCG TCTGCATCGG GCTCAACTAT TCGGATCACG CGGCCGAATC CGGCATGGAC GTGCCGAGCG AGCCCGTCGT CTTCGGCAAG TGGACGAGCG CGATCTGCGG GCCCGACGAC GACGTCGAAC TCCCGCCCGG CTCAACGAAG ACCGATTGGG AAGTGGAGCT CGGCGTCGTG ATCGGCACGG GCGGGCGCGA CATCGACGAA GCGCGTGCGC TTGCGCACGT GGCCGGCTAT TGCATCGTCA ACGACGTGTC CGAGCGCGCG TACCAGCTCG AGCGCGGCGG CACGTGGGAC AAGGGCAAGG GATGCGACAC GTTCGGCCCG CTCGGGCCCT GGCTCGTGAC GGCCGACGAA GTGCCGGACC CGCACCGGCT GAAGCTGTGG CTCGACGTCG ACGGTCGCCG CTATCAGCAT GGCTCGACCG CGACGATGAT CTTTCGCGTG CCGTTCCTGA TCAGCTACTT GAGCCGCTTC ATGAGCCTGC AGCCGGGCGA CGTGATCTCG ACCGGCACGC CGCCGGGCGT CGGGCTTGGT CAAAAGCCGC CCGTCTATCT GCGCGCGGGG CAGGTGATGA CAGTCGGCAT CGAAGGGCTC GGCGAGCAGC GGCAGCGGGT CGTGCAGCGA TGA
|
Protein sequence | MKLLRYGAKS REKPGLLDAQ GRIRDLSGVI DDVAGDALGP DALARLRAID PASLPLVDGA PRLGACVGRV GKFVCIGLNY SDHAAESGMD VPSEPVVFGK WTSAICGPDD DVELPPGSTK TDWEVELGVV IGTGGRDIDE ARALAHVAGY CIVNDVSERA YQLERGGTWD KGKGCDTFGP LGPWLVTADE VPDPHRLKLW LDVDGRRYQH GSTATMIFRV PFLISYLSRF MSLQPGDVIS TGTPPGVGLG QKPPVYLRAG QVMTVGIEGL GEQRQRVVQR
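The two sequence fields and the GC content field can be related with a few lines of code. The following is a minimal sketch, not the database's own method: it builds the codon table from the usual TCAG ordering, strips the whitespace that separates the 10-base blocks, and translates until the first stop codon. Translation table 11 uses the same amino-acid assignments as the standard code and differs in which codons may act as starts, so the sketch does not handle alternative start codons (this gene starts with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table in TCAG order; amino-acid assignments shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
prefix = "ATGAAACTGC TGCGATACGG GGCGAAGTCC"
print(translate(prefix))  # MKLLRYGAKS
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` gives values that can be compared with the GC content and protein sequence fields.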