Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A0389 |
Symbol | |
ID | 4904840 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 370332 |
End bp | 371369 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640143496 |
Product | putative oxygenase |
Protein accession | YP_001074432 |
Protein GI | 126456261 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.767205 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTATCCG TGGGCAATGA GGACCTCGAC CTGACAAACC GCCACAACGC GGAAGACTGG CTGCCCGACC GAATCCGGCT GCGCAACGTG TGGCTGCCGC TCGCCCATAC GTTCGAGATC GGCGAGTGCG CTTCGCGCTG GTACGTTCAT TCGGAGCCGT GCTATCTGTG GCGCGCGGCA GGCCGCATCC ATGCGTGCCC CTGGCATCCC GGACTGCCGG CGGCGAAGCG CCCCACGCCG CGCCCGCGGG ACGCGGACGC CGCGTGCTAC CCGGTCGTCG AACGATTCGG CTATGTATGG GTGTGGTACG GCGAGCCCGA GGCCGCGAGC GACGCCTTCG TGCCCGACGT GCCGTTCCTG CCGCGCGACG GCGGCCTGCC GAAATACATG CAGGGCAACA TCCGGGTCGA TTGCTGCGCG CCGCTGCTCA TCGAAAACCT GCTCGATCTG ACGCACTCGG ACTTTCTGCA CGCGAAGGTG TTCGGCGATC AGCACGCCGA AGAGGACCGG GTCGACGTCA GCTACACGTC CGAGACGGTC ACGATGATCC GCCGCTGCAA GAACAAGTCG ATCCTGCCGA TCATGCGCTG GTTCGGCGGC GTGCGCGCGA AATATCAGGA CATTCACGCG GTGGTCCACG TCCACGTGCG CAGCTCGATC GCGCTTGCCT ACGGCCGTCA TACGCCGGGC AGCGATCTGC CGCTGTTCCA TCCGTGCGTG CCCGAATCGC GCAACTACTG CCGGCTCAAC TTCGCGCTGA ACGCGACGCA GGCGCCCTGG CCGCTACGCC TGCTGTTGCC GTTCGTGCCC TACGTCGTCG GCCTTCAGGA CAACAGCATG GTCAGGCGGC AGAGCGGCCG CTATCTGGAC GCCGGCGAGC GCCGCGATCT GTATTCGCGT TTCGACCGCG CCGGCTTGCG TTACCGGATT CTGCTGCAGC AGCTCGCGAA ACGGCAGAGC GAGGGCGATT TCAGTTACGC GGACGATGCG CTGCCGAGCC GGGACGCGCG CGGCATCCTC GGGATGCCGA ACGAATAG
|
Protein sequence | MLSVGNEDLD LTNRHNAEDW LPDRIRLRNV WLPLAHTFEI GECASRWYVH SEPCYLWRAA GRIHACPWHP GLPAAKRPTP RPRDADAACY PVVERFGYVW VWYGEPEAAS DAFVPDVPFL PRDGGLPKYM QGNIRVDCCA PLLIENLLDL THSDFLHAKV FGDQHAEEDR VDVSYTSETV TMIRRCKNKS ILPIMRWFGG VRAKYQDIHA VVHVHVRSSI ALAYGRHTPG SDLPLFHPCV PESRNYCRLN FALNATQAPW PLRLLLPFVP YVVGLQDNSM VRRQSGRYLD AGERRDLYSR FDRAGLRYRI LLQQLAKRQS EGDFSYADDA LPSRDARGIL GMPNE
|
| |