Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_0184 |
Symbol | |
ID | 4881857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 179247 |
End bp | 180050 |
Gene Length | 804 bp |
Protein Length | 267 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640126112 |
Product | HAD-superfamily hydrolase |
Protein accession | YP_001057237 |
Protein GI | 126438847 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01993] pyrimidine 5'-nucleotidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.610769 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGCGCGC GCCGCCAGCC GCGCCGCCCC GCCTCGCCTG CGGCGAGCGA GCAGGCGCCC GAGCGACGGC GCAGGCGGCG CGCCCGCCCT CAGGCGGGCG GCCCCGTCTG GCTCTTCGAT CTCGACAACA CGCTGCACCA CGCGTCGCAT GCGATCTTCC CGGCGATCAA CCGGGCGATG ACGCAGTACA TCATCGATAC GCTGAAAGTC GAGCGCGCGC ACGCCGATCA TCTGCGCACC CACTACACGC GCCGCTACGG CGCGGCGCTC CTCGGCCTCG CGCGCCATCA CCCGATCGAT CCGCACGATT TCCTGAAGGT CGTCCACACG TTCGCCGACC TGCCGTCGAT GGTGCGCGCC GAGCGCGGGC TCGCGCGGCT CGTCGCCGCG CTGCCCGGCC GCAAGATCGT GCTGACGAAC GCCCCCGAAA CCTATGCGCG CGCGGTGCTG CGCGAGCTGA AGATCGACCG CCTGTTCGAG CGCGTGATCG CGATCGAGCA GATGCGCGAT CGCCGCGCAT GGCGCGCGAA GCCCGACGCC ACGATGCTGC GCCGGGCGAT GCGCGCGGCG CACGCGCGCC TGCCGGACGC CATCCTCGTC GAGGATACGC GCGGCCACCT GAAGCGCTAC AAGCGGCTCG GCATCCGCAC CGTCTGGATC ACCGGGCACC TGCCCGGCCA TCTGCCAAGC TACGGACGAC CGCACTATGT CGATCGTCGC ATTGGTTCGC TAAAATCGCT GCGATTGGGC ACTCGATCGG GGCGACGAAA ATGCAGCCGA CTCACCCGCA TGACCCGGCA GTAA
|
Protein sequence | MSARRQPRRP ASPAASEQAP ERRRRRRARP QAGGPVWLFD LDNTLHHASH AIFPAINRAM TQYIIDTLKV ERAHADHLRT HYTRRYGAAL LGLARHHPID PHDFLKVVHT FADLPSMVRA ERGLARLVAA LPGRKIVLTN APETYARAVL RELKIDRLFE RVIAIEQMRD RRAWRAKPDA TMLRRAMRAA HARLPDAILV EDTRGHLKRY KRLGIRTVWI TGHLPGHLPS YGRPHYVDRR IGSLKSLRLG TRSGRRKCSR LTRMTRQ
|
| |