Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10229_0410 |
Symbol | |
ID | 4789634 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10229 |
Kingdom | Bacteria |
Replicon accession | NC_008835 |
Strand | + |
Start bp | 412069 |
End bp | 413328 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | DszC family monooxygenase |
Protein accession | YP_001024233 |
Protein GI | 124383200 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.993874 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCACG CCCCTGCCGA CCCGACGCGC TCGACCGGCG CCGATTATGC ATCGCTCGCC GCCCGTTTCC GGCCGATCTT CGCGCGCATC GCCGAAGGCG CGATCGAGCG CGACCGCACG CGCGCGCTGC CGCACGAACC GATTCGCTGG CTGAAGGAAG CCGGTTTCGG CGCGTTGCGC GTGCCCGTGC ATGCGGGCGG CGCCGGCGCG TCGGTTCCTC AGCTCGTTCA ATTGCTGATC GAGCTCGCGG CCGCGGATTC GAATCTGCCG CAGGCGCTGC GCGGCCATTT CGCCTTCGTC GAGGACTGGC TGAATGCGCC GCCCGACGCC GCGCGGCGCG CGTGGTTCGA CCGCTTCGCG AGCGGCCAGC TCGTCGGCAA CGCGTGGACG GAAGTGGGCG ACGTCGCGCT CGGCGAAGTG CGCACGAAGG TGGCGAAACG CGACGGCGGC TGGGTGGTGA ACGGCGAGAA GTACTACAGC ACCGGAGCGA TCTTCGCGGA CTGGATCGAC GTCTACGCGC AGCGCACGGA CGACGGCGGC CCGGTGATCG CCGCGGTCGC GGCGCGCCAG GACGGCGTGA TCCTCGGCGA CGACTGGGAC GGCTTCGGCC AGGCGACGAC GGGCAGCGGC ACGACACGCT TCGTCGACGC GCACGTCGAC GAAGCGAACG TCATCGATTT CGCGCGCCGC TTCAAATATC AGACCGCGTT CTACCAGTTG TTCCATCTCG CGACGCTCGC CGGCATCGGC CGCGCGGTCG AGCGCGACGC GAGCGCGCTC GTGCGCGGGC GCCGCCGGGT CTACAGCCAC GGCAACGCGC CGCGCGTGAG CGACGACGCA CAGATCCTGC AGGTCGTCGG CGAGATCTCG GCATGGGCGT ATGCGGCCGA GGCGATCGCG CTGCGCGCCG CGCAGCCGTC GCAGCGCGCG TACGAAGCGC GCGTCGGCGG CGACGCGGCC GCCGAGCACG ACGCGAACGT CGCGGCCGAA ATCGAATCGG CGCAGGGGCA ACTGGTGGTG TCGGAGCTCG TGCTGCGCGC GGCGACGCAT CTGTTCGACG CGCTCGGCGC GTCGGCGACG CGCGCGACGA ACGCGCTCGA CCGTCACTGG CGCAACGCGC GCACGGTCGC ATCGCATAAT CCGCTCGTCT ACAAGGCGAG GATCGTCGGG GATCGCGCGG TCAACGGAAC CGAGCCGCCC TACGTCTGGC AGATCGGCGC CGGGCCCGGC GGGCCGCGCG AACCGGAACA GGCGGCGTGA
|
Protein sequence | MSHAPADPTR STGADYASLA ARFRPIFARI AEGAIERDRT RALPHEPIRW LKEAGFGALR VPVHAGGAGA SVPQLVQLLI ELAAADSNLP QALRGHFAFV EDWLNAPPDA ARRAWFDRFA SGQLVGNAWT EVGDVALGEV RTKVAKRDGG WVVNGEKYYS TGAIFADWID VYAQRTDDGG PVIAAVAARQ DGVILGDDWD GFGQATTGSG TTRFVDAHVD EANVIDFARR FKYQTAFYQL FHLATLAGIG RAVERDASAL VRGRRRVYSH GNAPRVSDDA QILQVVGEIS AWAYAAEAIA LRAAQPSQRA YEARVGGDAA AEHDANVAAE IESAQGQLVV SELVLRAATH LFDALGASAT RATNALDRHW RNARTVASHN PLVYKARIVG DRAVNGTEPP YVWQIGAGPG GPREPEQAA
|
| |