Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_2739 |
Symbol | |
ID | 4884852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 2708661 |
End bp | 2709671 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640128667 |
Product | luciferase-like monooxygenase |
Protein accession | YP_001059763 |
Protein GI | 126439990 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03558] luciferase family oxidoreductase, group 1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCCCGT TTTCCGTTCT CGACCTCGCG CCGATTCCCG CCGGCGCCGA CGCCGCCCAG GCGTTGCGCC ATTCCGTCGA CCTCGCGCGG CACGCCGAGC GCCTCGGCTA TCGCCGCTAC TGGCTCGCCG AGCACCACAA CATGCCCGGC ATCGCGAGCG CGGCGACCGC GGTCGTGATC GGCCACGTCG CGGGCGCGAC GCGGACGATT CGCGTCGGCT CGGGCGGCGT GATGCTGCCG AACCATGCGC CGCTCGTGAT CGCCGAGCAG TTCGGCACGC TCGCGTCGCT GTACCCGGGC CGCATCGATC TCGGTCTCGG GCGCGCGCCC GGCACCGATC AGACGACGGC CCGCGCGCTG CGCCGCGACC TGATCGGCAG CGCCGATTCG TTCCCCGACG ACGTGGTGGA GCTGCAGCGC TACTTCGCCG CGCCCGCCGC CGGCCAGCGC GTGCGCGCCG TGCCGGGCGC GGGGCTCGAC GTGCCGATCT GGCTGCTCGG CTCGAGCCTG TTCAGCGCGC AGCTCGCCGC GATGCTCGGG CTGCCGTTCG CGTTCGCTTC GCATTTCGCG CCGGACTACC TGATGCGCGC GCTCGACGTG TACCGCGCGC AGTTCCGGCC GTCCGCCGCG CTCGACAAGC CGTATGCGAT GGTCGGCGTG AACGTGTTCG CCGCCGACAC CGACGACGAC GCGCGACGCC TGTTCACGTC GCTGCAGCAG CAGTTCCTGA AGCTGCGGCG CGGCACGCCC GGCCAACTGC CGCCGCCCGT CGAATCGCTC GACGCGCTCG GCGCGACCGA GCAGGAACTC GCGAACGTCG CGCATGCACT GTCGTTCGCC GCGGTCGGCT CGCGCGACAC CGTGCACGAG CGGCTGCGGC GGTTGATCGC GCAGACGGGC GCGGACGAGC TGATCGTCGC CGCGCAGATC TTCGATCACG GCGCACGGGT GCGCTCGTAC GAGATCGCCG CGCAGGTGCG CGACGCGCTT CGCAACGAAG CCGGGGTTTG A
|
Protein sequence | MIPFSVLDLA PIPAGADAAQ ALRHSVDLAR HAERLGYRRY WLAEHHNMPG IASAATAVVI GHVAGATRTI RVGSGGVMLP NHAPLVIAEQ FGTLASLYPG RIDLGLGRAP GTDQTTARAL RRDLIGSADS FPDDVVELQR YFAAPAAGQR VRAVPGAGLD VPIWLLGSSL FSAQLAAMLG LPFAFASHFA PDYLMRALDV YRAQFRPSAA LDKPYAMVGV NVFAADTDDD ARRLFTSLQQ QFLKLRRGTP GQLPPPVESL DALGATEQEL ANVAHALSFA AVGSRDTVHE RLRRLIAQTG ADELIVAAQI FDHGARVRSY EIAAQVRDAL RNEAGV
|
| |