Gene BURPS668_2739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2739 
Symbol 
ID4884852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2708661 
End bp2709671 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content73% 
IMG OID640128667 
Productluciferase-like monooxygenase 
Protein accessionYP_001059763 
Protein GI126439990 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03558] luciferase family oxidoreductase, group 1 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCCGT TTTCCGTTCT CGACCTCGCG CCGATTCCCG CCGGCGCCGA CGCCGCCCAG 
GCGTTGCGCC ATTCCGTCGA CCTCGCGCGG CACGCCGAGC GCCTCGGCTA TCGCCGCTAC
TGGCTCGCCG AGCACCACAA CATGCCCGGC ATCGCGAGCG CGGCGACCGC GGTCGTGATC
GGCCACGTCG CGGGCGCGAC GCGGACGATT CGCGTCGGCT CGGGCGGCGT GATGCTGCCG
AACCATGCGC CGCTCGTGAT CGCCGAGCAG TTCGGCACGC TCGCGTCGCT GTACCCGGGC
CGCATCGATC TCGGTCTCGG GCGCGCGCCC GGCACCGATC AGACGACGGC CCGCGCGCTG
CGCCGCGACC TGATCGGCAG CGCCGATTCG TTCCCCGACG ACGTGGTGGA GCTGCAGCGC
TACTTCGCCG CGCCCGCCGC CGGCCAGCGC GTGCGCGCCG TGCCGGGCGC GGGGCTCGAC
GTGCCGATCT GGCTGCTCGG CTCGAGCCTG TTCAGCGCGC AGCTCGCCGC GATGCTCGGG
CTGCCGTTCG CGTTCGCTTC GCATTTCGCG CCGGACTACC TGATGCGCGC GCTCGACGTG
TACCGCGCGC AGTTCCGGCC GTCCGCCGCG CTCGACAAGC CGTATGCGAT GGTCGGCGTG
AACGTGTTCG CCGCCGACAC CGACGACGAC GCGCGACGCC TGTTCACGTC GCTGCAGCAG
CAGTTCCTGA AGCTGCGGCG CGGCACGCCC GGCCAACTGC CGCCGCCCGT CGAATCGCTC
GACGCGCTCG GCGCGACCGA GCAGGAACTC GCGAACGTCG CGCATGCACT GTCGTTCGCC
GCGGTCGGCT CGCGCGACAC CGTGCACGAG CGGCTGCGGC GGTTGATCGC GCAGACGGGC
GCGGACGAGC TGATCGTCGC CGCGCAGATC TTCGATCACG GCGCACGGGT GCGCTCGTAC
GAGATCGCCG CGCAGGTGCG CGACGCGCTT CGCAACGAAG CCGGGGTTTG A
 
Protein sequence
MIPFSVLDLA PIPAGADAAQ ALRHSVDLAR HAERLGYRRY WLAEHHNMPG IASAATAVVI 
GHVAGATRTI RVGSGGVMLP NHAPLVIAEQ FGTLASLYPG RIDLGLGRAP GTDQTTARAL
RRDLIGSADS FPDDVVELQR YFAAPAAGQR VRAVPGAGLD VPIWLLGSSL FSAQLAAMLG
LPFAFASHFA PDYLMRALDV YRAQFRPSAA LDKPYAMVGV NVFAADTDDD ARRLFTSLQQ
QFLKLRRGTP GQLPPPVESL DALGATEQEL ANVAHALSFA AVGSRDTVHE RLRRLIAQTG
ADELIVAAQI FDHGARVRSY EIAAQVRDAL RNEAGV