Gene BURPS1106A_2799 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2799 
Symbol 
ID4901096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2758030 
End bp2759088 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content72% 
IMG OID640136026 
Productluciferase-like monooxygenase 
Protein accessionYP_001067050 
Protein GI126453937 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03558] luciferase family oxidoreductase, group 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCCGGC CCGCGTTCCG TCCCCATGCC AACCGTTCCG TCGTCGCCAT GATCCCGTTT 
TCCGTTCTCG ACCTCGCGCC GATTCCCGCC GGCGCCGACG CCGCCCAGGC GTTGCGCCAT
TCCGTCGACC TCGCGCGGCA CGCCGAGCGC CTCGGCTATC GCCGCTACTG GCTCGCCGAG
CACCACAACA TGCCCGGCAT CGCGAGCGCG GCGACCGCGG TCGTGATCGG CCACGTCGCG
GGCGCGACGC GGACGATTCG CGTCGGCTCG GGCGGCGTGA TGCTGCCGAA CCATGCGCCG
CTCGTGATCG CCGAGCAGTT CGGCACGCTC GCGTCGCTGT ACCCGGGCCG CATCGATCTC
GGTCTCGGGC GCGCGCCCGG CACCGATCAG ACGACGGCCC GCGCGCTGCG CCGCGACCTG
ATCGGCAGCG CCGATTCGTT CCCCGACGAC GTGGTGGAGC TGCAGCGCTA CTTCGCCGCA
CCCGCCGCCG GCCAGCGCGT GCGCGCCGTG CCGGGCGCGG GGCTCGACGT GCCGATCTGG
CTGCTCGGCT CGAGCCTGTT CAGCGCGCAG CTCGCCGCGA TGCTCGGGCT GCCGTTCGCG
TTCGCTTCGC ATTTCGCGCC GGACTACCTG ATGCGCGCGC TCGACGTGTA CCGCGCGCAG
TTCCGGCCGT CCGCCGCGCT CGACAAGCCG TATGCGATGG TCGGCGTGAA CGTGTTCGCC
GCCGACACCG ACGACGACGC GCGACGCCTG TTCACGTCGC TGCAGCAGCA GTTCCTGAAG
CTGCGGCGCG GCACGCCCGG CCAACTGCCG CCGCCCGTCG AATCGCTCGA CGCGCTCGGC
GCGACCGAGC AGGAACTCGC GAACGTCGCG CATGCACTGT CGTTCGCCGC GGTCGGCTCG
CGCGACACCG TGCACGAGCG GCTGCGGCGG TTGATCGCGC AGACGGGCGC GGACGAGCTG
ATCGTCGCCG CGCAAATCTT CGATCACGGC GCACGGGTGC GCTCGTACGA GATCGCCGCG
CAGGTGCGCG ACGCGCTTCG CGACGAAGCC GGGGTTTGA
 
Protein sequence
MRRPAFRPHA NRSVVAMIPF SVLDLAPIPA GADAAQALRH SVDLARHAER LGYRRYWLAE 
HHNMPGIASA ATAVVIGHVA GATRTIRVGS GGVMLPNHAP LVIAEQFGTL ASLYPGRIDL
GLGRAPGTDQ TTARALRRDL IGSADSFPDD VVELQRYFAA PAAGQRVRAV PGAGLDVPIW
LLGSSLFSAQ LAAMLGLPFA FASHFAPDYL MRALDVYRAQ FRPSAALDKP YAMVGVNVFA
ADTDDDARRL FTSLQQQFLK LRRGTPGQLP PPVESLDALG ATEQELANVA HALSFAAVGS
RDTVHERLRR LIAQTGADEL IVAAQIFDHG ARVRSYEIAA QVRDALRDEA GV