Gene BURPS1106A_A0838 details


Gene Information

Locus tag: BURPS1106A_A0838
Symbol:
ID: 4905849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 835896
End bp: 836990
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 67%
IMG OID: 640143944
Product: luciferase family monooxygenase
Protein accession: YP_001074874
Protein GI: 126456412
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID:
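
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the record itself. The function and variable names are illustrative, and it assumes that coordinates are 1-based and inclusive and that the gene length includes the stop codon while the protein length does not.

```python
# Values copied from the record above.
start_bp = 835896
end_bp = 836990
gene_length_bp = 1095
protein_length_aa = 364


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one for the stop codon (unspliced CDS assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the fields are mutually consistent.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 836990 - 835896 + 1 gives 1095 bp, and 1095 / 3 - 1 gives 364 aa, matching the listed values.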


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.304793
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCATG AGCCGGACCC CCTCAAGTTC GCCTACTGGG TGCCCAACGT CAGCGGCGGC 
CTCGTCGTCA GCAAGATCGA GCAGCGCACG AGCTGGGATA TCGATTACAA CCGCCGCCTC
GCGCGCCTCG CCGAGCAGAG CGGCTTCGAC TATGCGTTGT CGCAGATCCG TTTCACGGCC
GGCTACGGCG CCGAATATCA GCACGAGTCC GTTGCGTTCA GCCATGCGCT GCTCGCGGCG
ACCGAGCGCC TCAACGTGAT CGCGGCGATC CTGCCGGGGC CGTGGCATCC GGCCGTCGTC
GCGAAACAGC TCGCGACGAT CGATCAACTG AACCAGGGGC GCGTCGCGAT CAATGTCGTG
AGCGGCTGGT TCAAGGGCGA ATTCACCGCG ATCGGCGAGC CGTGGCTCGA GCACGACGAG
CGCTATCGCC GCTCCGAGGA GTTCATCCGC GCGGTGAAGG GCGTCTGGAC GCAGGACAAC
TTCACGTTCA AGGGCGACTT CTACCGGTTC AACGATTACA CGCTCAAGCC GAAGCCGCTG
CGGCAGCCGC ACCCGGAAAT CTTCCAGGGC GGCAATTCGG CGGCCGCGCG CCGGATGGCG
GCCGCCGTGT CCGACTGGTA CTTCATGAAC GGCAACACGC CCGACGGCCA TCGCGCGCAG
ATCGACGAGA TTCGCGCGGC GGCGGCGGCG CACGGGCGGC GGGTGAAGTT CGGCGTCAAT
GCGTTCATCA TCGCGCGCGA CACCGAGCGC GAGGCGCGCG ACGTGCTCGA CGAGATCGTG
CGCCACGCGG ACGTCGACGC GGTCAACGCG TTCGGCCATG CGGTCCAGCA GGCGGGCAAG
GCCGCGCCCG AAGGGCGGGG AATGTGGGCC GATTCGAAGT TCGCCGATCT CGTGCAGTAC
AACGACGGCT TCAAGACCAA CCTGATCGGC ACCCCCGAGC AGATCGCCGA GCGCATCGTC
GCGCTGAAGG CGATCGGCGT CGATCTCGTG CTCGGCGGAT TCCTGCATTA TCTGGAAGAC
GTCGAGTATT TCGGCAAGCG CGTGCTGCCG CTCGTGCGCG AACTGGAGCG GCGGCGCGAC
GCGCAGCCGG CGTGA
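
The GC content field can in principle be recomputed from the gene sequence above. Below is a minimal Python sketch of that calculation. The helper name `gc_percent` is hypothetical, and the demo call uses only the first 20 bases of the sequence, so its result is not expected to equal the whole-gene figure listed in the record.

```python
def gc_percent(seq):
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence, used only as a small demo.
print(gc_percent("ATGAGTCATG AGCCGGACCC"))
```

Passing the full 1095-base sequence (spaces and line breaks are ignored) would give the value to compare against the rounded GC content in the record.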
 
Protein sequence
MSHEPDPLKF AYWVPNVSGG LVVSKIEQRT SWDIDYNRRL ARLAEQSGFD YALSQIRFTA 
GYGAEYQHES VAFSHALLAA TERLNVIAAI LPGPWHPAVV AKQLATIDQL NQGRVAINVV
SGWFKGEFTA IGEPWLEHDE RYRRSEEFIR AVKGVWTQDN FTFKGDFYRF NDYTLKPKPL
RQPHPEIFQG GNSAAARRMA AAVSDWYFMN GNTPDGHRAQ IDEIRAAAAA HGRRVKFGVN
AFIIARDTER EARDVLDEIV RHADVDAVNA FGHAVQQAGK AAPEGRGMWA DSKFADLVQY
NDGFKTNLIG TPEQIAERIV ALKAIGVDLV LGGFLHYLED VEYFGKRVLP LVRELERRRD
AQPA
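
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates this in plain Python. The names `CODON_TABLE` and `translate` are illustrative, and the code relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). It demonstrates only the first and last bases of the gene rather than the full sequence.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate DNA from its first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    # Step through complete codons only; trailing bases are ignored.
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Start of the gene: should agree with the start of the protein sequence.
print(translate("ATGAGTCATG AGCCGGACCC"))
# Final 15 bases of the gene, ending in the TGA stop codon.
print(translate("GCGCAGCCGG CGTGA"))
```

The two calls yield MSHEPD and AQPA, which agree with the beginning and end of the protein sequence listed above.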