Gene BURPS1106A_1836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1836 
SymbolmsuD 
ID4902169 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1795353 
End bp1796510 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content70% 
IMG OID640135066 
Productalkanesulfonate monooxygenase 
Protein accessionYP_001066102 
Protein GI126454735 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03565] alkanesulfonate monooxygenase, FMNH(2)-dependent 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGTGT TCTGGTTCAT CCCCACGCAC GGCGACAGCC GCTATCTCGG CACGGCCGAG 
GGCGCGCGCG CCGCGGACTA CGACTACTTC CGGCAGGTTG CCGTCGCGGC CGACACGCTC
GGCTACGACG GCGTGCTGCT GCCGACGGGG CGTTCGTGCG AGGATGCGTG GGTGGTCGCC
TCGAGCCTGA TTCCGGCGAC GAAGCGCCTG AAGTTCCTGG TCGCGATCCG CCCGGGCCTG
TCGTCGCCGG GGCTCTCCGC GCGGATGGCG TCGACGTTCG ACCGGCTCTC CGATGGGCGT
TTGCTGATCA ACGTCGTGAC GGGCGGCGAT TCGGCCGAGC TAGAAGGCGA TGGCCTCTTC
GCCGATCACG ACACGCGCTA CGCGATCACC GACGACTTCC TGCACATCTG GCGCGGGCTG
CTCGCCGAAT CGCACGAGAA CGGCGGCATC GATTTCGACG GCGAGCACCT GAGCGCGAAG
GGCGGCAAGC TGCTGTACCC GCCCGTTCAG CGCCCGCATC CGCCGCTCTG GTTCGGCGGC
TCGTCGCCCG CCGCGCACGC GATCGCGGCC GACCACATCG ATACGTACCT GAGCTGGGGC
GAGCCGCCCG CGGCGGTCGA GAAGAAGATC GCCGACATCC GCGCGCGCGC GGCCGCGCGC
GGCCGCGAGA TCAAGTTCGG GATTCGCCTG CACGTGATCG TGCGCGAGAC GCAGGAAGAG
GCATGGCGCG ACGCCGATCG CCTCATCAGC CGGCTCGACG ACGATACGAT CGCGCGCGCG
CAACAGGCGT TCGCGAAGAT GGATTCCGAA GGGCAGCGCC GGATGGCCGC GCTGCACGGC
GGCAAGCGCG GCTCGCGCCA GGAGCTCGAG ATCTATCCGA ACCTGTGGGC GGGCGTCGGG
CTCGTGCGCG GCGGCGCGGG GACGGCGCTC GTCGGGAATC CCGAGCAAAT CGCCGCGCGG
ATGCGCGAGT ACGCGGCGCT CGGCATCGAG ACGTTCATCC TGTCCGGCTA TCCGCATCTC
GAGGAATCGT ACCGCTTCGC CGAGCTCGTG TTTCCGCTCG TCAAGGGCGG CGGCAACACG
CGCCGCGCGG GGCCGCTGTC GGGGCCGTTC GGCGAAGTCG TCGGCAACCA GTATCTGCCG
AAGGCGAGCC AGAGCTGA
 
Protein sequence
MNVFWFIPTH GDSRYLGTAE GARAADYDYF RQVAVAADTL GYDGVLLPTG RSCEDAWVVA 
SSLIPATKRL KFLVAIRPGL SSPGLSARMA STFDRLSDGR LLINVVTGGD SAELEGDGLF
ADHDTRYAIT DDFLHIWRGL LAESHENGGI DFDGEHLSAK GGKLLYPPVQ RPHPPLWFGG
SSPAAHAIAA DHIDTYLSWG EPPAAVEKKI ADIRARAAAR GREIKFGIRL HVIVRETQEE
AWRDADRLIS RLDDDTIARA QQAFAKMDSE GQRRMAALHG GKRGSRQELE IYPNLWAGVG
LVRGGAGTAL VGNPEQIAAR MREYAALGIE TFILSGYPHL EESYRFAELV FPLVKGGGNT
RRAGPLSGPF GEVVGNQYLP KASQS