Gene BURPS668_1821 details

Gene Information

Locus tag: BURPS668_1821
Symbol: msuD
ID: 4882851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 1789616
End bp: 1790773
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 69%
IMG OID: 640127749
Product: alkanesulfonate monooxygenase
Protein accession: YP_001058857
Protein GI: 126439281
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID: [TIGR03565] alkanesulfonate monooxygenase, FMNH(2)-dependent
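
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1789616
end_bp = 1790773
gene_length_bp = 1158
protein_length_aa = 385

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that does not appear in the protein.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # coordinates agree with gene length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop equals protein length
```

Under these assumptions all three checks agree with the reported values: 1790773 - 1789616 + 1 = 1158 bp, and 1158 / 3 - 1 = 385 aa.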


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGTGT TCTGGTTCAT CCCCACGCAC GGCGACAGCC GCTATCTCGG CACGGCCGAG 
GGCGCGCGCG CCGCGGACTA CGACTACTTC CGGCAGGTTG CCGTCGCAGC CGACACGCTC
GGCTACGACG GTGTGCTGCT GCCGACGGGG CGTTCGTGCG AGGATGCGTG GGTGGTCGCC
TCGAGCCTGA TTCCGGCGAC GAAGCGCCTG AAGTTCCTGG TCGCGATCCG CCCGGGCCTG
TCGTCGCCGG GGCTCTCCGC GCGGATGGCG TCGACGTTCG ACCGGCTCTC CGATGGGCGT
TTGCTGATCA ACGTCGTGAC GGGCGGCGAT TCGGCCGAGC TAGAAGGCGA TGGCCTCTTC
GCCGATCACG ACACGCGCTA CGCGATCACC GACGACTTCC TGCACATCTG GCGCGGGCTG
CTCGCCGAAT CGCACGAGAA CGGCGGCATC GATTTCGACG GCGAGCACCT GAGCGCGAAG
GGCGGCAAGC TGCTGTACCC GCCCGTTCAG CGCCCGCATC CGCCGCTCTG GTTCGGCGGC
TCGTCGCCCG CCGCGCACGC GATCGCGGCC GACCACATCG ATACGTACCT GAGCTGGGGC
GAGCCGCCTG CGGCGGTCGA GAAGAAGATC GCCGACATCC GCGCGCGCGC GGCCGCGCGC
GGCCGCGAGA TCAAGTTCGG GATTCGCCTG CACGTGATCG TGCGCGAGAC GCAGGAAGAG
GCATGGCGCG ACGCCGATCG CCTCATCAGC CGGCTCGACG ACGATACGAT CGCGCGCGCG
CAACAGGCGT TCGCGAAGAT GGATTCCGAA GGGCAGCGCC GGATGGCCGC GCTGCACGGC
GGCAAGCGCG GCTCGCGCCA GGAGCTCGAG ATCTATCCGA ACCTGTGGGC GGGCGTCGGG
CTCGTGCGCG GCGGCGCGGG GACGGCGCTC GTCGGGAATC CCGAGCAAAT CGCCACGCGG
ATGCGCGAGT ACGCGGCGCT CGGCATCGAG ACGTTCATCC TGTCCGGCTA TCCGCATCTC
GAGGAATCGT ACCGCTTCGC CGAGCTCGTG TTTCCGCTCG TCAAGGGCGG CGGCAACACG
CGCCGCGCGG GGCCGCTGTC GGGGCCGTTC GGCGAAGTCG TCGGCAACCA GTATCTGCCG
AAGGCGAGCC AGAGCTGA
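
The record reports a GC content of 69% for this gene. One way to compute such a figure from a sequence printed in space-separated 10-base groups is sketched below. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the usage lines run on the first row of the gene sequence only; passing the full sequence would give the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Return the fraction of bases that are G or C (0.0 to 1.0)."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied as printed (six groups of ten bases).
first_row = "ATGAATGTGT TCTGGTTCAT CCCCACGCAC GGCGACAGCC GCTATCTCGG CACGGCCGAG"

print(len(clean_sequence(first_row)))           # number of bases after cleaning
print(round(100 * gc_fraction(first_row), 1))   # GC percentage of this row only
```

The GC fraction of a single row can differ noticeably from the whole-gene figure, so the 69% value should only be compared against a run on the complete sequence.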
 
Protein sequence
MNVFWFIPTH GDSRYLGTAE GARAADYDYF RQVAVAADTL GYDGVLLPTG RSCEDAWVVA 
SSLIPATKRL KFLVAIRPGL SSPGLSARMA STFDRLSDGR LLINVVTGGD SAELEGDGLF
ADHDTRYAIT DDFLHIWRGL LAESHENGGI DFDGEHLSAK GGKLLYPPVQ RPHPPLWFGG
SSPAAHAIAA DHIDTYLSWG EPPAAVEKKI ADIRARAAAR GREIKFGIRL HVIVRETQEE
AWRDADRLIS RLDDDTIARA QQAFAKMDSE GQRRMAALHG GKRGSRQELE IYPNLWAGVG
LVRGGAGTAL VGNPEQIATR MREYAALGIE TFILSGYPHL EESYRFAELV FPLVKGGGNT
RRAGPLSGPF GEVVGNQYLP KASQS
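
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is a small pure-Python translator, not the tool used to produce this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may initiate translation), which is sufficient here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Compact codon table in TCAG order; amino-acid assignments of the standard code,
# assumed here to match translation table 11 for all 64 codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace is ignored. Codons containing anything other than
    A, C, G, or T raise KeyError in this minimal version.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence, copied as printed.
print(translate("ATGAATGTGT TCTGGTTCAT CCCCACGCAC"))  # MNVFWFIPTH
print(translate("AAGGCGAGCC AGAGCTGA"))               # KASQS
```

The two fragments translate to the first ten and last five residues of the protein sequence shown above, and the final TGA codon is the stop.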