Gene Information |
Locus tag | BURPS1106A_1836 |
Symbol | msuD |
ID | 4902169 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 1795353 |
End bp | 1796510 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640135066 |
Product | alkanesulfonate monooxygenase |
Protein accession | YP_001066102 |
Protein GI | 126454735 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03565] alkanesulfonate monooxygenase, FMNH(2)-dependent |
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGTGT TCTGGTTCAT CCCCACGCAC GGCGACAGCC GCTATCTCGG CACGGCCGAG GGCGCGCGCG CCGCGGACTA CGACTACTTC CGGCAGGTTG CCGTCGCGGC CGACACGCTC GGCTACGACG GCGTGCTGCT GCCGACGGGG CGTTCGTGCG AGGATGCGTG GGTGGTCGCC TCGAGCCTGA TTCCGGCGAC GAAGCGCCTG AAGTTCCTGG TCGCGATCCG CCCGGGCCTG TCGTCGCCGG GGCTCTCCGC GCGGATGGCG TCGACGTTCG ACCGGCTCTC CGATGGGCGT TTGCTGATCA ACGTCGTGAC GGGCGGCGAT TCGGCCGAGC TAGAAGGCGA TGGCCTCTTC GCCGATCACG ACACGCGCTA CGCGATCACC GACGACTTCC TGCACATCTG GCGCGGGCTG CTCGCCGAAT CGCACGAGAA CGGCGGCATC GATTTCGACG GCGAGCACCT GAGCGCGAAG GGCGGCAAGC TGCTGTACCC GCCCGTTCAG CGCCCGCATC CGCCGCTCTG GTTCGGCGGC TCGTCGCCCG CCGCGCACGC GATCGCGGCC GACCACATCG ATACGTACCT GAGCTGGGGC GAGCCGCCCG CGGCGGTCGA GAAGAAGATC GCCGACATCC GCGCGCGCGC GGCCGCGCGC GGCCGCGAGA TCAAGTTCGG GATTCGCCTG CACGTGATCG TGCGCGAGAC GCAGGAAGAG GCATGGCGCG ACGCCGATCG CCTCATCAGC CGGCTCGACG ACGATACGAT CGCGCGCGCG CAACAGGCGT TCGCGAAGAT GGATTCCGAA GGGCAGCGCC GGATGGCCGC GCTGCACGGC GGCAAGCGCG GCTCGCGCCA GGAGCTCGAG ATCTATCCGA ACCTGTGGGC GGGCGTCGGG CTCGTGCGCG GCGGCGCGGG GACGGCGCTC GTCGGGAATC CCGAGCAAAT CGCCGCGCGG ATGCGCGAGT ACGCGGCGCT CGGCATCGAG ACGTTCATCC TGTCCGGCTA TCCGCATCTC GAGGAATCGT ACCGCTTCGC CGAGCTCGTG TTTCCGCTCG TCAAGGGCGG CGGCAACACG CGCCGCGCGG GGCCGCTGTC GGGGCCGTTC GGCGAAGTCG TCGGCAACCA GTATCTGCCG AAGGCGAGCC AGAGCTGA
|
Protein sequence | MNVFWFIPTH GDSRYLGTAE GARAADYDYF RQVAVAADTL GYDGVLLPTG RSCEDAWVVA SSLIPATKRL KFLVAIRPGL SSPGLSARMA STFDRLSDGR LLINVVTGGD SAELEGDGLF ADHDTRYAIT DDFLHIWRGL LAESHENGGI DFDGEHLSAK GGKLLYPPVQ RPHPPLWFGG SSPAAHAIAA DHIDTYLSWG EPPAAVEKKI ADIRARAAAR GREIKFGIRL HVIVRETQEE AWRDADRLIS RLDDDTIARA QQAFAKMDSE GQRRMAALHG GKRGSRQELE IYPNLWAGVG LVRGGAGTAL VGNPEQIAAR MREYAALGIE TFILSGYPHL EESYRFAELV FPLVKGGGNT RRAGPLSGPF GEVVGNQYLP KASQS
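The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, not part of the source database. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon (the gene sequence above ends in TGA). The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """Percent G+C in a nucleotide string; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


length = gene_length(1795353, 1796510)
print(length)                  # 1158
print(protein_length(length))  # 385
```

Both values agree with the Gene Length and Protein Length fields. `gc_percent` can be applied to the Gene sequence field as printed, since the space-separated blocks of ten bases are stripped before counting.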