Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_1989 |
Symbol | ssuD |
ID | 3688750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | + |
Start bp | 2161573 |
End bp | 2163084 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637728445 |
Product | alkanesulfonate monooxygenase |
Protein accession | YP_333385 |
Protein GI | 76810366 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03565] alkanesulfonate monooxygenase, FMNH(2)-dependent |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.17879 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGGCGC GGCGCCATGC GGCGCGCGCC GGGTTGTCAG GGACTTGGTG CGACATTATA GGGGGCCCGT GCGCGCATAT TCGCTGCAGC GGCGCGCTAT GCACGAAAGC GGAAAACCTA AGCTCAAAAA AATCGTTCCT ATACGAAATC GCGCTCCCTA GACTGATTCC CCGAGAGACG GCCATGCCGC CCCCCCCGCG CGGCGCGCGG CGGCGCGTGC GCGTTCCCGC GATTTCCGCG GCACGCGTTT CATCCCGCGC ATCGGCGAGC GGCGCGCGCA TCGTGGCGCA TCGTGGCGCG GCCCCGCATC GCCGTCGCCC ATTTCTAGCA CGATCAGTCA GGCAGGAGTT TCGCATGAAT GTGTTCTGGT TCATCCCCAC GCACGGCGAC AGCCGCTATC TCGGCACGGC CGAGGGCGCG CGCGCCGCGG ACTACGACTA CTTCCGGCAG GTTGCCGTCG CGGCCGACAC GCTCGGCTAC GACGGCGTGC TGCTGCCGAC GGGGCGTTCG TGCGAGGATG CGTGGGTGGT CGCCTCGAGC CTGATTCCGG CGACGAAGCG CCTGAAGTTC CTGGTCGCGA TCCGCCCGGG CCTGTCGTCG CCGGGGCTCT CCGCGCGGAT GGCGTCGACG TTCGACCGGC TCTCCGATGG GCGTTTGCTG ATCAACGTCG TGACGGGCGG CGATTCGGCC GAGCTAGAAG GCGATGGCCT CTTCGCCGAT CACGACACGC GCTACGCGAT CACCGACGAC TTCCTGCACA TCTGGCGCGG GCTGCTCGCC GAATCGCACG AGAACGGCGG CATCGATTTC GACGGCGAGC ACCTGAGCGC GAAGGGCGGC AAGCTGCTGT ACCCGCCCGT TCAGCGCCCG CATCCGCCGC TCTGGTTCGG CGGCTCGTCG CCCGCCGCGC ACGCGATCGC GGCCGACCAC ATCGATACGT ACCTGAGCTG GGGCGAGCCG CCCGCGGCGG TCGAGAAGAA GATCGCCGAC ATCCGCGCGC GCGCGGCCGC GCGCGGCCGC GAGATCAAGT TCGGGATTCG CCTGCACGTG ATCGTGCGCG AGACGCAGGA AGAGGCATGG CGCGACGCCG ATCGCCTCAT CAGCCGGCTC GACGACGATA CGATCGCGCG CGCGCAACAG GCGTTCGCGA AGATGGATTC CGAAGGGCAG CGCCGGATGG CCGCGCTGCA CGGCGGCAAG CGCGGCTCGC GCCAGGAGCT CGAGATCTAT CCGAACCTGT GGGCGGGCGT CGGGCTCGTG CGCGGCGGCG CGGGGACGGC GCTCGTCGGG AATCCCGAGC AAATCGCCGC GCGGATGCGC GAGTACGCGG CGCTCGGCAT CGAGACGTTC ATCCTGTCCG GCTATCCGCA TCTCGAGGAA TCGTACCGCT TCGCCGAGCT CGTGTTTCCG CTCGTCAAGG GCGGCGGCAA CACGCGCCGC GCGGGGCCGC TGTCGGGGCC GTTCGGCGAA GTCGTCGGCA ACCAGTATCT GCCGAAGGCG AGCCAGAGCT GA
|
Protein sequence | MKARRHAARA GLSGTWCDII GGPCAHIRCS GALCTKAENL SSKKSFLYEI ALPRLIPRET AMPPPPRGAR RRVRVPAISA ARVSSRASAS GARIVAHRGA APHRRRPFLA RSVRQEFRMN VFWFIPTHGD SRYLGTAEGA RAADYDYFRQ VAVAADTLGY DGVLLPTGRS CEDAWVVASS LIPATKRLKF LVAIRPGLSS PGLSARMAST FDRLSDGRLL INVVTGGDSA ELEGDGLFAD HDTRYAITDD FLHIWRGLLA ESHENGGIDF DGEHLSAKGG KLLYPPVQRP HPPLWFGGSS PAAHAIAADH IDTYLSWGEP PAAVEKKIAD IRARAAARGR EIKFGIRLHV IVRETQEEAW RDADRLISRL DDDTIARAQQ AFAKMDSEGQ RRMAALHGGK RGSRQELEIY PNLWAGVGLV RGGAGTALVG NPEQIAARMR EYAALGIETF ILSGYPHLEE SYRFAELVFP LVKGGGNTRR AGPLSGPFGE VVGNQYLPKA SQS
|
| |