Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_A0393 |
Symbol | soxA |
ID | 3693642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007435 |
Strand | - |
Start bp | 550189 |
End bp | 553197 |
Gene Length | 3009 bp |
Protein Length | 1002 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637730647 |
Product | putative sarcosine oxidase alpha subunit |
Protein accession | YP_335552 |
Protein GI | 76818599 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGA AAGACCGACT CGGCGCAGGC GGGCGCATCA ACCGCGCACA GCCGCTCACC TTCACGTTCA ACGGCCGCAC GTATCAGGGC TTCCAGGGCG ACACGCTCGC GTCTGCGCTG CTCGCCAACG GCGTGCACTT CGTCGCGCGC AGCTTCAAGT ACCACCGTCC GCGCGGGATC GTGACGGCGG GCGTCGACGA GCCGAACGCC GTCGTGCAGC TCGAAACCGG CGCGTACACG GTGCCGAACG CGCGCGCGAC CGAGGTCGAG CTGTATCAGG GGCTCGTCGC GACGAGCGTG AACGCGAAGC CGTCGCTCGA GCACGACCGG ATGGCGGTGA TGCAGAAGCT CGCGCGTTTC CTGCCGGCGG GCTTCTACTA CAAGACGTTC ATGTGGCCGC GCAATCTGTG GCCGAAGTAC GAGGAGAAGA TCCGCGAGGC GGCCGGCCTC GGCAAGGCGC CCGACACGCT CGACGCCGAC CGCTACGACA AGTGCTACGC GCACTGCGAC GTGCTCGTCG TCGGCGGCGG CCCGACGGGG CTCGCGGCCG CGCATGCGGC GGCCGTCAAC GGCGCGCGCG TGATCCTCGT CGACGATCAG CGCGAGCTGG GCGGCAGCCT GCTCGCGTGC CGCGCGGAGA TCGACGGCAA GCCGGCGCTG CAATGGGTCG AGAAGATCGA GGCGGAGCTC GCGAAGCTGC CCGACATGAG CATCCTCACG CGCAGCACCG CGTTCGGCTA TCAGGATCAC AACCTCGTGA CCGTCGTGCA GCGGCTCACC GATCATCTGC CGGTGTCGAT GCGCAAGGGC ACGCGCGAGA TGATCTGGAA GGTGCGCGCC AAGCGCGTGA TCCTCGCCAC GGGCGCGCAC GAGCGGCCGC TCGTGTTCGG CAATAACGAT CTGCCGGGCG TGATGACCGC GTCGGCCGTG TCGGCGTACA TCCATCGCTA CGGTGTGCTG CCGGGGCGCG TCGCGGTCGT CGCGACGAAC AACGATCGCG GCTATCAGTG CGCGCTCGAT CTGAAGGCGT GCGGCGCGAA GGTGACGGTC GTCGATGCGC GCGCGTCGAC GCGCGGCGCA TTGCCCGCGG TCGCCAAGCG CCACGGCATC ACGGTGATGA GCGGCGCGGC CGTGTCGGCT GCGGCGGGCA AGCTGCGCGT CGCGTCGGTC GATGTCGTCT CCTATGCCAA TGGCCGCTCG GGCGGCAAGA TCGCGACGCT GCCGTGCGAT CTGGTCGCGA TGTCGGGCGG CTTCAGCCCG GTGCTGCACC TGTTCGCGCA ATCGGGCGGC AAGGCGCACT GGAACGACGA CAAGGCCTGC TTCGTGCCCG GCAAGCCGGT GCAGGCGGAA GCGAGCGTCG GCGCGGCGGC CGGCGAGTTC GAGCTCGCGC GCGCGCTGCG GCTCGCGCTC GACGCGGGCG TCGCCGCGGC GAAATCGGCG GGCTTTGCCG CCGAGCGTCC GCCCGTGCCG AAGCTCGCCG AGGCGGTGGA GGACGCGCTG CTGCCGTTGT GGCTCGCGAG CGGCGCAGAA GCGGCGGTTC GCGGTCCGAA GCAGTTCGTC GATTTCCAGA ACGACGTCGG CGCGGCCGAC ATCCTGCTCG CCGCGCGCGA AGGTTTCGAA TCGGTCGAGC ACGTGAAGCG CTACACGGCG ATGGGCTTCG GCACCGATCA GGGCAAGCTC GGCAACATCA ACGGGATGGC GATCCTCGCG CAGGCGCTCG GCAAGACGAT TCCGGAGACG GGCACGACGA CGTTCCGCCC GAACTACACG CCGGTGTCGT TCGGCGCGTT CGCGGGCCGC GAGCTCGGCG ATTTCCTCGA CCCGATCCGC AAGACCTGCG TTCACGAATG GCATGTCGAG CACGGCGCGA TGTTCGAGGA CGTCGGCAAC TGGAAGCGGC CGTGGTACTT CCCGCGCAAC GGCGAGGATC TGCACGCGGC GGTCAAGCGC GAGTGCCTCG CGGTGCGCAA CGGCGTCGGC ATGCTCGATG CGTCGACGCT CGGCAAGATC GATATCCAGG GCCCGGACGC GGTGAAGCTG CTGAACTGGG TATACACGAA CCCGTGGAAC AAGCTCGAGG TCGGCAAGTG CCGCTACGGG CTGATGCTCG ACGAGAACGG CATGGTGTTC GACGACGGCG TGACCGTGCG CCTGGGCGAC CAGCACTTCA TGATGACGAC CACCACGGGC GGCGCCGCGC GCGTGCTCAC GTGGCTCGAG CGCTGGCTGC AGACGGAATG GCCGGACATG AAGGTGCGCC TTTCGTCCGT CACCGATCAC TGGGCGACGT TCGCGGTGGT CGGCCCGAAG AGCCGCCGGG TCGTGCAGAA GGTGTGCAAG GACATCGACT TCGCGAACGA CGCGTTCCCG TTCATGAGCT ATCGCGACGG CACGGTCGCC GGCGTGAAGT CGCGCGTGAT GCGCATCAGC TTCTCGGGCG AGCTCGCGTA CGAAGTGAAC GTGCCGGCGA ACGCGGGCCG CGCGGTATGG GAAGCGCTGA TGGACGCGGG CGCGGAGTTC GACATCACGC CGTACGGCAC CGAGACGATG CACGTGCTGC GCGCGGAGAA GGGCTACATC ATCGTCGGTC AGGATACCGA CGGATCGATC ACGCCGTTCG ATCTCGGCAT GGGCGGGCTC GTCGCGAAAT CGAAGGATTT CCTCGGCCGC CGCTCGCTCA CGCGCGCCGA TACCGCGAAG AGCGGCCGCA AGCAGTTCGT CGGCCTGCTG ACCGACGACG CGCAGTCTGT TTTGCCCGAA GGCGGCCAGA TCGTCGAGCT CGATGCGGCC GCGCGTGCGG ACGGCACGAC GCCGATGCTC GGTCACGTGA CGTCGAGCTA TTACAGTCCG ATCCTGAACC GCTCGATCGC GCTCGCGGTC GTGAAGGGCG GATTGAGCCG GATGGGCGAG CGCGTCGCGG TCTCGCTCGC GAACGGGCGG CGTGTCGCCG CGACGATTTC GAGCCCGGTT TTCTACGACA CCGAAGGGGT ACGTCAACAT GTGGAATGA
|
Protein sequence | MSQKDRLGAG GRINRAQPLT FTFNGRTYQG FQGDTLASAL LANGVHFVAR SFKYHRPRGI VTAGVDEPNA VVQLETGAYT VPNARATEVE LYQGLVATSV NAKPSLEHDR MAVMQKLARF LPAGFYYKTF MWPRNLWPKY EEKIREAAGL GKAPDTLDAD RYDKCYAHCD VLVVGGGPTG LAAAHAAAVN GARVILVDDQ RELGGSLLAC RAEIDGKPAL QWVEKIEAEL AKLPDMSILT RSTAFGYQDH NLVTVVQRLT DHLPVSMRKG TREMIWKVRA KRVILATGAH ERPLVFGNND LPGVMTASAV SAYIHRYGVL PGRVAVVATN NDRGYQCALD LKACGAKVTV VDARASTRGA LPAVAKRHGI TVMSGAAVSA AAGKLRVASV DVVSYANGRS GGKIATLPCD LVAMSGGFSP VLHLFAQSGG KAHWNDDKAC FVPGKPVQAE ASVGAAAGEF ELARALRLAL DAGVAAAKSA GFAAERPPVP KLAEAVEDAL LPLWLASGAE AAVRGPKQFV DFQNDVGAAD ILLAAREGFE SVEHVKRYTA MGFGTDQGKL GNINGMAILA QALGKTIPET GTTTFRPNYT PVSFGAFAGR ELGDFLDPIR KTCVHEWHVE HGAMFEDVGN WKRPWYFPRN GEDLHAAVKR ECLAVRNGVG MLDASTLGKI DIQGPDAVKL LNWVYTNPWN KLEVGKCRYG LMLDENGMVF DDGVTVRLGD QHFMMTTTTG GAARVLTWLE RWLQTEWPDM KVRLSSVTDH WATFAVVGPK SRRVVQKVCK DIDFANDAFP FMSYRDGTVA GVKSRVMRIS FSGELAYEVN VPANAGRAVW EALMDAGAEF DITPYGTETM HVLRAEKGYI IVGQDTDGSI TPFDLGMGGL VAKSKDFLGR RSLTRADTAK SGRKQFVGLL TDDAQSVLPE GGQIVELDAA ARADGTTPML GHVTSSYYSP ILNRSIALAV VKGGLSRMGE RVAVSLANGR RVAATISSPV FYDTEGVRQH VE
|
| |