Gene BURPS1106A_A1860 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1860 
SymbolsoxA 
ID4903491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1823960 
End bp1826968 
Gene Length3009 bp 
Protein Length1002 aa 
Translation table11 
GC content68% 
IMG OID640144966 
Productsarcosine oxidase, alpha subunit 
Protein accessionYP_001075894 
Protein GI126455912 
COG category[E] Amino acid transport and metabolism
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase)
[COG0492] Thioredoxin reductase 
TIGRFAM ID[TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.769627 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAGA AAGACCGACT CGGCGCAGGC GGGCGCATCA ACCGCGCACA GCCGCTCACC 
TTCACGTTCA ACGGCCGCAC GTATCAGGGC TTCCAGGGCG ACACGCTCGC GTCTGCGCTG
CTCGCCAACG GCGTGCACTT CGTCGCGCGC AGCTTCAAGT ACCACCGTCC GCGCGGGATC
GTGACGGCGG GCGTCGACGA GCCGAACGCC GTCGTGCAGC TCGAAACCGG CGCGTACACG
GTGCCGAACG CGCGCGCGAC CGAGGTCGAG CTGTATCAGG GGCTCGTCGC GACGAGCGTG
AACGCGAAGC CGTCGCTCGA GCACGACCGG ATGGCGGTGA TGCAGAAGCT CGCGCGTTTC
CTGCCGGCGG GCTTCTACTA CAAGACGTTC ATGTGGCCGC GCAATCTGTG GCCGAAGTAC
GAGGAGAAGA TCCGCGAGGC GGCCGGCCTC GGCAAGGCGC CCGACACGCT CGACGCCGAC
CGCTACGACA AGTGCTACGC GCACTGCGAC GTGCTCGTCG TCGGCGGCGG CCCGACGGGG
CTCGCGGCCG CGCATGCGGC GGCCGTCAAC GGTGCGCGCG TGATCCTCGT CGACGATCAG
CGCGAGCTGG GCGGCAGCCT GCTCGCGTGC CGCGCGGAGA TCGACGGCAA GCCGGCGCTG
CAATGGGTCG AGAAGATCGA GGCGGAGCTC GCGAAGCTGC CCGACGTGAG CATCCTCACG
CGCAGCACCG CGTTCGGCTA TCAGGATCAC AACCTCGTGA CCGTCGTGCA GCGGCTCACC
GATCATCTGC CGGTGTCGAT GCGCAAGGGC ACGCGCGAGA TGATCTGGAA GGTGCGCGCC
AAGCGCGTGA TCCTCGCCAC GGGCGCGCAC GAGCGGCCGC TCGTGTTCGG CAATAACGAT
CTGCCGGGCG TGATGACCGC GTCGGCCGTG TCGGCGTACA TCCATCGCTA CGGCGTGCTG
CCGGGCCGCG TCGCGGTCGT CGCGACGAAC AACGATCGCG GCTATCAGTG CGCGCTCGAT
CTGAAGGCAT GCGGCGCGAA GGTGACGGTC GTCGACGCGC GCGCGTCGAC GCGCGGCGCA
TTGCCCGCGG TCGCCAAGCG CCACGGCATC ACGGTGATGA GCGGCGCGGC CGTGTCGGCT
GCGGCGGGCA AGCTGCGCGT CGCGTCGGTC GATGTCGTCT CCTATGCCAA TGGCCGCTCG
GGCGGCAAGA TCGCGACGCT GCCGTGCGAT CTGGTCGCGA TGTCGGGCGG CTTCAGCCCG
GTGCTGCACC TGTTCGCGCA ATCGGGCGGC AAGGCGCACT GGAACGACGA CAAGGCCTGC
TTCGTGCCCG GCAAGCCGGT GCAGGCGGAA GCGAGCGTCG GCGCGGCGGC CGGCGAGTTC
GAGCTCGCGC GCGCGCTGCG GCTCGCGCTC GACGCGGGCG TCGCCGCGGC GAAATCGGCG
GGCTTTGCCG CCGAGCGTCC GCCCGTGCCG AAGCTCGCCG AGGCGGTGGA GGACGCGCTG
CTGCCGTTGT GGCTCGCGAG CGGCGCAGAA GCGGCGGTTC GCGGTCCGAA GCAGTTCGTC
GATTTCCAGA ACGACGTCGG CGCGGCCGAC ATCCTGCTCG CCGCGCGCGA AGGCTTCGAA
TCGGTCGAGC ACGTGAAGCG CTACACGGCG ATGGGCTTCG GCACCGATCA GGGCAAGCTC
GGCAACATCA ACGGGATGGC GATCCTCGCG CAGGCGCTCG GCAAGACGAT TCCGGAGACG
GGCACGACGA CGTTCCGCCC GAACTACACG CCGGTGTCGT TCGGCGCGTT CGCGGGCCGC
GAGCTCGGCG ATTTCCTCGA CCCGATCCGC AAGACCTGCG TTCACGAATG GCATGTCGAG
CACGGCGCGA TGTTCGAGGA CGTCGGCAAC TGGAAGCGGC CGTGGTACTT CCCGCGCAAC
GGCGAGGATC TGCACGCGGC GGTCAAGCGC GAGTGCCTCG CGGTGCGCAA CGGCGTCGGC
ATGCTCGATG CGTCGACGCT CGGCAAGATC GATATCCAGG GCCCGGACGC GGTGAAGCTG
CTGAACTGGG TATACACGAA CCCGTGGAAC AAGCTCGAGG TCGGCAAGTG CCGCTACGGG
CTGATGCTCG ACGAGAACGG CATGGTGTTC GACGACGGCG TGACCGTGCG CCTGGGCGAC
CAGCACTTCA TGATGACGAC CACCACGGGC GGCGCCGCGC GCGTGCTCAC GTGGCTCGAG
CGCTGGCTGC AGACGGAATG GCCGGACATG AAGGTGCGCC TTTCGTCCGT CACCGATCAC
TGGGCGACGT TCGCGGTGGT CGGCCCGAAG AGCCGCCGGG TCGTGCAGAA GGTGTGCAAG
GACATCGACT TCGCGAACGA CGCGTTCCCG TTCATGAGCT ATCGCGACGG CACGGTCGCC
GGCGTGAAGT CGCGCGTGAT GCGCATCAGC TTCTCGGGCG AGCTCGCGTA CGAAGTGAAC
GTGCCGGCGA ACGCGGGCCG CGCGGTATGG GAAGCGCTGA TGGACGCGGG CGCGGAGTTC
GACATCACGC CGTACGGCAC CGAGACGATG CACGTGCTGC GCGCGGAGAA GGGCTACATC
ATCGTCGGTC AGGATACCGA CGGATCGATC ACGCCGTTCG ATCTCGGCAT GGGCGGGCTC
GTCGCGAAAT CGAAGGATTT CCTCGGCCGC CGCTCGCTCA CGCGCGCCGA TACCGCGAAG
AGCGGCCGCA AGCAGTTCGT CGGCCTGCTG ACCGACGACG CGCAGTCTGT TTTGCCCGAA
GGCGGCCAGA TCGTCGAGCT CGATGCGGCC GCGCGTGCGG ACGGCACGAC GCCGATGCTC
GGTCACGTGA CGTCGAGCTA TTACAGTCCG ATCCTGAACC GCTCGATCGC GCTCGCGGTC
GTGAAGGGCG GATTGAGCCG GATGGGCGAG CGCGTCGCGG TCTCGCTCGC GAACGGGCGG
CGTGTCGCCG CGACGATTTC GAGCCCGGTT TTCTACGACA CCGAAGGGGT ACGTCAACAT
GTGGAATGA
 
Protein sequence
MSQKDRLGAG GRINRAQPLT FTFNGRTYQG FQGDTLASAL LANGVHFVAR SFKYHRPRGI 
VTAGVDEPNA VVQLETGAYT VPNARATEVE LYQGLVATSV NAKPSLEHDR MAVMQKLARF
LPAGFYYKTF MWPRNLWPKY EEKIREAAGL GKAPDTLDAD RYDKCYAHCD VLVVGGGPTG
LAAAHAAAVN GARVILVDDQ RELGGSLLAC RAEIDGKPAL QWVEKIEAEL AKLPDVSILT
RSTAFGYQDH NLVTVVQRLT DHLPVSMRKG TREMIWKVRA KRVILATGAH ERPLVFGNND
LPGVMTASAV SAYIHRYGVL PGRVAVVATN NDRGYQCALD LKACGAKVTV VDARASTRGA
LPAVAKRHGI TVMSGAAVSA AAGKLRVASV DVVSYANGRS GGKIATLPCD LVAMSGGFSP
VLHLFAQSGG KAHWNDDKAC FVPGKPVQAE ASVGAAAGEF ELARALRLAL DAGVAAAKSA
GFAAERPPVP KLAEAVEDAL LPLWLASGAE AAVRGPKQFV DFQNDVGAAD ILLAAREGFE
SVEHVKRYTA MGFGTDQGKL GNINGMAILA QALGKTIPET GTTTFRPNYT PVSFGAFAGR
ELGDFLDPIR KTCVHEWHVE HGAMFEDVGN WKRPWYFPRN GEDLHAAVKR ECLAVRNGVG
MLDASTLGKI DIQGPDAVKL LNWVYTNPWN KLEVGKCRYG LMLDENGMVF DDGVTVRLGD
QHFMMTTTTG GAARVLTWLE RWLQTEWPDM KVRLSSVTDH WATFAVVGPK SRRVVQKVCK
DIDFANDAFP FMSYRDGTVA GVKSRVMRIS FSGELAYEVN VPANAGRAVW EALMDAGAEF
DITPYGTETM HVLRAEKGYI IVGQDTDGSI TPFDLGMGGL VAKSKDFLGR RSLTRADTAK
SGRKQFVGLL TDDAQSVLPE GGQIVELDAA ARADGTTPML GHVTSSYYSP ILNRSIALAV
VKGGLSRMGE RVAVSLANGR RVAATISSPV FYDTEGVRQH VE