Gene BURPS668_A1952 details

Gene Information

Locus tag: BURPS668_A1952
Symbol:
ID: 4886767
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 1890076
End bp: 1893084
Gene Length: 3009 bp
Protein Length: 1002 aa
Translation table: 11
GC content: 69%
IMG OID: 640131890
Product: sarcosine oxidase, alpha subunit, heterotetrameric
Protein accession: YP_001062947
Protein GI: 126444992
COG category: [E] Amino acid transport and metabolism
              [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0404] Glycine cleavage system T protein (aminomethyltransferase)
        [COG0492] Thioredoxin reductase
TIGRFAM ID: [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form
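
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of the database.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, which adds no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1890076, 1893084)
print(length, protein_length_aa(length))  # 3009 1002
```

Both values agree with the Gene Length and Protein Length fields of this record.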


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCAGA AAGACCGACT CGGCGCAGGC GGGCGCATCA ACCGCGCACA GCCGCTCACC 
TTCACGTTCA ACGGCCGCAC GTATCAGGGC TTCCAGGGCG ACACGCTCGC GTCTGCGCTG
CTCGCCAACG GCGTGCACTT CGTCGCGCGC AGCTTCAAGT ACCACCGTCC GCGCGGGATC
GTGACGGCGG GCGTCGACGA GCCGAACGCC GTCGTGCAGC TCGAAACCGG CGCGTACACG
GTGCCGAACG CGCGCGCGAC CGAGGTCGAG CTGTATCAGG GGCTCGTCGC GACGAGCGTG
AACGCGAAGC CGTCGCTCGA GCACGACCGG ATGGCGGTGA TGCAGAAGCT CGCGCGCTTC
CTGCCGGCGG GCTTCTACTA CAAGACGTTC ATGTGGCCGC GCAATCTGTG GCCGAAGTAC
GAAGAGAAGA TCCGCGAGGC GGCCGGCCTC GGCAAGGCGC CCGACACGCT CGACGCCGAC
CGCTACGACA AGTGCTACGC GCACTGCGAC GTGCTCGTCG TCGGCGGCGG CCCGACGGGG
CTCGCGGCCG CGCATGCGGC GGCCGTCAAC GGCGCGCGCG TGATCCTCGT CGACGATCAG
CGCGAGCTGG GCGGCAGCCT GCTCGCGTGC CGCGCGGAGA TCGACGGCAA GCCGGCGCTG
CAATGGGTCG AGAAGATCGA GGCGGAGCTC GCGAAGCTGC CCGACGTGAG CATCCTCACG
CGCAGCACCG CGTTCGGCTA TCAGGATCAC AACCTCGTGA CCGTCGTGCA GCGGCTCACC
GATCATCTGC CGGTGTCGAT GCGCAAGGGC ACGCGCGAGA TGATCTGGAA GGTGCGCGCC
AAGCGCGTGA TCCTCGCCAC GGGCGCGCAC GAGCGGCCGC TCGTGTTCGG CAATAACGAT
CTGCCGGGCG TGATGACCGC GTCGGCCGTG TCGGCGTACA TCCATCGCTA CGGCGTGCTG
CCGGGGCGCG TCGCGGTCGT CGCGACGAAC AACGATCGCG GCTATCAGTG CGCGCTCGAT
CTGAAGGCGT GCGGCGCGAA GGTGACGGTC GTCGATGCGC GCGCGTCGAC GCGCGGCGCA
TTGCCCGCGG TCGCCAAGCG CCACGGCATC ACGGTGATGA GCGGCGCGGC CGTGTCGGCT
GCGGCGGGCA AGCTGCGCGT CGCGTCGGTC GATGTCGTCT CCTATGCCAA TGGCCGCTCG
GGCGGCAAGA TCGCGACGCT GCCGTGCGAT CTGGTCGCGA TGTCGGGCGG CTTCAGCCCG
GTGCTGCACC TGTTCGCGCA ATCGGGCGGC AAGGCGCACT GGAACGACGA CAAGGCCTGC
TTCGTGCCCG GCAAGCCGGT GCAGGCGGAA GCGAGCGTCG GCGCGGCGGC GGGCGAGTTC
GAGCCCGCGC GCGCGCTGCG GCTCGCGCTC GACGCGGGCG TCGCCGCGGC GAAATCGGCG
GGCTTTGCCG CCGAGCGTCC GCCCGTGCCG AAGCTCGCCG AGGCGGTGGA GGACGCGCTG
CTGCCGTTGT GGCTCGCGAG CGGCGCAGAA GCGGCGGTTC GCGGTCCGAA GCAGTTCGTC
GATTTCCAGA ACGACGTCGG CGCGGCCGAC ATCCTGCTCG CCGCGCGCGA AGGTTTCGAA
TCGGTCGAGC ACGTGAAGCG CTACACGGCG ATGGGCTTCG GCACCGATCA GGGCAAGCTC
GGCAACATCA ACGGGATGGC GATCCTCGCG CAGGCGCTCG GCAAGACGAT TCCGGAGACG
GGCACGACGA CGTTCCGCCC GAACTACACG CCGGTGTCGT TCGGCGCGTT CGCGGGCCGC
GAGCTCGGCG ATTTCCTCGA CCCGATCCGC AAGACCTGCG TTCACGAATG GCATGTCGAG
CACGGCGCGA TGTTCGAGGA CGTCGGCAAC TGGAAGCGGC CGTGGTACTT CCCGCGCAAC
GGCGAGGATC TGCACGCGGC GGTCAAGCGC GAGTGCCTCG CGGTGCGCAA CGGCGTCGGC
ATGCTCGATG CGTCGACGCT CGGCAAGATC GATATCCAGG GCCCGGACGC GGTGAAGCTG
CTGAACTGGG TGTACACGAA CCCGTGGAAC AAGCTCGAGG TCGGCAAGTG CCGCTACGGG
CTGATGCTCG ACGAGAACGG CATGGTGTTC GACGACGGCG TGACCGTGCG CCTGGGCGAC
CAGCACTTCA TGATGACGAC CACCACGGGC GGCGCCGCGC GCGTGCTCAC GTGGCTCGAG
CGCTGGCTGC AGACGGAATG GCCGGACATG AAGGTGCGCC TTTCGTCCGT CACCGATCAC
TGGGCGACGT TCGCGGTGGT CGGCCCGAAG AGCCGCCGGG TCGTGCAGAA GGTGTGCAAG
GACATCGACT TCGCGAACGA CGCGTTCCCG TTCATGAGCT ATCGCGACGG CACGGTCGCC
GGCGTGAAGT CGCGCGTGAT GCGCATCAGC TTCTCGGGCG AGCTCGCGTA CGAAGTGAAC
GTGCCGGCGA ACGCGGGCCG CGCGGTATGG GAAGCGCTGA TGGACGCGGG CGCGGAGTTC
GACATCACGC CGTACGGCAC CGAGACGATG CACGTGCTGC GCGCGGAGAA GGGCTACATC
ATCGTCGGTC AGGATACCGA CGGATCGATC ACGCCGTTCG ATCTCGGCAT GGGCGGGCTC
GTCGCGAAAT CGAAGGATTT CCTCGGCCGC CGCTCGCTCA CGCGCGCCGA TACCGCGAAG
AGCGGCCGCA AGCAGTTCGT CGGCCTGCTG ACCGACGACG CGCAGTCTGT TTTGCCCGAA
GGCGGCCAGA TCGTCGAGCT CGATGCGGCC GCGCGTGCGG ACGGCACGAC GCCGATGCTC
GGTCACGTGA CGTCGAGCTA TTACAGTCCG ATCCTGAACC GCTCGATCGC GCTTGCGGTC
GTGAAGGGCG GATTGAGCCG GATGGGCGAG CGCGTCGCGG TCTCGCTCGC GAACGGGCGG
CGCGTCGCCG CGACGATTTC GAGCCCGGTT TTCTACGACA CCGAAGGGGT ACGTCAACAT
GTGGAATGA
 
Protein sequence
MSQKDRLGAG GRINRAQPLT FTFNGRTYQG FQGDTLASAL LANGVHFVAR SFKYHRPRGI 
VTAGVDEPNA VVQLETGAYT VPNARATEVE LYQGLVATSV NAKPSLEHDR MAVMQKLARF
LPAGFYYKTF MWPRNLWPKY EEKIREAAGL GKAPDTLDAD RYDKCYAHCD VLVVGGGPTG
LAAAHAAAVN GARVILVDDQ RELGGSLLAC RAEIDGKPAL QWVEKIEAEL AKLPDVSILT
RSTAFGYQDH NLVTVVQRLT DHLPVSMRKG TREMIWKVRA KRVILATGAH ERPLVFGNND
LPGVMTASAV SAYIHRYGVL PGRVAVVATN NDRGYQCALD LKACGAKVTV VDARASTRGA
LPAVAKRHGI TVMSGAAVSA AAGKLRVASV DVVSYANGRS GGKIATLPCD LVAMSGGFSP
VLHLFAQSGG KAHWNDDKAC FVPGKPVQAE ASVGAAAGEF EPARALRLAL DAGVAAAKSA
GFAAERPPVP KLAEAVEDAL LPLWLASGAE AAVRGPKQFV DFQNDVGAAD ILLAAREGFE
SVEHVKRYTA MGFGTDQGKL GNINGMAILA QALGKTIPET GTTTFRPNYT PVSFGAFAGR
ELGDFLDPIR KTCVHEWHVE HGAMFEDVGN WKRPWYFPRN GEDLHAAVKR ECLAVRNGVG
MLDASTLGKI DIQGPDAVKL LNWVYTNPWN KLEVGKCRYG LMLDENGMVF DDGVTVRLGD
QHFMMTTTTG GAARVLTWLE RWLQTEWPDM KVRLSSVTDH WATFAVVGPK SRRVVQKVCK
DIDFANDAFP FMSYRDGTVA GVKSRVMRIS FSGELAYEVN VPANAGRAVW EALMDAGAEF
DITPYGTETM HVLRAEKGYI IVGQDTDGSI TPFDLGMGGL VAKSKDFLGR RSLTRADTAK
SGRKQFVGLL TDDAQSVLPE GGQIVELDAA ARADGTTPML GHVTSSYYSP ILNRSIALAV
VKGGLSRMGE RVAVSLANGR RVAATISSPV FYDTEGVRQH VE
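
The two sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch, using only the Python standard library, of how the gene sequence could be cleaned, its GC percentage computed, and its translation compared with the protein sequence. The helper names are hypothetical. The codon table is built on the assumption that translation table 11 assigns the same amino acids to codons as the standard genetic code; the table 11 rule that alternative start codons such as GTG or TTG are read as methionine is not handled here, which does not matter for this gene because it begins with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TTT, TTC, TTA, TTG, TCT, ... order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_line = "ATGAGCCAGA AAGACCGACT CGGCGCAGGC"
print(translate(clean_sequence(first_line)))  # MSQKDRLGAG
```

The ten residues printed for the first thirty bases match the start of the protein sequence listed above. Passing the full cleaned gene sequence to gc_percent gives a value that can be compared with the GC content field.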