Gene BTH_II0998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II0998 
Symbol 
ID3844797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp1183890 
End bp1186898 
Gene Length3009 bp 
Protein Length1002 aa 
Translation table11 
GC content68% 
IMG OID637838301 
Productsarcosine oxidase, alpha subunit 
Protein accessionYP_439195 
Protein GI83717636 
COG category[E] Amino acid transport and metabolism
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase)
[COG0492] Thioredoxin reductase 
TIGRFAM ID[TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAGA AAGACCGACT CGGCGCAGGC GGGCGCATCA ACCGCGCACA GCCGCTCACC 
TTCACGTTCA ACGGCCGCAC GTATCAGGGC TTCCAGGGCG ACACGCTCGC GTCGGCGCTG
CTCGCCAACG GCGTGCACTT CGTCGCGCGC AGCTTCAAGT ATCACCGCCC GCGCGGGATC
GTGACGGCGG GCGTTGACGA GCCGAACGCC GTCGTGCAGC TCGAAACCGG CGCGCACACG
GTGCCGAACG CGCGCGCGAC CGAGATCGAG CTGTATCAGG GGCTCGTCGC GACAAGCGTG
AACGCGAAGC CGTCGCTCGA GCACGACCGG ATGGCGGTGA TGCAGAAGTT CGCGCGCTTC
CTGCCGGCGG GCTTCTATTA CAAGACGTTC ATGTGGCCGC GCAATCTGTG GCCGAAGTAC
GAAGAGAAGA TCCGCGAGGC GGCCGGCCTC GGCAAGGCGC CCGACACGCT TGACGCCGAC
CGCTACGACA AGTGCTACGC GCACTGCGAC GTGCTCGTCG TCGGCGGCGG TCCGGCGGGG
CTCGCGTCCG CGCACGCGGC GGCCGTCAAC GGCGCGCGCG TGATCCTCGT CGACGATCAG
CGCGAGCTGG GCGGCAGCCT GCTCGCGTGC CGCGCGGAGA TCGACGGCAA GCCGGCGCTG
CAATGGGTCG AGAAGATCGA GGCGGAACTC TCGAAGCTCC CCGACGTGAA GATCCTCACG
CGCAGCACCG CGTTCGGCTA TCAGGATCAC AACCTCGTGA CCGTCGTGCA GCGGCTCACC
GATCATCTGC CGGTGTCGAT GCGCAAGGGC ACGCGCGAGA TGATCTGGAA GGTGCGCGCC
AAGCGCGTGA TCCTCGCCAC GGGCGCGCAC GAGCGGCCGC TCGTGTTCGG CAACAACGAT
CTGCCGGGCG TGATGACCGC GTCGGCCGTG TCGACATACA TCCATCGCTA CGGCGTGCTG
CCGGGGCGCG TCGCGGTCGT CGCGACGAAC AACGATCGCG GCTATCAGTG CGCGCTCGAC
CTGAAGGCGT GCGGCGCGAA GGTGACGGTC GTCGACGCGC GCGCGTCGAC GCGCGGCGCG
CTGCCCGCGG TCGCGAAACG CAACGGCGTC ACGGTGATGA GCGGCGCGGT CGTGTCGGCC
GCCGCGGGCA AGCTGCGGGT CGCGTCGGTC GACGTCGCGT CGTACGCGAA CGGCCGCTCG
GGCGGCAAGA TCGCGACGCT GCCGTGCGAT CTCGTCGCGA TGTCGGGCGG CTTCAGCCCG
GTGCTGCACC TGTTCGCGCA ATCGGGCGGC AAGGCGCACT GGAACGACGA CAAGGCCTGC
TTCGTGCCCG GCAAGCCGGT GCAGGCGGAA GCGAGCGTCG GCGCGGCGGC GGGCGAGTTC
GAGCTGTCGC GCGCGCTGCG GCTCGCGGTC GACGCGGGCG TGGCCGCGGC GAAATCGACG
GGCTTCGCCG CCGAGCGGCC GCCCGTGCCG AAGCTCGCCG AGGCGGTCGA GGACGCGCTG
CTGCCTTTGT GGCTCGCGAG CGGCGCCGAG GCGGCGGTTC GCGGTCCGAA GCAGTTCGTC
GATTTCCAGA ACGACGTCGG CGCGGCCGAC ATCCTGCTCG CCGCGCGCGA AGGCTTCGAA
TCGGTCGAGC ACGTGAAGCG CTACACGGCG ATGGGTTTCG GCACCGATCA GGGCAAGCTC
GGCAACATCA ACGGGATGGC GATTCTCGCG CAGGCGCTCG GCAAGACGAT TCCGGAGACG
GGCACGACGA CGTTCCGCCC GAACTACACG CCAGTGTCGT TCGGCGCGTT CGCGGGCCGC
GAGCTCGGCG ATTTCCTCGA CCCGATCCGC AAGACCTGCG TGCACGAATG GCACGTCGAG
CACGGCGCGA TGTTCGAGGA CGTCGGCAAC TGGAAGCGGC CGTGGTACTT CCCGCGCAAC
GGCGAGGACC TGCACGCGGC GGTCAAGCGC GAATGCCTCG CGGTGCGCAA CGGTGTCGGC
ATCCTCGATG CGTCGACGCT CGGCAAGATC GACATCCAGG GCCCGGACGC GGTGAAGCTG
CTGAACTGGG TCTACACGAA CCCGTGGAAC AAGCTCGAAG TCGGCAAGTG CCGCTACGGG
CTGATGCTCG ACGAGAACGG CATGGTGTTC GACGACGGCG TGACCGTGCG CCTGGGCGAA
CAGCACTTCA TGATGACGAC GACCACGGGC GGCGCCGCGC GCGTGCTCAC GTGGCTCGAG
CGCTGGCTGC AGACGGAGTG GCCGGACATG AAGGTGCGCC TTTCGTCCGT CACCGATCAC
TGGGCGACGT TCGCGGTGGT CGGCCCGAAG AGCCGCAAGG TCGTGCAGAA GGTGTGCAAG
GACATCGATT TCGCGAACGA CGCGTTCCCG TTCATGAGCT ATCGGGACGG CACGGTCGCC
GGCGTGAAGT CGCGCGTGAT GCGCATCAGC TTCTCCGGCG AACTCGCGTA CGAAGTGAAC
GTGCCGGCGA ACGCGGGCCG CGCGGTGTGG GAAGCGCTGA TGGAAGCGGG CGCGGAGTTC
GACATCACGC CGTACGGCAC CGAGACGATG CACGTGCTGC GCGCGGAGAA GGGCTACATC
ATCGTCGGTC AGGATACCGA CGGATCGATC ACGCCGTTCG ATCTCGGCAT GGGCGGGCTC
GTCGCGAAGT CGAAGGATTT TCTCGGCCGC CGCTCGCTCA CGCGCGCCGA TACCGCGAAG
AGCGGCCGCA AGCAGTTCGT CGGGCTGCTG ACCGACGATG CGCAATACGT GCTGCCGGAA
GGCGGCCAGA TCGTCGAGCT CGACGCGGCC GCGCGCGCGG ACGGCACGAC GCCGATGCTC
GGCCACGTGA CGTCGAGCTA TTACAGCCCG ATCCTGAACC GCTCGATCGC GCTCGCGGTC
GTGAAGGGCG GATTGAGCCG GATGGGCGAG CGCGTTGCGG TTTCGCTCGC GAACGGGCGG
CGCGTCGCCG CGACGATTTC GAGCCCGGTT TTCTACGACA CCGAAGGGGT ACGCCAACAT
GTGGAATGA
 
Protein sequence
MSQKDRLGAG GRINRAQPLT FTFNGRTYQG FQGDTLASAL LANGVHFVAR SFKYHRPRGI 
VTAGVDEPNA VVQLETGAHT VPNARATEIE LYQGLVATSV NAKPSLEHDR MAVMQKFARF
LPAGFYYKTF MWPRNLWPKY EEKIREAAGL GKAPDTLDAD RYDKCYAHCD VLVVGGGPAG
LASAHAAAVN GARVILVDDQ RELGGSLLAC RAEIDGKPAL QWVEKIEAEL SKLPDVKILT
RSTAFGYQDH NLVTVVQRLT DHLPVSMRKG TREMIWKVRA KRVILATGAH ERPLVFGNND
LPGVMTASAV STYIHRYGVL PGRVAVVATN NDRGYQCALD LKACGAKVTV VDARASTRGA
LPAVAKRNGV TVMSGAVVSA AAGKLRVASV DVASYANGRS GGKIATLPCD LVAMSGGFSP
VLHLFAQSGG KAHWNDDKAC FVPGKPVQAE ASVGAAAGEF ELSRALRLAV DAGVAAAKST
GFAAERPPVP KLAEAVEDAL LPLWLASGAE AAVRGPKQFV DFQNDVGAAD ILLAAREGFE
SVEHVKRYTA MGFGTDQGKL GNINGMAILA QALGKTIPET GTTTFRPNYT PVSFGAFAGR
ELGDFLDPIR KTCVHEWHVE HGAMFEDVGN WKRPWYFPRN GEDLHAAVKR ECLAVRNGVG
ILDASTLGKI DIQGPDAVKL LNWVYTNPWN KLEVGKCRYG LMLDENGMVF DDGVTVRLGE
QHFMMTTTTG GAARVLTWLE RWLQTEWPDM KVRLSSVTDH WATFAVVGPK SRKVVQKVCK
DIDFANDAFP FMSYRDGTVA GVKSRVMRIS FSGELAYEVN VPANAGRAVW EALMEAGAEF
DITPYGTETM HVLRAEKGYI IVGQDTDGSI TPFDLGMGGL VAKSKDFLGR RSLTRADTAK
SGRKQFVGLL TDDAQYVLPE GGQIVELDAA ARADGTTPML GHVTSSYYSP ILNRSIALAV
VKGGLSRMGE RVAVSLANGR RVAATISSPV FYDTEGVRQH VE