Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II0998 |
Symbol | |
ID | 3844797 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 1183890 |
End bp | 1186898 |
Gene Length | 3009 bp |
Protein Length | 1002 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637838301 |
Product | sarcosine oxidase, alpha subunit |
Protein accession | YP_439195 |
Protein GI | 83717636 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGA AAGACCGACT CGGCGCAGGC GGGCGCATCA ACCGCGCACA GCCGCTCACC TTCACGTTCA ACGGCCGCAC GTATCAGGGC TTCCAGGGCG ACACGCTCGC GTCGGCGCTG CTCGCCAACG GCGTGCACTT CGTCGCGCGC AGCTTCAAGT ATCACCGCCC GCGCGGGATC GTGACGGCGG GCGTTGACGA GCCGAACGCC GTCGTGCAGC TCGAAACCGG CGCGCACACG GTGCCGAACG CGCGCGCGAC CGAGATCGAG CTGTATCAGG GGCTCGTCGC GACAAGCGTG AACGCGAAGC CGTCGCTCGA GCACGACCGG ATGGCGGTGA TGCAGAAGTT CGCGCGCTTC CTGCCGGCGG GCTTCTATTA CAAGACGTTC ATGTGGCCGC GCAATCTGTG GCCGAAGTAC GAAGAGAAGA TCCGCGAGGC GGCCGGCCTC GGCAAGGCGC CCGACACGCT TGACGCCGAC CGCTACGACA AGTGCTACGC GCACTGCGAC GTGCTCGTCG TCGGCGGCGG TCCGGCGGGG CTCGCGTCCG CGCACGCGGC GGCCGTCAAC GGCGCGCGCG TGATCCTCGT CGACGATCAG CGCGAGCTGG GCGGCAGCCT GCTCGCGTGC CGCGCGGAGA TCGACGGCAA GCCGGCGCTG CAATGGGTCG AGAAGATCGA GGCGGAACTC TCGAAGCTCC CCGACGTGAA GATCCTCACG CGCAGCACCG CGTTCGGCTA TCAGGATCAC AACCTCGTGA CCGTCGTGCA GCGGCTCACC GATCATCTGC CGGTGTCGAT GCGCAAGGGC ACGCGCGAGA TGATCTGGAA GGTGCGCGCC AAGCGCGTGA TCCTCGCCAC GGGCGCGCAC GAGCGGCCGC TCGTGTTCGG CAACAACGAT CTGCCGGGCG TGATGACCGC GTCGGCCGTG TCGACATACA TCCATCGCTA CGGCGTGCTG CCGGGGCGCG TCGCGGTCGT CGCGACGAAC AACGATCGCG GCTATCAGTG CGCGCTCGAC CTGAAGGCGT GCGGCGCGAA GGTGACGGTC GTCGACGCGC GCGCGTCGAC GCGCGGCGCG CTGCCCGCGG TCGCGAAACG CAACGGCGTC ACGGTGATGA GCGGCGCGGT CGTGTCGGCC GCCGCGGGCA AGCTGCGGGT CGCGTCGGTC GACGTCGCGT CGTACGCGAA CGGCCGCTCG GGCGGCAAGA TCGCGACGCT GCCGTGCGAT CTCGTCGCGA TGTCGGGCGG CTTCAGCCCG GTGCTGCACC TGTTCGCGCA ATCGGGCGGC AAGGCGCACT GGAACGACGA CAAGGCCTGC TTCGTGCCCG GCAAGCCGGT GCAGGCGGAA GCGAGCGTCG GCGCGGCGGC GGGCGAGTTC GAGCTGTCGC GCGCGCTGCG GCTCGCGGTC GACGCGGGCG TGGCCGCGGC GAAATCGACG GGCTTCGCCG CCGAGCGGCC GCCCGTGCCG AAGCTCGCCG AGGCGGTCGA GGACGCGCTG CTGCCTTTGT GGCTCGCGAG CGGCGCCGAG GCGGCGGTTC GCGGTCCGAA GCAGTTCGTC GATTTCCAGA ACGACGTCGG CGCGGCCGAC ATCCTGCTCG CCGCGCGCGA AGGCTTCGAA TCGGTCGAGC ACGTGAAGCG CTACACGGCG ATGGGTTTCG GCACCGATCA GGGCAAGCTC GGCAACATCA ACGGGATGGC GATTCTCGCG CAGGCGCTCG GCAAGACGAT TCCGGAGACG GGCACGACGA CGTTCCGCCC GAACTACACG CCAGTGTCGT TCGGCGCGTT CGCGGGCCGC GAGCTCGGCG ATTTCCTCGA CCCGATCCGC AAGACCTGCG TGCACGAATG GCACGTCGAG CACGGCGCGA TGTTCGAGGA CGTCGGCAAC TGGAAGCGGC CGTGGTACTT CCCGCGCAAC GGCGAGGACC TGCACGCGGC GGTCAAGCGC GAATGCCTCG CGGTGCGCAA CGGTGTCGGC ATCCTCGATG CGTCGACGCT CGGCAAGATC GACATCCAGG GCCCGGACGC GGTGAAGCTG CTGAACTGGG TCTACACGAA CCCGTGGAAC AAGCTCGAAG TCGGCAAGTG CCGCTACGGG CTGATGCTCG ACGAGAACGG CATGGTGTTC GACGACGGCG TGACCGTGCG CCTGGGCGAA CAGCACTTCA TGATGACGAC GACCACGGGC GGCGCCGCGC GCGTGCTCAC GTGGCTCGAG CGCTGGCTGC AGACGGAGTG GCCGGACATG AAGGTGCGCC TTTCGTCCGT CACCGATCAC TGGGCGACGT TCGCGGTGGT CGGCCCGAAG AGCCGCAAGG TCGTGCAGAA GGTGTGCAAG GACATCGATT TCGCGAACGA CGCGTTCCCG TTCATGAGCT ATCGGGACGG CACGGTCGCC GGCGTGAAGT CGCGCGTGAT GCGCATCAGC TTCTCCGGCG AACTCGCGTA CGAAGTGAAC GTGCCGGCGA ACGCGGGCCG CGCGGTGTGG GAAGCGCTGA TGGAAGCGGG CGCGGAGTTC GACATCACGC CGTACGGCAC CGAGACGATG CACGTGCTGC GCGCGGAGAA GGGCTACATC ATCGTCGGTC AGGATACCGA CGGATCGATC ACGCCGTTCG ATCTCGGCAT GGGCGGGCTC GTCGCGAAGT CGAAGGATTT TCTCGGCCGC CGCTCGCTCA CGCGCGCCGA TACCGCGAAG AGCGGCCGCA AGCAGTTCGT CGGGCTGCTG ACCGACGATG CGCAATACGT GCTGCCGGAA GGCGGCCAGA TCGTCGAGCT CGACGCGGCC GCGCGCGCGG ACGGCACGAC GCCGATGCTC GGCCACGTGA CGTCGAGCTA TTACAGCCCG ATCCTGAACC GCTCGATCGC GCTCGCGGTC GTGAAGGGCG GATTGAGCCG GATGGGCGAG CGCGTTGCGG TTTCGCTCGC GAACGGGCGG CGCGTCGCCG CGACGATTTC GAGCCCGGTT TTCTACGACA CCGAAGGGGT ACGCCAACAT GTGGAATGA
|
Protein sequence | MSQKDRLGAG GRINRAQPLT FTFNGRTYQG FQGDTLASAL LANGVHFVAR SFKYHRPRGI VTAGVDEPNA VVQLETGAHT VPNARATEIE LYQGLVATSV NAKPSLEHDR MAVMQKFARF LPAGFYYKTF MWPRNLWPKY EEKIREAAGL GKAPDTLDAD RYDKCYAHCD VLVVGGGPAG LASAHAAAVN GARVILVDDQ RELGGSLLAC RAEIDGKPAL QWVEKIEAEL SKLPDVKILT RSTAFGYQDH NLVTVVQRLT DHLPVSMRKG TREMIWKVRA KRVILATGAH ERPLVFGNND LPGVMTASAV STYIHRYGVL PGRVAVVATN NDRGYQCALD LKACGAKVTV VDARASTRGA LPAVAKRNGV TVMSGAVVSA AAGKLRVASV DVASYANGRS GGKIATLPCD LVAMSGGFSP VLHLFAQSGG KAHWNDDKAC FVPGKPVQAE ASVGAAAGEF ELSRALRLAV DAGVAAAKST GFAAERPPVP KLAEAVEDAL LPLWLASGAE AAVRGPKQFV DFQNDVGAAD ILLAAREGFE SVEHVKRYTA MGFGTDQGKL GNINGMAILA QALGKTIPET GTTTFRPNYT PVSFGAFAGR ELGDFLDPIR KTCVHEWHVE HGAMFEDVGN WKRPWYFPRN GEDLHAAVKR ECLAVRNGVG ILDASTLGKI DIQGPDAVKL LNWVYTNPWN KLEVGKCRYG LMLDENGMVF DDGVTVRLGE QHFMMTTTTG GAARVLTWLE RWLQTEWPDM KVRLSSVTDH WATFAVVGPK SRKVVQKVCK DIDFANDAFP FMSYRDGTVA GVKSRVMRIS FSGELAYEVN VPANAGRAVW EALMEAGAEF DITPYGTETM HVLRAEKGYI IVGQDTDGSI TPFDLGMGGL VAKSKDFLGR RSLTRADTAK SGRKQFVGLL TDDAQYVLPE GGQIVELDAA ARADGTTPML GHVTSSYYSP ILNRSIALAV VKGGLSRMGE RVAVSLANGR RVAATISSPV FYDTEGVRQH VE
|
| |