Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMAA0892 |
Symbol | |
ID | 3086997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei ATCC 23344 |
Kingdom | Bacteria |
Replicon accession | NC_006349 |
Strand | + |
Start bp | 911155 |
End bp | 913824 |
Gene Length | 2670 bp |
Protein Length | 889 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637564796 |
Product | sarcosine oxidase, alpha subunit, truncation |
Protein accession | YP_338415 |
Protein GI | 77358750 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGAAGC TCGCGCGCTT CCTGCCGGCG GGCTTCTACT ACAAGACGTT CATGTGGCCG CGCAATCTGT GGCCGAAGTA CGAGGAGAAG ATCCGCGAGG CGGCCGGCCT CGGCAAGGCG CCCGACACGC TCGACGCCGA CCGCTACGAC AAGTGCTACG CGCACTGCGA CGTGCTCGTC GTCGGCGGCG GCCCGACGGG GCTCGCGGCC GCGCATGCGG CGGCCGTCAA CGGCGCGCGC GTGATCCTCG TCGACGATCA GCGCGAGCTG GGCGGCAGCC TGCTCGCGTG CCGCGCGGAG ATCGACGGCA AGCCGGCGCT GCAATGGGTC GAGAAGATCG AGGCGGAGCT CGCGAAGCTG CCCGACGTGA GCATCCTCAC GCGCAGCACC GCGTTCGGCT ATCAGGATCA CAACCTCGTG ACCGTCGTGC AGCGGCTCAC CGATCATCTG CCGGTGTCGA TGCGCAAGGG CACGCGCGAG ATGATCTGGA AGGTGCGCGC CAAGCGCGTG ATCCTCGCCA CGGGCGCGCA CGAGCGGCCG CTCGTGTTCG GCAATAACGA TCTGCCGGGC GTGATGACCG CGTCGGCCGT GTCGGCGTAC ATCCATCGCT ACGGTGTGCT GCCGGGGCGC GTCGCGGTCG TCGCGACGAA CAACGATCGC GGCTATCAGT GCGCGCTCGA TCTGAAGGCG TGCGGCGCGA AGGTGACGGT CGTCGATGCG CGCGCGTCGA CGCGCGGCGC ATTGCCCGCG GTCGCCAAGC GCCACGGCAT CACGGTGATG AGCGGCGCGG CCGTGTCGGC TGCGGCGGGC AAGCTGCGCG TCGCGTCGGT CGATGTCGTC TCCTATGCCA ATGGCCGCTC GGGCGGCAAG ATCGCGACGC TGCCGTGCGA TCTGGTCGCG ATGTCGGGCG GCTTCAGCCC GGTGCTGCAC CTGTTCGCGC AATCGGGCGG CAAGGCGCAC TGGAACGACG ACAAGGCCTG CTTCGTGCCC GGCAAGCCGG TGCAGGCGGA AGCGAGCGTC GGCGCGGCGG CCGGCGAGTT CGAGCTCGCG CGCGCGCTGC GGCTCGCGCT CGACGCGGGC GTCGCCGCGG CGAAATCGGC GGGCTTTGCC GCCGAGCGTC CGCCCGTGCC GAAGCTCGCC GAGGCGGTGG AGGACGCGCT GCTGCCGTTG TGGCTCGCGA GCGGCGCAGA AGCGGCGGTT CGCGGTCCGA AGCAGTTCGT CGATTTCCAG AACGACGTCG GCGCGGCCGA CATCCTGCTC GCCGCGCGCG AAGGTTTCGA ATCGGTCGAG CACGTGAAGC GCTACACGGC GATGGGTTTC GGCACCGATC AGGGCAAGCT CGGCAACATC AACGGGATGG CGATCCTCGC GCAGGCGCTC GGCAAGACGA TTCCGGAGAC GGGCACGACG ACGTTCCGCC CGAACTACAC GCCGGTGTCG TTCGGCGCGT TCGCGGGCCG CGAGCTCGGC GATTTCCTCG ACCCGATCCG CAAGACCTGC GTTCACGAAT GGCATGTCGA GCACGGCGCG ATGTTCGAGG ACGTCGGCAA CTGGAAGCGG CCGTGGTACT TCCCGCGCAA CGGCGAGGAT CTGCACGCGG CGGTCAAGCG CGAGTGCCTC GCGGTGCGCA ACGGCGTCGG CATGCTCGAT GCGTCGACGC TCGGCAAGAT CGATATCCAG GGCCCGGACG CGGTGAAGCT GCTGAACTGG GTATACACGA ACCCGTGGAA CAAGCTCGAG GTCGGCAAGT GCCGCTACGG GCTGATGCTC GACGAGAACG GCATGGTGTT CGACGACGGC GTGACCGTGC GCCTGGGCGA CCAGCACTTC ATGATGACGA CCACCACGGG CGGCGCCGCG CGCGTGCTCA CGTGGCTCGA GCGCTGGCTG CAGACGGAAT GGCCGGACAT GAAGGTGCGC CTTTCGTCCG TCACCGATCA CTGGGCGACG TTCGCGGTGG TCGGCCCGAA GAGCCGCCGG GTCGTGCAGA AGGTGTGCAA GGACATCGAC TTCGCGAACG ACGCGTTCCC GTTCATGAGC TATCGCGACG GCACGGTCGC CGGCGTGAAG TCGCGCGTGA TGCGCATCAG CTTCTCGGGC GAGCTCGCGT ACGAAGTGAA CGTGCCGGCG AACGCGGGCC GCGCGGTATG GGAAGCGCTG ATGGACGCGG GCGCGGAGTT CGACATCACG CCGTATGGCA CCGAGACGAT GCACGTGCTG CGCGCGGAGA AGGGCTACAT CATCGTCGGT CAGGATACCG ACGGATCGAT CACGCCGTTC GATCTCGGCA TGGGCGGGCT CGTCGCGAAA TCGAAGGATT TCCTCGGCCG CCGCTCGCTC ACGCGCGCCG ATACCGCGAA GAGCGGCCGC AAGCAGTTCG TCGGCCTGCT GACCGACGAC GCGCAGTCTG TTTTGCCCGA AGGCGGCCAG ATCGTCGAGC TCGATGCGGC CGCGCGTGCG GACGGCACGA CGCCGATGCT CGGTCACGTG ACGTCGAGCT ATTACAGTCC GATCCTGAAC CGCTCGATCG CGCTCGCGGT CGTGAAGGGC GGATTGAGCC GGATGGGCGA GCGCGTCGCG GTCTCGCTCG CGAACGGGCG GCGCGTCGCC GCGACGATTT CGAGCCCGGT TTTCTACGAC ACCGAAGGGG TACGTCAACA TGTGGAATGA
|
Protein sequence | MQKLARFLPA GFYYKTFMWP RNLWPKYEEK IREAAGLGKA PDTLDADRYD KCYAHCDVLV VGGGPTGLAA AHAAAVNGAR VILVDDQREL GGSLLACRAE IDGKPALQWV EKIEAELAKL PDVSILTRST AFGYQDHNLV TVVQRLTDHL PVSMRKGTRE MIWKVRAKRV ILATGAHERP LVFGNNDLPG VMTASAVSAY IHRYGVLPGR VAVVATNNDR GYQCALDLKA CGAKVTVVDA RASTRGALPA VAKRHGITVM SGAAVSAAAG KLRVASVDVV SYANGRSGGK IATLPCDLVA MSGGFSPVLH LFAQSGGKAH WNDDKACFVP GKPVQAEASV GAAAGEFELA RALRLALDAG VAAAKSAGFA AERPPVPKLA EAVEDALLPL WLASGAEAAV RGPKQFVDFQ NDVGAADILL AAREGFESVE HVKRYTAMGF GTDQGKLGNI NGMAILAQAL GKTIPETGTT TFRPNYTPVS FGAFAGRELG DFLDPIRKTC VHEWHVEHGA MFEDVGNWKR PWYFPRNGED LHAAVKRECL AVRNGVGMLD ASTLGKIDIQ GPDAVKLLNW VYTNPWNKLE VGKCRYGLML DENGMVFDDG VTVRLGDQHF MMTTTTGGAA RVLTWLERWL QTEWPDMKVR LSSVTDHWAT FAVVGPKSRR VVQKVCKDID FANDAFPFMS YRDGTVAGVK SRVMRISFSG ELAYEVNVPA NAGRAVWEAL MDAGAEFDIT PYGTETMHVL RAEKGYIIVG QDTDGSITPF DLGMGGLVAK SKDFLGRRSL TRADTAKSGR KQFVGLLTDD AQSVLPEGGQ IVELDAAARA DGTTPMLGHV TSSYYSPILN RSIALAVVKG GLSRMGERVA VSLANGRRVA ATISSPVFYD TEGVRQHVE
|
| |