Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bphy_3331 |
Symbol | |
ID | 6244762 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phymatum STM815 |
Kingdom | Bacteria |
Replicon accession | NC_010623 |
Strand | - |
Start bp | 265809 |
End bp | 268808 |
Gene Length | 3000 bp |
Protein Length | 999 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642595121 |
Product | sarcosine oxidase alpha subunit family protein |
Protein accession | YP_001859533 |
Protein GI | 186472191 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.967331 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGA AGAACCGCCT CGGCGCCGGT GGGCGCATCA ACCGCGCGAT TCCGCTGACC TTCACGTTCA ACGGCCGCAC GTATCAAGGC TTTCAGGGCG ACACGCTGGC GTCCGCGCTA CTCGCGAACG GCGTGCACTT CGTCGCGCGC AGCTTCAAGT ATCACCGGCC GCGCGGCATC GTCACGGCCG ACGTCGCCGA ACCGAATGCC GTCGTGCAAC TCGAACGCGG CGCCTACACG GTGCCGAATG CCCGTGCGAC GGAAATCGAG CTGTATCAGG GCCTCGTTGC GACGAGCGTG AACGCCGAGC CGAACCTCGA GCACGATCGC ATGGCCATCA ATCAGAAGTT CGCGCGCTTC ATGCCCGCGG GCTTTTACTA CAAGACCTTC ATGTGGCCGG CCAAATGGTG GCCGAAGTAT GAAGAGAAGA TCCGCGAAGC GGCGGGTCTC GGCAAGGCGC CCGAAGTGCT CGACGCCGAC CGTTACGACA AGTGCTACGC GCATTGCGAC GTGCTCGTCG TGGGCGGCGG GCCGACGGGT CTCGCGGCGG CGCATGCGGC GGCCTCGAGC GGCGCGCGTG TGATCCTCGT CGATGACCAG CGCGAACTCG GCGGCAGCCT GCTGTCGTGC AAGACGGAGA TCGACGGGCA CGCGGCGCTG AGCTGGGTCG AGAAGATCGA GGCGGAACTA TCGCGCATGC CCGACGTGAA GATCCTTTCG CGCAGCACGG CATTCGGCTA TCAGGATCAC AACCTCGTCA CGGTGACGCA GCGACTGACG GATCATCAGC CCGTGTCGAT GCGCAAGGGC ACGCGCGAAC TGCTGTGGAA GATCCGCGCC AAACGCGTGA TCCTCGCGAC GGGCGCTCAC GAGCGTCCCA TCGTGTTCGG CAACAACGAT CTGCCGGGTG TGATGCTGGC GTCGGCCGTG TCGACGTATA TCCATCGCTT CGGCGTGATG CCGGGGCGCA ACGCCGTCGT GTTTACCAAC AACGACGCCG GGTATCGCTG CGCGCTCGAC ATGAAAGCGT GCGGCGCGAG CGTCACCGTC GTCGACCCGC GTGCGCAAGG CAACGGCGCA TTGCAGGCAG CCGCGCGCCG TCACGGCGTG AAGATCATGA ACAACGCCGC CGTGATGACC GCACATGGCA AGCAGCGCGT GACGTCCGTG GAAGTCGTCG CGTATGCGAA CGGCAAGACG GGCGCGAAGC AGGCTGATCT GCAGTGCGAT CTCGTCGCGA TGTCGGGCGG TTTCAGCCCG GTGCTGCACC TGTTCGCGCA GTCGGGCGGC AAGGCTCACT GGAACGATAC GAAGGCGTGC TTCGTGCCGG GCAAGGGCAT GCAGCCGGAA ACGAGCGTCG GCGCAGCGGC GGGCGAATTC AGCCTTGCAC GCGGCCTGCG TCTTGCCGTC GATGCCGGCG TCGAAGCCGT CAAGTCGATC GGTTATGCAG TGACGCGTCC GCAGGTGCCG CAGGTGGCCG AGGTCGTCGA GTCGCCGCTG CAACCGCTGT GGCTCGTCGG CAGCCGGGCA GAAGCGGCGC GCGGTCCGAA GCAGTTCGTC GACTTCCAGA ACGACGTGTC GGCGGCGGAT ATTCTGCTTG CGGCGCGCGA AGGCTTCGAA TCCGTCGAGC ACGTGAAGCG CTATACGGCG ATGGGCTTCG GTACGGATCA GGGCAAGCTC GGCAACATCA ACGGCATGGC GATTCTCGCG GATGCGCTCG GCAAGACGAT TCCCGAAACG GGCACGACGA CGTTCCGTCC GAACTACACG CCCGTGACCT TCGGCACGTT CGCGGGCCGC GAGCTCGGCG ATCTTCTCGA CCCGATCCGC AAGACGGCCG TGCACGAATG GCACGTCGAG AATGGCGCGA TGTTCGAGGA CGTCGGCAAC TGGAAGCGTC CGTGGTACTT CCCGCTGAAG GGCGAAGACC TGCATGCGGC CGTCAAGCGC GAATGCCTTG CGGTACGCAA CAGCGTCGGC ATTCTCGATG CATCGACGCT CGGCAAGATC GACATTCAGG GCCCGGATGC GGCGAAGCTG CTGAACTGGA TGTACACGAA CCCGTGGAGT AAGCTCGAAG TCGGCAAGTG CCGCTACGGC CTGATGCTCG ACGAGAACGG CATGGTGTTC GATGACGGCG TGACGGTGCG CCTCGCCGAC CAGCACTTCA TGATGACGAC CACGACGGGC GGCGCGGCGC GAGTGCTGAC ATGGATGGAG CGCTGGCTGC AGACGGAATG GCCGGACATG AAGGTGCGCC TCGCATCCGT GACCGATCAC TGGGCGACCT TCGCCGTGGT CGGCCCGAAG AGCCGGAAGG TCGTGCAGAA GGTGTGCAGC GACATCGACT TCGCCAACGA AGCGTTCCCG TTCATGTCGT ACCGCAACGG CACGGTGGCG GGTGTGAAGG CGCGTGTGAT GCGCATCAGC TTCTCGGGCG AACTGGCGTA TGAAGTGAAC GTGCCCGCGA ACATGGGCCG CGCGGTATGG GAGGCGCTGA TGGCCGCGGG CGCCGAATTC GATATCACGC CCTACGGCAC GGAAACGATG CACGTGCTGC GCGCGGAGAA GGGCTACATC ATTGTCGGCC AGGATACGGA CGGTTCCGTC ACGCCGCACG ATCTGGGCAT GGGGGGCCTG GTCGCGAAGA CCAAGGACTT CCTGGGACGC CGTTCGCTTG CGCGTTCGGA TACGACGAAG GATAACCGCA AGCAGTTCGT CGGCCTGCTG TCCGACGATC CGCAGTTCGT GATTCCCGAA GGCAGCCAGA TCGTCGCGCG TCCGTTCCAG GGCGACACCG CGCCGATGCT CGGACACGTG ACGTCCAGCT ACTACAGCCC GATTCTGAAT CGATCGATCG CGCTCGCCGT GGTCAAGGGC GGCCTGAACA AGATGGGGCA AAGCGTGACG ATTCCGCTGT CGAGCGGCAA GCAGATCGCC GCGAAGATCG CCAGCCCCGT TTTCTACGAC ACCGAAGGAG TGCGTCAACA TGTGGAATGA
|
Protein sequence | MSQKNRLGAG GRINRAIPLT FTFNGRTYQG FQGDTLASAL LANGVHFVAR SFKYHRPRGI VTADVAEPNA VVQLERGAYT VPNARATEIE LYQGLVATSV NAEPNLEHDR MAINQKFARF MPAGFYYKTF MWPAKWWPKY EEKIREAAGL GKAPEVLDAD RYDKCYAHCD VLVVGGGPTG LAAAHAAASS GARVILVDDQ RELGGSLLSC KTEIDGHAAL SWVEKIEAEL SRMPDVKILS RSTAFGYQDH NLVTVTQRLT DHQPVSMRKG TRELLWKIRA KRVILATGAH ERPIVFGNND LPGVMLASAV STYIHRFGVM PGRNAVVFTN NDAGYRCALD MKACGASVTV VDPRAQGNGA LQAAARRHGV KIMNNAAVMT AHGKQRVTSV EVVAYANGKT GAKQADLQCD LVAMSGGFSP VLHLFAQSGG KAHWNDTKAC FVPGKGMQPE TSVGAAAGEF SLARGLRLAV DAGVEAVKSI GYAVTRPQVP QVAEVVESPL QPLWLVGSRA EAARGPKQFV DFQNDVSAAD ILLAAREGFE SVEHVKRYTA MGFGTDQGKL GNINGMAILA DALGKTIPET GTTTFRPNYT PVTFGTFAGR ELGDLLDPIR KTAVHEWHVE NGAMFEDVGN WKRPWYFPLK GEDLHAAVKR ECLAVRNSVG ILDASTLGKI DIQGPDAAKL LNWMYTNPWS KLEVGKCRYG LMLDENGMVF DDGVTVRLAD QHFMMTTTTG GAARVLTWME RWLQTEWPDM KVRLASVTDH WATFAVVGPK SRKVVQKVCS DIDFANEAFP FMSYRNGTVA GVKARVMRIS FSGELAYEVN VPANMGRAVW EALMAAGAEF DITPYGTETM HVLRAEKGYI IVGQDTDGSV TPHDLGMGGL VAKTKDFLGR RSLARSDTTK DNRKQFVGLL SDDPQFVIPE GSQIVARPFQ GDTAPMLGHV TSSYYSPILN RSIALAVVKG GLNKMGQSVT IPLSSGKQIA AKIASPVFYD TEGVRQHVE
|
| |