Gene Bphy_3331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBphy_3331 
Symbol 
ID6244762 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia phymatum STM815 
KingdomBacteria 
Replicon accessionNC_010623 
Strand
Start bp265809 
End bp268808 
Gene Length3000 bp 
Protein Length999 aa 
Translation table11 
GC content65% 
IMG OID642595121 
Productsarcosine oxidase alpha subunit family protein 
Protein accessionYP_001859533 
Protein GI186472191 
COG category[E] Amino acid transport and metabolism
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase)
[COG0492] Thioredoxin reductase 
TIGRFAM ID[TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.967331 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGA AGAACCGCCT CGGCGCCGGT GGGCGCATCA ACCGCGCGAT TCCGCTGACC 
TTCACGTTCA ACGGCCGCAC GTATCAAGGC TTTCAGGGCG ACACGCTGGC GTCCGCGCTA
CTCGCGAACG GCGTGCACTT CGTCGCGCGC AGCTTCAAGT ATCACCGGCC GCGCGGCATC
GTCACGGCCG ACGTCGCCGA ACCGAATGCC GTCGTGCAAC TCGAACGCGG CGCCTACACG
GTGCCGAATG CCCGTGCGAC GGAAATCGAG CTGTATCAGG GCCTCGTTGC GACGAGCGTG
AACGCCGAGC CGAACCTCGA GCACGATCGC ATGGCCATCA ATCAGAAGTT CGCGCGCTTC
ATGCCCGCGG GCTTTTACTA CAAGACCTTC ATGTGGCCGG CCAAATGGTG GCCGAAGTAT
GAAGAGAAGA TCCGCGAAGC GGCGGGTCTC GGCAAGGCGC CCGAAGTGCT CGACGCCGAC
CGTTACGACA AGTGCTACGC GCATTGCGAC GTGCTCGTCG TGGGCGGCGG GCCGACGGGT
CTCGCGGCGG CGCATGCGGC GGCCTCGAGC GGCGCGCGTG TGATCCTCGT CGATGACCAG
CGCGAACTCG GCGGCAGCCT GCTGTCGTGC AAGACGGAGA TCGACGGGCA CGCGGCGCTG
AGCTGGGTCG AGAAGATCGA GGCGGAACTA TCGCGCATGC CCGACGTGAA GATCCTTTCG
CGCAGCACGG CATTCGGCTA TCAGGATCAC AACCTCGTCA CGGTGACGCA GCGACTGACG
GATCATCAGC CCGTGTCGAT GCGCAAGGGC ACGCGCGAAC TGCTGTGGAA GATCCGCGCC
AAACGCGTGA TCCTCGCGAC GGGCGCTCAC GAGCGTCCCA TCGTGTTCGG CAACAACGAT
CTGCCGGGTG TGATGCTGGC GTCGGCCGTG TCGACGTATA TCCATCGCTT CGGCGTGATG
CCGGGGCGCA ACGCCGTCGT GTTTACCAAC AACGACGCCG GGTATCGCTG CGCGCTCGAC
ATGAAAGCGT GCGGCGCGAG CGTCACCGTC GTCGACCCGC GTGCGCAAGG CAACGGCGCA
TTGCAGGCAG CCGCGCGCCG TCACGGCGTG AAGATCATGA ACAACGCCGC CGTGATGACC
GCACATGGCA AGCAGCGCGT GACGTCCGTG GAAGTCGTCG CGTATGCGAA CGGCAAGACG
GGCGCGAAGC AGGCTGATCT GCAGTGCGAT CTCGTCGCGA TGTCGGGCGG TTTCAGCCCG
GTGCTGCACC TGTTCGCGCA GTCGGGCGGC AAGGCTCACT GGAACGATAC GAAGGCGTGC
TTCGTGCCGG GCAAGGGCAT GCAGCCGGAA ACGAGCGTCG GCGCAGCGGC GGGCGAATTC
AGCCTTGCAC GCGGCCTGCG TCTTGCCGTC GATGCCGGCG TCGAAGCCGT CAAGTCGATC
GGTTATGCAG TGACGCGTCC GCAGGTGCCG CAGGTGGCCG AGGTCGTCGA GTCGCCGCTG
CAACCGCTGT GGCTCGTCGG CAGCCGGGCA GAAGCGGCGC GCGGTCCGAA GCAGTTCGTC
GACTTCCAGA ACGACGTGTC GGCGGCGGAT ATTCTGCTTG CGGCGCGCGA AGGCTTCGAA
TCCGTCGAGC ACGTGAAGCG CTATACGGCG ATGGGCTTCG GTACGGATCA GGGCAAGCTC
GGCAACATCA ACGGCATGGC GATTCTCGCG GATGCGCTCG GCAAGACGAT TCCCGAAACG
GGCACGACGA CGTTCCGTCC GAACTACACG CCCGTGACCT TCGGCACGTT CGCGGGCCGC
GAGCTCGGCG ATCTTCTCGA CCCGATCCGC AAGACGGCCG TGCACGAATG GCACGTCGAG
AATGGCGCGA TGTTCGAGGA CGTCGGCAAC TGGAAGCGTC CGTGGTACTT CCCGCTGAAG
GGCGAAGACC TGCATGCGGC CGTCAAGCGC GAATGCCTTG CGGTACGCAA CAGCGTCGGC
ATTCTCGATG CATCGACGCT CGGCAAGATC GACATTCAGG GCCCGGATGC GGCGAAGCTG
CTGAACTGGA TGTACACGAA CCCGTGGAGT AAGCTCGAAG TCGGCAAGTG CCGCTACGGC
CTGATGCTCG ACGAGAACGG CATGGTGTTC GATGACGGCG TGACGGTGCG CCTCGCCGAC
CAGCACTTCA TGATGACGAC CACGACGGGC GGCGCGGCGC GAGTGCTGAC ATGGATGGAG
CGCTGGCTGC AGACGGAATG GCCGGACATG AAGGTGCGCC TCGCATCCGT GACCGATCAC
TGGGCGACCT TCGCCGTGGT CGGCCCGAAG AGCCGGAAGG TCGTGCAGAA GGTGTGCAGC
GACATCGACT TCGCCAACGA AGCGTTCCCG TTCATGTCGT ACCGCAACGG CACGGTGGCG
GGTGTGAAGG CGCGTGTGAT GCGCATCAGC TTCTCGGGCG AACTGGCGTA TGAAGTGAAC
GTGCCCGCGA ACATGGGCCG CGCGGTATGG GAGGCGCTGA TGGCCGCGGG CGCCGAATTC
GATATCACGC CCTACGGCAC GGAAACGATG CACGTGCTGC GCGCGGAGAA GGGCTACATC
ATTGTCGGCC AGGATACGGA CGGTTCCGTC ACGCCGCACG ATCTGGGCAT GGGGGGCCTG
GTCGCGAAGA CCAAGGACTT CCTGGGACGC CGTTCGCTTG CGCGTTCGGA TACGACGAAG
GATAACCGCA AGCAGTTCGT CGGCCTGCTG TCCGACGATC CGCAGTTCGT GATTCCCGAA
GGCAGCCAGA TCGTCGCGCG TCCGTTCCAG GGCGACACCG CGCCGATGCT CGGACACGTG
ACGTCCAGCT ACTACAGCCC GATTCTGAAT CGATCGATCG CGCTCGCCGT GGTCAAGGGC
GGCCTGAACA AGATGGGGCA AAGCGTGACG ATTCCGCTGT CGAGCGGCAA GCAGATCGCC
GCGAAGATCG CCAGCCCCGT TTTCTACGAC ACCGAAGGAG TGCGTCAACA TGTGGAATGA
 
Protein sequence
MSQKNRLGAG GRINRAIPLT FTFNGRTYQG FQGDTLASAL LANGVHFVAR SFKYHRPRGI 
VTADVAEPNA VVQLERGAYT VPNARATEIE LYQGLVATSV NAEPNLEHDR MAINQKFARF
MPAGFYYKTF MWPAKWWPKY EEKIREAAGL GKAPEVLDAD RYDKCYAHCD VLVVGGGPTG
LAAAHAAASS GARVILVDDQ RELGGSLLSC KTEIDGHAAL SWVEKIEAEL SRMPDVKILS
RSTAFGYQDH NLVTVTQRLT DHQPVSMRKG TRELLWKIRA KRVILATGAH ERPIVFGNND
LPGVMLASAV STYIHRFGVM PGRNAVVFTN NDAGYRCALD MKACGASVTV VDPRAQGNGA
LQAAARRHGV KIMNNAAVMT AHGKQRVTSV EVVAYANGKT GAKQADLQCD LVAMSGGFSP
VLHLFAQSGG KAHWNDTKAC FVPGKGMQPE TSVGAAAGEF SLARGLRLAV DAGVEAVKSI
GYAVTRPQVP QVAEVVESPL QPLWLVGSRA EAARGPKQFV DFQNDVSAAD ILLAAREGFE
SVEHVKRYTA MGFGTDQGKL GNINGMAILA DALGKTIPET GTTTFRPNYT PVTFGTFAGR
ELGDLLDPIR KTAVHEWHVE NGAMFEDVGN WKRPWYFPLK GEDLHAAVKR ECLAVRNSVG
ILDASTLGKI DIQGPDAAKL LNWMYTNPWS KLEVGKCRYG LMLDENGMVF DDGVTVRLAD
QHFMMTTTTG GAARVLTWME RWLQTEWPDM KVRLASVTDH WATFAVVGPK SRKVVQKVCS
DIDFANEAFP FMSYRNGTVA GVKARVMRIS FSGELAYEVN VPANMGRAVW EALMAAGAEF
DITPYGTETM HVLRAEKGYI IVGQDTDGSV TPHDLGMGGL VAKTKDFLGR RSLARSDTTK
DNRKQFVGLL SDDPQFVIPE GSQIVARPFQ GDTAPMLGHV TSSYYSPILN RSIALAVVKG
GLNKMGQSVT IPLSSGKQIA AKIASPVFYD TEGVRQHVE