Gene Bphyt_5050 details


Gene Information

Locus tag: Bphyt_5050
Symbol:
ID: 6280045
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia phytofirmans PsJN
Kingdom: Bacteria
Replicon accession: NC_010676
Strand:
Start bp: 1206681
End bp: 1209683
Gene Length: 3003 bp
Protein Length: 1000 aa
Translation table: 11
GC content: 65%
IMG OID: 642616141
Product: sarcosine oxidase, alpha subunit family
Protein accession: YP_001888784
Protein GI: 187919753
COG category: [E] Amino acid transport and metabolism
              [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0404] Glycine cleavage system T protein (aminomethyltransferase)
        [COG0492] Thioredoxin reductase
TIGRFAM ID: [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form
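
The two length fields can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1206681, 1209683)
print(length, protein_length(length))  # 3003 1000
```

Under those assumptions the result agrees with the Gene Length and Protein Length values listed in the record.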


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.125712
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.268825
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAGA AAGACCGACT CCCCAACGGC GGCCGCATCA ATCGCGCAAT GCCGCTGACC 
TTCACCTTCA ACGGCCGCCA GTACCAGGGC TATCAGGGCG ACACGCTGGC TTCGGCGCTG
CTCGCGAACG GCGAGCATTT CGTCGCGCGC AGCTGGAAAT ATCACCGGCC GCGCGGCATC
GTGACAGCGG GTGTGGAAGA GCCGAATGCC GTCGTGCAAC TCGAAACCGG TGCGTACACG
GTGCCGAATG CGCGCGCGAC CGAAGTCGAG TTGTATCAGG GGCTCGTCGC CACCAGCGTG
AACGCGAAGC CGAGCATCGA GAAAGACCGC ATGGCGGTCA ACCAGAAGTT CGCCCGCTTC
ATTCCGGCGG GCTTCTACTA CAAGACCTTC ATGTGGCCGC GCAAATTCTG GCCGAAGTAT
GAAGAAGTGA TCCGCGACGC GGCCGGCCTC GGCAAGGCGC CGGAACATAC TGACGCGGAC
CGTTATGACA AGTGCTTTGC GCATTGCGAC GTGCTGGTGG TGGGCGGTGG CCCGACCGGC
CTCGCGGCGG CGCATGCCGC CGCGTTGTCC GGCGCGCGCG TGACGCTGGT CGACGATCAG
CCGGAACTCG GCGGCTCGCT GCTGTCGTGC CGCGCGGAGA TCGACGGCAA GCCCGCGCTC
CACTGGGTGC AGAAGATCGA GGACGAACTG CGGCAGATGC CCGAAGTGAA GATCCTGTGC
CGCAGCACGG CATTCGGCTA TCAGGACCAC AATCTCGTGA CGTTAACGCA GCGGCTCACC
GAACATCTGC CGGTGTCGCA ACGCAAGGGC ACGCGCGAAC TGATGTGGAA GATCCGCGCG
AAACGCGTGA TTCTCGCGAC CGGCGCGCAC GAGCGTCCCA TCGTGTTCGG CAATAACGAT
CTGCCGGGCG TGATGCTGGC GTCGGCCGTG TCGACCTATC TGCATCGCTA TGCGGTGCTG
CCGGGCCGCA ACGCGGTGGT GTTCACCAAT AACGACGACG GTTATCAATG CGCGCTCGAT
CTGAAAGCAG CCGGCGCTCA GGTGACGGTG GTCGATCCGC GCGCGAGCGA ATCGAAAGGC
ACGCTGCCCG CTCTGGCACG CCGCTACGGC GTCAAGGTGT TGAACGGTGT CGTGATCACG
GCGGCGCACG GCAAACTGCG CGTGGCGTCC GTGGATCTCG CGCCGTATTC GAACGGACAG
GTCGGTGCGA AGCAGAGCGA GCTTGCCTGC GATCTGCTCG CGATGTCCGG CGGTTGGAGC
CCGGTGTTGC ATCTGTTCGC GCAATCGGGC GGCAAGGCGC ACTGGCATGA CGAGAAGGCG
TGCTTCGTGC CGGGCAAGGC GATGCAGCCG GAAACCAGCG TCGGCGCATG CGCGGGCGAC
TTCAAGCTCG GCCAGGGCAT CCGTTTCGCG ATGGACGCGG GCGCCGAAGC CGCGCGCGCG
GCCGGTCATA TCGTGGCTCG GCCCAATCCG GTTCAGGTCG CCGAGATCAC GGAAGCGAAG
ATGCTGCCGC TGTGGCTGGT CGGCGGCCGC GAGATGGCTA CGCGCGGGCC GAAGCAGTTC
ATCGATTTCC AGAACGACGT GTCGGCCGCG GATATTTTCC TCGCGGCACG CGAGGGCTTC
GAATCGGTCG AGCACGTAAA GCGCTATACG GCAATGGGTT TCGGCACCGA CCAGGGCAAG
CTCGGCAACA TCAATGGCAT GGCGATTCTC GCGCAGGCGC TCGGCAAGAC GATTCCGGAG
ACCGGCACCA CGACCTTCCG TCCGAACTAC ACGCCCGTCA CCTTCGGCAC GTTCGCCGGC
CGCGAGCTGG GCGAGTTTCT CGATCCGGTG CGCAAGACGG CGGTACACGA GTGGCATGTG
GAAAACGGCG CGGCTTTCGA GGACGTCGGC AACTGGAAGC GTCCGTGGTA CTACCCGAAA
GCGGGCGAAG ACTTGCACGC GGCGGTGGCG CGCGAATCGC TCGCGGTGCG CACGAGCGTC
GGCATTCTCG ATGCTTCCAC GCTCGGCAAG ATCGACATTC AGGGCCCCGA TTCGGCGAAG
CTCCTGAACT GGGTCTACAC GAATCCGTGG AGCAAGCTGG AAGTCGGCAA GTGCCGCTAC
GGTCTCATGC TCGACGAAAA CGGCATGATC TTCGACGACG GCGTGACCGT GCGCCTCGCC
GATCAGCACT ACATGATGAC GACCACCACC GGCGGCGCGG CGCGCGTGCT GACGTGGCTC
GAACGCTGGC TGCAAACCGA ATGGCCAGAT ATGCGCGTGC GGCTCGCGTC GGTGACGGAT
CATTGGGCGA CTTTTGCCGT GGTCGGTCCG AACAGCCGCA AGGTGCTGCA GAAGGTCTGC
CAGGACATCG ACTTCGCGAA CGCGGCGTTC CCGTTCATGA GCTATCGCGA AGGCACGGTG
GCGGGCGCGG CCTCACGGGT GATGCGCATC AGCTTCTCGG GCGAACTGGC TTATGAAGTG
AACGTGCCGG CGAACGTTGG ACGCGCGGTG TGGGAAGCGC TGATGGCCGC GGGTGCCGAG
TTCGACATCA CGCCGTACGG CACCGAAACC ATGCACGTGC TGCGCGCGGA GAAGGGCTAC
ATCATCGTCG GCCAGGATAC GGACGGCTCG ATGACGCCGT ACGACCTCGG CATGGGCGGG
CTCGTCGCGA AGTCGAAGGA CTTTCTCGGC AAGCGTTCGC TGACGCGCTC GGACACCGCG
AAGGCCGGGC GTAAGCAACT GGTCGGCCTG CTCTCGGACG ATCCGTCGTT CGTGATACCG
GAAGGCTCGC AGATCGTCGC GGGTCCGTTC CAGGGCGAGA CGGCGGCGAT GCTCGGCCAC
GTCACGTCGA GCTACTACAG CCCGATCCTG AAGCGTTCGA TCGCGATGGC GGTCGTCAAG
GGCGGCCTCG ACAAGATCGG CGAAACGGTC ACGATTCCGC TTTCGAGCGG CAAACAGATT
GCAGCGAAGG TCACCAGTTC GGTGTTCTAC GACAGCGAAG GAGCACGTCA ACATGTGGAA
TGA
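
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as shown here (blocks of 10 bases separated by spaces and line breaks) and contains only A, C, G and T. The helper names `clean_sequence` and `gc_percent` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Only the first displayed line is used here; paste the full sequence
# to compare the result with the GC content field of the record.
first_line = "ATGAGCCAGA AAGACCGACT CCCCAACGGC GGCCGCATCA ATCGCGCAAT GCCGCTGACC"
print(round(gc_percent(clean_sequence(first_line)), 1))
```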
 
Protein sequence
MSQKDRLPNG GRINRAMPLT FTFNGRQYQG YQGDTLASAL LANGEHFVAR SWKYHRPRGI 
VTAGVEEPNA VVQLETGAYT VPNARATEVE LYQGLVATSV NAKPSIEKDR MAVNQKFARF
IPAGFYYKTF MWPRKFWPKY EEVIRDAAGL GKAPEHTDAD RYDKCFAHCD VLVVGGGPTG
LAAAHAAALS GARVTLVDDQ PELGGSLLSC RAEIDGKPAL HWVQKIEDEL RQMPEVKILC
RSTAFGYQDH NLVTLTQRLT EHLPVSQRKG TRELMWKIRA KRVILATGAH ERPIVFGNND
LPGVMLASAV STYLHRYAVL PGRNAVVFTN NDDGYQCALD LKAAGAQVTV VDPRASESKG
TLPALARRYG VKVLNGVVIT AAHGKLRVAS VDLAPYSNGQ VGAKQSELAC DLLAMSGGWS
PVLHLFAQSG GKAHWHDEKA CFVPGKAMQP ETSVGACAGD FKLGQGIRFA MDAGAEAARA
AGHIVARPNP VQVAEITEAK MLPLWLVGGR EMATRGPKQF IDFQNDVSAA DIFLAAREGF
ESVEHVKRYT AMGFGTDQGK LGNINGMAIL AQALGKTIPE TGTTTFRPNY TPVTFGTFAG
RELGEFLDPV RKTAVHEWHV ENGAAFEDVG NWKRPWYYPK AGEDLHAAVA RESLAVRTSV
GILDASTLGK IDIQGPDSAK LLNWVYTNPW SKLEVGKCRY GLMLDENGMI FDDGVTVRLA
DQHYMMTTTT GGAARVLTWL ERWLQTEWPD MRVRLASVTD HWATFAVVGP NSRKVLQKVC
QDIDFANAAF PFMSYREGTV AGAASRVMRI SFSGELAYEV NVPANVGRAV WEALMAAGAE
FDITPYGTET MHVLRAEKGY IIVGQDTDGS MTPYDLGMGG LVAKSKDFLG KRSLTRSDTA
KAGRKQLVGL LSDDPSFVIP EGSQIVAGPF QGETAAMLGH VTSSYYSPIL KRSIAMAVVK
GGLDKIGETV TIPLSSGKQI AAKVTSSVFY DSEGARQHVE
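
The protein sequence is related to the gene sequence through translation table 11, whose codon-to-amino-acid assignments match the standard genetic code. The sketch below is one way to check that relationship without external libraries. It assumes an unspliced CDS on the forward strand with an ATG start, does not handle alternative start codons or ambiguous bases, and uses illustrative names throughout.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard amino-acid
# string; translation table 11 uses the same codon assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS, ignoring display whitespace and dropping a final stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if residues and residues[-1] == "*":
        residues.pop()  # the terminal stop codon is not part of the protein
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGAGCCAGA AAGACCGACT CCCCAACGGC"))  # MSQKDRLPNG
print(translate("CAACATGTGGAATGA"))  # QHVE
```

Both fragments match the corresponding ends of the protein sequence listed above.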