Gene BamMC406_5067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBamMC406_5067 
Symbol 
ID6180810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia ambifaria MC40-6 
KingdomBacteria 
Replicon accessionNC_010552 
Strand
Start bp2234153 
End bp2237164 
Gene Length3012 bp 
Protein Length1003 aa 
Translation table11 
GC content67% 
IMG OID641684819 
Productsarcosine oxidase alpha subunit family protein 
Protein accessionYP_001811729 
Protein GI172064078 
COG category[E] Amino acid transport and metabolism
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase)
[COG0492] Thioredoxin reductase 
TIGRFAM ID[TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.844501 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGA AAGACCGCCT CGGCACCGGG GGCCGCATCA ATCGCGCGAT TCCGCTGACG 
TTCACGTTTA ACGGCCGCAC GTATCAAGGC TTCCAGGGCG ACACGCTCGC GTCCGCGCTG
CTCGCGAACG GCGTGCACTT CGTCGCGCGC AGCTTCAAGT ATCACCGTCC GCGCGGGATC
GTGACGGCGG ACGTCGCGGA ACCGAATGCC GTCGTGCAGC TCGAGTCCGG CCCGTACACG
GTGCCGAACG CGCGCGCGAC CGAGATCGAG CTGTACCAGG GCCTCGTCGC GACGAGCGTG
AACGCCGAGC CGTCGCTCGA GAACGACAAG TACGCGATCA ACCAGAAGTT CTCGCGCTTC
ATGCCGGCCG GGTTCTACTA CAAGACCTTC ATGTGGCCGC GCAACATGTG GCCGAAGTAC
GAAGAGAAGA TCCGCGAGGC CGCCGGCCTC GGCAAGGCGC CCGAAATGCT CGACGCCGAT
CGCTACGACA AGTGCTACGC GCACTGCGAC GTGCTCGTGG TCGGCGGTGG CCCGTCGGGC
CTCGCCGCCG CGCATGCGGC CGCGACGGCC GGCGCGCGCG TGATCCTCGT CGACGACCAG
CGCGAGCTCG GCGGCAGCCT GCTGTCGTGC CGCGCGGAAA TCGACGGGAA GCCGGCGCAG
CAATGGGTCG AGAAGATCGA GGCCGAGCTG CGCAAGCTGC CCGACGTGAC GATCCTGTCG
CGCAGCACCG CGTTTGGTTA CCAGGACCAC AACCTCGTGA CGATCACGCA GCGCCTGACC
GATCACCAGC CGGTGTCGAT GCGCAAGGGT ACGCGCGAGC TGCTGTGGAA GGTCCGCGCG
AAGCGCGTGA TCCTCGCGAC CGGCGCGCAC GAGCGTCCGA TCGTGTTCGG CAACAACGAC
CTGCCGGGCG TGATGCTCGC CGGCGCGGTG TCCACCTACG TGCATCGCTA CGGCGTGATG
CCGGGCCGCA ACGTGGTCGT GTTCACGAAC AACGATCGTG CGTACCAGAC CGCGCTCGAC
CTGAAGGCGT GCGGTGCGAA GGTGACGGTC GTCGATTCGC GTGCGTCGTC GAACGGCGCG
CTGCCTGCCG CCGCGAAGCG GCAGGGCGTG ACGGTGATGA GCGGTGCCGT GGTGACGGCC
GCTTCGGGCA AGTGGCGCGT ATCGTCGGTC GACGTCGCGT CCTACACGAA CGGCCAGACC
GGCAGCAAGT TGCAGTCGCT GCCGTGCGAC CTCGTCGCGA TGTCGGGCGG CTTCAGCCCG
GTGCTGCACC TGTTCGCGCA GTCCGGCGGC AAGGCGTGCT GGAACGACGA GAAGGCGTGC
TTCCTGCCGG GCAAGCCCGT GCAGGCCGAG GCGAGCGTCG GTGCGGCGGC GGGCGAGTTC
GGTCTCGCAC GTGCGCTGCG GCTCGCACTC GACGCGGGCG TCGAAGCGGC GAAGGCCGCC
GGCTTCGCGG CTGCGCAGCG CGTGGCCGCG CCGCAGGTCG CCGAAACCGT CGAAGGCGCG
CTGCAGCCGT TGTGGCTCGT CGGCAGCCGC GAGGACGCCG CACGCGGGCC GAAGCAGTTC
GTCGACTTCC AGAACGACGT GGCGGCCGCC GACATCCTGC TGGCCGCACG CGAAGGCTTC
GAGTCGGTCG AGCACGTGAA GCGCTACACC GCGATGGGCT TCGGCACCGA CCAGGGCAAG
CTCGGCAACA TCAACGGGAT GGCGATCCTC GCGCAGGCGC TTGGCAAGAC GATTCCCGAA
ACGGGCACGA CGACGTTCCG CCCGAACTAC ACGCCCGTGT CGTTCGGTAC GTTCGCGGGC
CGCGAGACCG GCGATTTCCT CGACCCGATC CGCAAGACGG CCGTGCATGA ATGGCACGTC
GAGCACGGCG CGCTGTTCGA GGACGTCGGC AACTGGAAGC GGCCGTGGTA TTTCCCGAAG
AACGGCGAGG ATCTGCATGC GGCCGTGAAG CGCGAATGCC TCGCAGTGCG CAACGGTGTC
GGCATCCTCG ACGCGTCGAC GCTCGGCAAG ATCGACATCC AGGGTCCGGA CGCGGTGAAG
CTGCTGAACT GGATGTACAC GAACCCGTGG AACAAGCTCG AGATCGGCAA GTGCCGCTAT
GGGCTGATGC TCGACGAAAA CGGGATGGTG TTCGACGACG GCGTGACGGT GCGCCTCGCC
GACCAGCATT TCATGATGAC GACGACCACC GGTGGCGCGG CGCGCGTGCT CACGTGGCTC
GAGCGCTGGC TGCAGACCGA ATGGCCCGAC ATGAAGGTGC GGCTCGCGTC CGTCACCGAT
CACTGGGCGA CGTTCGCGGT GGTCGGCCCG AAGAGCCGCA AGGTCGTGCA GAAGGTGTGC
CAGGACATCG ACTTCGGCAA CGAGGCGTTC CCGTTCATGA GCTACCGGAA CGGCACCGTC
GCGGGCGCGA AGGCGCGCGT GATGCGGATC AGCTTCTCCG GCGAGCTCGC GTATGAAGTG
AACGTGCCGG CCAATGCGGG CCGCGCGGTG TGGGAAGCGC TGATGGCGGC CGGCGCCGAG
TTCGACATCA CGCCGTACGG CACCGAGACG ATGCACGTGC TGCGCGCGGA GAAGGGCTAC
ATCATCGTCG GCCAGGACAC CGACGGCTCG GTCACGCCGT ACGACCTCGG GATGGGCGGG
CTCGTCGCGA AGTCGAAGGA CTTCCTCGGC AAGCGCTCGC TGTCGCGCTC GGATACGTCG
AAGGAAGGCC GCAAGCAGTT CGTCGGCCTG CTGACGGAAG ACGAACAGTT CGTGCTGCCG
GAAGGCGCGC AGATCATCGC GAAGGACACG CAGGTGTCGA CGGTCGATCC GACGCCGATG
ATCGGCCACG TGACGTCGAG TTATTACAGC CCGATCCTGA AGCGCTCGAT TGCGCTGGCG
GTGGTGAAGG GCGGCCTGAA CAAGATGGGC GAGAGCGTCG TGATTCCGCT GGCCAACGGC
CGCCGCATCA CCGCGAAGAT CTCGAGCCCG GTTTTCTACG ATACCGAAGG GGTGCGCCAG
CATGTGGAAT GA
 
Protein sequence
MSQKDRLGTG GRINRAIPLT FTFNGRTYQG FQGDTLASAL LANGVHFVAR SFKYHRPRGI 
VTADVAEPNA VVQLESGPYT VPNARATEIE LYQGLVATSV NAEPSLENDK YAINQKFSRF
MPAGFYYKTF MWPRNMWPKY EEKIREAAGL GKAPEMLDAD RYDKCYAHCD VLVVGGGPSG
LAAAHAAATA GARVILVDDQ RELGGSLLSC RAEIDGKPAQ QWVEKIEAEL RKLPDVTILS
RSTAFGYQDH NLVTITQRLT DHQPVSMRKG TRELLWKVRA KRVILATGAH ERPIVFGNND
LPGVMLAGAV STYVHRYGVM PGRNVVVFTN NDRAYQTALD LKACGAKVTV VDSRASSNGA
LPAAAKRQGV TVMSGAVVTA ASGKWRVSSV DVASYTNGQT GSKLQSLPCD LVAMSGGFSP
VLHLFAQSGG KACWNDEKAC FLPGKPVQAE ASVGAAAGEF GLARALRLAL DAGVEAAKAA
GFAAAQRVAA PQVAETVEGA LQPLWLVGSR EDAARGPKQF VDFQNDVAAA DILLAAREGF
ESVEHVKRYT AMGFGTDQGK LGNINGMAIL AQALGKTIPE TGTTTFRPNY TPVSFGTFAG
RETGDFLDPI RKTAVHEWHV EHGALFEDVG NWKRPWYFPK NGEDLHAAVK RECLAVRNGV
GILDASTLGK IDIQGPDAVK LLNWMYTNPW NKLEIGKCRY GLMLDENGMV FDDGVTVRLA
DQHFMMTTTT GGAARVLTWL ERWLQTEWPD MKVRLASVTD HWATFAVVGP KSRKVVQKVC
QDIDFGNEAF PFMSYRNGTV AGAKARVMRI SFSGELAYEV NVPANAGRAV WEALMAAGAE
FDITPYGTET MHVLRAEKGY IIVGQDTDGS VTPYDLGMGG LVAKSKDFLG KRSLSRSDTS
KEGRKQFVGL LTEDEQFVLP EGAQIIAKDT QVSTVDPTPM IGHVTSSYYS PILKRSIALA
VVKGGLNKMG ESVVIPLANG RRITAKISSP VFYDTEGVRQ HVE