Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A1954 |
Symbol | |
ID | 4886398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 1893552 |
End bp | 1894796 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640131892 |
Product | sarcosine oxidase beta subunit |
Protein accession | YP_001062949 |
Protein GI | 126443706 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCGCT ACTCGATATT CAGCCTGTTT CGCAACGGCC TGTCGTATCA CGAGAACTGG GAAAAGCAGT GGAAGAGCCC GGAGCCGAAG CGCGAGTACG ACGTCGTGAT CGTCGGCGGC GGCGGGCATG GGCTCGCCAC CGCGTACTAC CTCGCGAAGG AGCATGGCGT GACGAACGTC GCGATTCTCG AGAAGGGCTG GATCGGCGGC GGCAACACCG CGCGCAACAC GACGATCGTC CGCTCGAACT ACCTGTGGGA CGAATCCGCC GGGCTCTACG AGAAGGCGAT GAAGCTGTGG GAAGGGCTCG CGCAGGACTT GAACTACAAC GTGATGTTCA GCCAGCGTGG CGTGATGAAC CTCGCGCATA CGCTGCAGGA CGTGCGCGAC ACCGAGCGCC GCGTGAACGC GAACCGGCTG AACGGCGTCG ACGCCGAGTT CCTCACGCCC GCGCAGATCA AGGAGATCGA GCCGACGATC AACCTGAACA GCCGCTACCC GGTGCTTGGC GCGTCGATCC AGCGGCGCGG CGGCGTGGCG CGGCACGACG CGGTGGCCTG GGGCTTCGCG CGCGGCGCGG ACCGCGCGGG CGTCGACATC ATCCAGAACT GCCAGGTGAC GGGAATCCGC CGCGAGGGCG GGGCGGTGGT CGGCGTCGAT ACGGTCAAGG GCTTCATCAA GGCGAAGAAG GTCGCGGTGG TCGCGGCGGG CAACACGACG ACGCTCGCCG ACATGGCTGG CGTGCGGCTG CCGCTCGAAA GCCATCCGCT GCAGGCGCTC GTGTCCGAGC CGATCAAGCC GGTCGTCAAC ACGGTCGTGA TGTCGAACGC GGTGCATGCG TACATCAGCC AGTCCGACAA GGGCGACCTC GTGATCGGCG CGGGTATCGA CCAATACACG GGCTTCGGCC AGCGCGGCAG CTTCCAGATC ATCGAAGGCA CGCTGGAGGC GATCGTCGAG ATGTTCCCGG TGTTCTCGCG CGTGCGGATG AACCGCCAGT GGGGCGGCAT CGTCGACGTG TCGCCGGACG CGTGCCCGAT CATCAGCAAG ACCGACGTGA AGGGCCTGTA TTTCAACTGC GGCTGGGGCA CGGGCGGCTT CAAGGCGACG CCGGGCTCGG GCTGGGTGTT CGCGCATACG ATCGCGCGCG ACGAGCCGCA CGCGCTGAAC GCGCCGTTCG CGCTCGACCG GTTCTACACC GGCCACCTGA TCGACGAGCA CGGCGCGGCC GCCGTCGCGC ATTGA
|
Protein sequence | MSRYSIFSLF RNGLSYHENW EKQWKSPEPK REYDVVIVGG GGHGLATAYY LAKEHGVTNV AILEKGWIGG GNTARNTTIV RSNYLWDESA GLYEKAMKLW EGLAQDLNYN VMFSQRGVMN LAHTLQDVRD TERRVNANRL NGVDAEFLTP AQIKEIEPTI NLNSRYPVLG ASIQRRGGVA RHDAVAWGFA RGADRAGVDI IQNCQVTGIR REGGAVVGVD TVKGFIKAKK VAVVAAGNTT TLADMAGVRL PLESHPLQAL VSEPIKPVVN TVVMSNAVHA YISQSDKGDL VIGAGIDQYT GFGQRGSFQI IEGTLEAIVE MFPVFSRVRM NRQWGGIVDV SPDACPIISK TDVKGLYFNC GWGTGGFKAT PGSGWVFAHT IARDEPHALN APFALDRFYT GHLIDEHGAA AVAH
|
| |