Gene Information |
Locus tag | RSp0048 |
Symbol | soxA1 |
ID | 1222596 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ralstonia solanacearum GMI1000 |
Kingdom | Bacteria |
Replicon accession | NC_003296 |
Strand | + |
Start bp | 51971 |
End bp | 55057 |
Gene Length | 3087 bp |
Protein Length | 1028 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637239907 |
Product | sarcosine oxidase subunit alpha |
Protein accession | NP_521609 |
Protein GI | 17548269 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
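The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(51971, 55057)
print(length)                     # 3087
print(protein_length_aa(length))  # 1028
```

Both results match the Gene Length (3087 bp) and Protein Length (1028 aa) fields of this record.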
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.416697 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGA AAGACCGTCT CGGTACGGGT GGCCGTATCA ACCGCGCAAA TCCGTTGACG TTCACGTTCA ACGGCCGTAC GTATCAAGGT TTCCAGGGCG ATACGCTCGC GTCGGCACTG CTGGCGAATG GCGTGCATTT CGTGGCACGC AGCTTCAAGT ACCACCGCCC GCGCGGCATC ATGACCGCAG GTGTGGAAGA ACCGAATGCA GTGGTGCAGC TTGAGTCAGG CCCCTACAGC GTGCCGAACG CGCGCGCGAC AGAGATCGAG CTGTATCAGG GCCTGGTTGC CAACAGCGTG AATGCAGAAC CCTCGCTGGA GAACGACCGC TACGCGATCA ACCAGAAGTT CTCGCGCTTT CTGCCGGCCG GGTTCTACTA CAAGACCTTC ATGTGGCCGC GCAAGATGTG GCCCAAGTAC GAAGAGAAGA TTCGCGGAGC GGCAGGCTTG GGTAAGGCGC CCGACATGCG CGATGCCGAT CGCTATGACA AGTGCTACGC GCACTGCGAC GTGCTGGTCG TGGGCGGTGG GCCGACCGGC CTGGCGGCTG CACATGCCGC AGCAACGGCC GGCGCACGCG TGATTCTCGT GGATGACCAG CGCGAACTCG GCGGGAGCCT GCTCTCGTGC CGAGCGGAAA TCGACGGCAA GCCGGCGCAG CAGTGGGTGG AAAAGATCGA GGCCCAGCTG CGCAAGCTGC CGGATGTGAC CATCCTCACG CGGAGCACGG CGTTCGGCTA TCAGGACCAC AACCTCATTA CGGTGACGCA GCGCCTGACC GATCACCTGC CGATTTCGAT GCGCAAGGGC ACGCGTGAGC TGCTGTGGAA GGTGCGCGCC AAGCGGGTCA TCCTGGCGAC CGGCGCGCAC GAACGCCCGC TCGTGTTCGG CAACAACGAT CTCCCGGGCG TGATGATCGC AGGGGCGGTG TCTTCGTACA TCCATCGCTA TGGCGTGTTG CCCGGCCGCG AAGCCGTTGT GTTCACGAAC AACGACCGCG CTTACCAAAC CGCGCTGGAT CTGAAGGCGT GCGGCGCGAA GGTCACGGTT GTGGACTCGC GTGCCGCTAC CGACGGAGCA CTGCCCGCTG CTGCGAAGCG GCAAGGCGTG ACGGTGATGA ACGGCGCGGT GATCACCACT GCCTCCGGCA AGTGGCGCGT ATCGTCGGTC GATGTTGCCT CGTACGCGAA CTCAAACGGC GCGGTCGTCA CGGCGGCCCC GGCGCAGATG CGCGTGTCAT CGGCCGATGT GGCGGCCTAC CAAGGCGGGC AAGTCGGCGG AAAGATCAAG ACGCTGCCCT GCGATATCGT CGCGACATCG GGAGGCTTCA GTCCTGTCTT GCACCTCTTC GCACAATCGG GCGGGAAGGC GCAGTGGGAC GATGCAAAGG CGTGTTTTGT ACCCGGCAAG CCCATGCAGG CGGAAGTGAG TGTGGGGGCT GCGGCGGGAA AGTTCACGCT CGCAGAGGCC TTGAAGCTTG CCGTGGACGC AGGCGCCGAG GCGGCAAGGA TTGCTGGTTT CGCCGCTGCA CAACGCCCCG TCGCACCGAA AGTGACGGAT GTGAAGGAAA GCGCGCTGCA GCCGCTCTGG CTCATCGGGG ATACCAAGTC CGCCACACGT GGGCCGAAGC AGTTCGTCGA TTTCCAGAAC GATGTGGCAG TGACCGACAT CCTGCTTGCC GCGCGCGAGG GTTTCGAATC GGTCGAGCAC GTCAAGCGCT ATACGGCCAT GGGCTTCGGC ACCGATCAGG GCAAGCTGGG CAACATCAAC GGGATGGCCA TTCTTGCGCA GGCGTTGGGT 
AAGACGATCC CGGAAACCGG CACGACCACG TTTCGCCCGA ACTACACGCC CGTCACGTTC GGTACGTTCG CCGGGCGCGA ACTGGGCAAT TTCCTCGACC CGGTCCGCAA GACCTGCGTT CACGAATGGC ACGTCGAGCA CGGTGCGCTG TTTGAGGACG TGGGCAACTG GAAGCGCCCC TGGTACTTCC CGAAGAAGGG CGAAGACCTG CATGCGGCGG TCAAGCGGGA ATGCCTTGCG GTGCGCAACA GCGTCGGCAT TCTGGATGCC TCCACGCTTG GCAAGATCGA CATCCAGGGC CCGGATGCGG TGAAGCTGCT CAACTGGATG TACACCAACC CGTGGGGCAA GCTCGAAGTC GGCAAATGCC GCTATGGGTT GATGCTCGAC GAGAACGGCA TGGTGTTCGA CGACGGCGTG ACCGTACGCC TGGCCGATCA GCATTTCATG ATGACGACCA CGACGGGTGG TGCGGCCCGC GTACTGACCT GGCTCGAGCG CTGGCTGCAA ACCGAATGGC CCGACATGCG GGTGCGGCTC GCGTCTGTCA CCGATCACTG GGCGACGTTT GCGGTGGTCG GCCCCAAGAG CCGCAAGGTG GTGCAGAAGG TCTGTCAGGA CATCGACTTC GGCAACGAGG CGTTCCCGTT CATGAGCTAC CGGAACGGCA CCGTGGCCGG CGCAAAAGCC CGCGTTATGC GGATCAGCTT CTCGGGCGAA CTGGCCTACG AAGTGAACGT GCCGGCCAAT GCGGGCCGCG CCGTGTGGGA AGCGTTGATG GCCGCGGGCG CCGAGTTCGA CATCACGCCA TACGGCACCG AAACCATGCA CGTGCTGCGT GCCGAGAAGG GCTACATCAT CGTCGGCCAG GACACGGATG GGTCGATCAC TCCGCACGAC CTGGGCATGG GCGGCATGGT CGCTAAGACG AAGGACTGCC TCGGCAAGCG CTCGCTCACG CGGTCGGACA CCGCCAAGGA GGGCCGCAAG CAGTTTGTCG GCCTGCTGAC CGATGACGCA CAGTTCGTGT TGCCCGAAGG CGCGCAGATC GTCACCAACG GCACGCAGGT TTCTGCAGAG AGCCCGACAC CGATGGTCGG CCACGTGACG TCGAGCTACT ACAGCCCGAT CCTGAAGCGG TCGATTGCGC TGGCGGTGGT CAAGGGTGGC CTGAGCAAGA TGGGCGAGAG CGTGGTGATT CCGCTGGCCA ACGGCAAGCG CGTCACCGCG AAGATCTCGA GCCCGGTTTT CTACGATACG GAAGGGGGGC GCCAACATGT GGAATGA
|
Protein sequence | MSQKDRLGTG GRINRANPLT FTFNGRTYQG FQGDTLASAL LANGVHFVAR SFKYHRPRGI MTAGVEEPNA VVQLESGPYS VPNARATEIE LYQGLVANSV NAEPSLENDR YAINQKFSRF LPAGFYYKTF MWPRKMWPKY EEKIRGAAGL GKAPDMRDAD RYDKCYAHCD VLVVGGGPTG LAAAHAAATA GARVILVDDQ RELGGSLLSC RAEIDGKPAQ QWVEKIEAQL RKLPDVTILT RSTAFGYQDH NLITVTQRLT DHLPISMRKG TRELLWKVRA KRVILATGAH ERPLVFGNND LPGVMIAGAV SSYIHRYGVL PGREAVVFTN NDRAYQTALD LKACGAKVTV VDSRAATDGA LPAAAKRQGV TVMNGAVITT ASGKWRVSSV DVASYANSNG AVVTAAPAQM RVSSADVAAY QGGQVGGKIK TLPCDIVATS GGFSPVLHLF AQSGGKAQWD DAKACFVPGK PMQAEVSVGA AAGKFTLAEA LKLAVDAGAE AARIAGFAAA QRPVAPKVTD VKESALQPLW LIGDTKSATR GPKQFVDFQN DVAVTDILLA AREGFESVEH VKRYTAMGFG TDQGKLGNIN GMAILAQALG KTIPETGTTT FRPNYTPVTF GTFAGRELGN FLDPVRKTCV HEWHVEHGAL FEDVGNWKRP WYFPKKGEDL HAAVKRECLA VRNSVGILDA STLGKIDIQG PDAVKLLNWM YTNPWGKLEV GKCRYGLMLD ENGMVFDDGV TVRLADQHFM MTTTTGGAAR VLTWLERWLQ TEWPDMRVRL ASVTDHWATF AVVGPKSRKV VQKVCQDIDF GNEAFPFMSY RNGTVAGAKA RVMRISFSGE LAYEVNVPAN AGRAVWEALM AAGAEFDITP YGTETMHVLR AEKGYIIVGQ DTDGSITPHD LGMGGMVAKT KDCLGKRSLT RSDTAKEGRK QFVGLLTDDA QFVLPEGAQI VTNGTQVSAE SPTPMVGHVT SSYYSPILKR SIALAVVKGG LSKMGESVVI PLANGKRVTA KISSPVFYDT EGGRQHVE
|
| |
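The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the GC content and Protein sequence fields could be re-derived from the Gene sequence field, the helper names below (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for amino acid assignments; alternative start codons are not handled here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence in this record.
prefix = "ATGAGCCAGA AAGACCGTCT CGGTACGGGT"
print(translate(prefix))  # MSQKDRLGTG
```

The translated prefix matches the first block of the Protein sequence field. Applying `gc_fraction` to the full gene sequence should give a value comparable to the reported GC content.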