Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSc1103 |
Symbol | soxA2 |
ID | 1219915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ralstonia solanacearum GMI1000 |
Kingdom | Bacteria |
Replicon accession | NC_003295 |
Strand | + |
Start bp | 1159152 |
End bp | 1162163 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637237469 |
Product | sarcosine oxidase subunit alpha |
Protein accession | NP_519224 |
Protein GI | 17545822 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0428535 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGA AAGACCGTCT CGGTACGGGT GGCCGTATCA ATCGTGCGAT TCCGCTGACG TTTACGTTCA ACGGCCGCAC GTATCAGGGT TTCCAGGGCG ACACGCTGGC GTCTGCGCTG CTCGCGAATG GCGTGCATTT CGTCGCGCGC AGCTTCAAGT ACCACCGTCC GCGCGGGATC ATGACGGCGG GCGTCGAGGA GCCGAACGCG GTGGTGCAGC TCGAGTCGGG CCCGTACAGC GTGCCGAATG CGCGCGCGAC CGAGATCGAG CTATACCAGG GGCTCATCGC CACCAGCGTG AACGCCGAAC CGTCGCTCGA AAACGATCGC TATGCGATCA GCCAGATGTT TTCGCGCTTC CTGCCCGCCG GTTTCTACTA CAAGACCTTC ATGTGGCCGC GCAAGATGTG GCCGAAGTAC GAAGAAAAGA TCCGCGAAGC GGCCGGCCTT GGCAAGGCGC CCGACATGCG CGACGCGGAC CGCTACGACA AGTGTTACGC GCACTGCGAC GTGCTCGTCG TGGGTGGCGG CCCGACGGGG CTCGCGGCCG CACACGCGGC GGCGATGGCC GGGGCGCGCG TCATCCTGGT TGAAGACCAG CGCGAGCTTG GCGGCAGCCT GCTGTCGTGC CGCGCGGAAA TCGGCGGCAA GCCAGCGCTG CAGTGGGTCG AGAAGATCGA AGCCCAGCTG CGCAAGCTGC CCGACGTGAG TATCCTCACG CGCAGTACCG CGTTCGGCTA TCAGGATCAC AACCTCGTGA CCGTCACGCA GCGCCTGACG GATCATCTGC CGATCTCGAT GCGCAAGGGC ACGCGCGAGC TGCTGTGGAA GGTCCGCGCC AAGCGCGTCA TTCTCGCAAC GGGCGCGCAC GAGCGGCCGA TCGTGTTCGG CAACAACGAC CTGCCGGGTG TGATGCTCGC GGGCGCCGTG TCCACGTACA TCCATCGCTT CGGCGTGCTG CCGGGGCGTG ACGCCGTCGT GTTCACGAAC AACGACCGCG CCTACCAGAC CGCGCTCGAT CTGAAGGCGT GCGGTGCGAA GGTCACGGTC GTCGACGCGC GCGCACCCGG CAACGGTGCG CTGCCCGCCG TTGCGAAGCG CCAGGGCGTA ACGGTGATGC ATGGCGCGGT GATCACGGCT GCGTCCGGCA AGTGGCGCGT GTCATCGGTC GACGTCGCGT CTTACGCGAA TGGGCAGGTG GGCGGCAAGC AGAAGACGCT GCCGTGCGAC CTTGTCGCGA CGTCGGGCGG TTTCAGCCCG GTGCTGCACC TGTTCGCGCA ATCGGGCGGC AAGGCCCAGT GGAACGATGA CAAGGCGTGC TTCGTGCCGG GCAAGACCGT GCAGGCCGAG GCGAGCGTCG GCGCGGCAGC GGGTGAATTC GCCCTTGCGC ACGCGCTGCA GCTTGCAGTG GATGCGGGGG CCGAGGCTGC ACAGGCGGCG GGTTGCACGG CCGCGCAACG CGCTGTCGCA CCGCGGGTCG CGGAAACGGC CGAAGGCGCG CTGCAACCGC TGTGGCTCAT CGGTAGCCGC GAGGCCGCTG CACGCGGGCC GAAGCAGTTC GTTGACTTCC AGAATGACGT GGCGGTCACC GACATCCTGC TCGCCGCGCG CGAGGGTTTC GAGTCGGTCG AGCACGTCAA GCGCTACACG GCGATGGGCT TCGGCACCGA TCAGGGCAAG CTCGGCAACA TCAACGGGAT GGCGATTCTC GCCCAGGCGC TCGGCAAGTC GATTCCGGAA ACGGGCACGA CGACGTTCCG CCCCAACTAC ACACCCGTGT CGTTCGGCAC GTTCGCGGGC CGCGAGCTGG GCAACTTCCT CGACCCGGTC CGCAAGACCT GCATTCATGA GTGGCATGTC GAGCACGGTG CGCTGTTCGA AGACGTCGGC AACTGGAAAC GACCCTGGTA TTTTCCGAAG AACGGCGAAG ACCTGCATGC GGCCGTGAAG CGCGAGTGCC TGGCGGTGCG CAACGGTGTC GGCATGCTCG ATGCGTCCAC GCTCGGCAAG ATCGACATCC AGGGCCCCGA CGCGGTGAAG CTGCTGAACT GGGTGTACAC GAACCCGTGG GGCAAGCTCG ACGTCGGCAA GTGCCGCTAC GGGCTGATGC TCGATGAGAA CGGGATGGTG TTCGACGACG GCGTGACCGT GCGCCTCGCC GACCAGCATT TCATGATGAC GACCACGACG GGCGGCGCAG CGCGGGTGCT CACCTGGCTC GAGCGCTGGC TGCAGACCGA GTGGCCCGAC ATGAAGGTGC GGCTCGCGTC CGTCACCGAC CACTGGGCGA CGTTCGCGGT GGTCGGCCCG AAGAGCCGCA AGGTCGTGCA GAAGGTGTGC CAGGACATCG ACTTCGGCAA CGAAGCGTTC CCGTTCATGA GCTATCGCAA CGGCACCGTC GCGGGCGCCA AGGCGCGCGT GATGCGGATC AGCTTCTCGG GCGAACTGGC CTACGAAGTG AACGTGCCGG CCAATGCCGG GCGCGCGGTG TGGGAAGCGC TGATGGCCGC GGGTGCCGAG TTCGACATCA CGCCGTACGG CACCGAAACG ATGCACGTGC TGCGCGCGGA AAAGGGCTAC ATCATCGTCG GCCAGGACAC CGACGGTTCG ATCACGCCAT CCGACCTCGG CATGGGCGGC CTCGTCGCGA AGACGAAGGA CTGCCTCGGC AAGCGTTCGC TCGCGCGTTC CGATACCGCA AAGGCGGGCC GCAAGCAGTT CGTCGGCCTG TTGACCGACG ATGCGCAGTG CGTGCTGCCG GAGGGCGCGC AGATCATCGA CAAGGACACG CAGGTCCGCG TGACGGAACC GACGCCGATG ATCGGCCACG TGACGTCGAG CTACTACAGC CCGATCCTGC AACGTTCGAT CGCGCTGGCG GTGGTGAAGG GTGGTCTGGG CAAGATGGGC GAGAGCGTCG TGATTCCGCT GGCCAACGGC AGGCGTGTCA CCGCGAAGAT CGCGAGCCCG GTTTTCTACG ATACGGAAGG GGTGCGTCAG CATGTGGAAT GA
|
Protein sequence | MSQKDRLGTG GRINRAIPLT FTFNGRTYQG FQGDTLASAL LANGVHFVAR SFKYHRPRGI MTAGVEEPNA VVQLESGPYS VPNARATEIE LYQGLIATSV NAEPSLENDR YAISQMFSRF LPAGFYYKTF MWPRKMWPKY EEKIREAAGL GKAPDMRDAD RYDKCYAHCD VLVVGGGPTG LAAAHAAAMA GARVILVEDQ RELGGSLLSC RAEIGGKPAL QWVEKIEAQL RKLPDVSILT RSTAFGYQDH NLVTVTQRLT DHLPISMRKG TRELLWKVRA KRVILATGAH ERPIVFGNND LPGVMLAGAV STYIHRFGVL PGRDAVVFTN NDRAYQTALD LKACGAKVTV VDARAPGNGA LPAVAKRQGV TVMHGAVITA ASGKWRVSSV DVASYANGQV GGKQKTLPCD LVATSGGFSP VLHLFAQSGG KAQWNDDKAC FVPGKTVQAE ASVGAAAGEF ALAHALQLAV DAGAEAAQAA GCTAAQRAVA PRVAETAEGA LQPLWLIGSR EAAARGPKQF VDFQNDVAVT DILLAAREGF ESVEHVKRYT AMGFGTDQGK LGNINGMAIL AQALGKSIPE TGTTTFRPNY TPVSFGTFAG RELGNFLDPV RKTCIHEWHV EHGALFEDVG NWKRPWYFPK NGEDLHAAVK RECLAVRNGV GMLDASTLGK IDIQGPDAVK LLNWVYTNPW GKLDVGKCRY GLMLDENGMV FDDGVTVRLA DQHFMMTTTT GGAARVLTWL ERWLQTEWPD MKVRLASVTD HWATFAVVGP KSRKVVQKVC QDIDFGNEAF PFMSYRNGTV AGAKARVMRI SFSGELAYEV NVPANAGRAV WEALMAAGAE FDITPYGTET MHVLRAEKGY IIVGQDTDGS ITPSDLGMGG LVAKTKDCLG KRSLARSDTA KAGRKQFVGL LTDDAQCVLP EGAQIIDKDT QVRVTEPTPM IGHVTSSYYS PILQRSIALA VVKGGLGKMG ESVVIPLANG RRVTAKIASP VFYDTEGVRQ HVE
|
| |