Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PSPTO_0458 |
Symbol | soxA-1 |
ID | 1182067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas syringae pv. tomato str. DC3000 |
Kingdom | Bacteria |
Replicon accession | NC_004578 |
Strand | - |
Start bp | 500811 |
End bp | 503831 |
Gene Length | 3021 bp |
Protein Length | 1006 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637391826 |
Product | sarcosine oxidase, alpha subunit |
Protein accession | NP_790307 |
Protein GI | 28867688 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGA GCAATCGCCT GCCCAACGGC GGCCGCATCA ACCGCAGCAA AGTGCTCAAC TTCACCTTCA ACGGCCAGAC CTATCAGGGC TTTGAAGGCG ACTCGCTGGC GTCCGCGCTG CTGGCCAATG GCGTTGAGAT CATTGGTCGC AGTTTCAAGT ATTCCCGGCC ACGCGGCATC TTCGCGGCGG GTTCGGAAGA ACCGAACGCC GTGTTGCAGA TCGGCGCGAC CGAAGCGACG CAGATCCCCA ACGTGCGCGC CACCCAGCAA GCGCTTTATT CGGGGCTGGT GGCGACCAGC ACCAACGGCT GGCCGAGCGT AAACACCGAC GTAATGGGCA TTCTGGGCAA GGTCGGCGGC AAACTGATGC CTCCGGGCTT CTACTACAAA ACCTTCATGT ACCCGCAATC GTTCTGGATG ACGTACGAAA AGTACATCCG CAAGGCGGCA GGCCTCGGTC GCTCACCGAC CGAGAACGAC CCGGACAGCT ACGACGCGAT GAACCAGCAC TGTGATGTGC TGATCGTCGG CGCAGGCCCG GCTGGTCTGG CCGCCGCACT GGCAGCGAGC CGCAGCGGCG CCCGAGTGAT CATCGCCGAT GAACAGGAAG AATTCGGCGG CAGTCTGCTC GACAGCCGTG AAAGCCTGGA TGGCAAACCC GCTGCCGAGT GGGTTGCAAC AGTAGTTGCC GAGCTGAAAA GCCTGCGCAA CGTGACACTG CTGCCACGCG CCACGGTCAA CGGTTATCAC GATCACAATT TTCTGACCAT CCACGAGCGT CTGACCGACC ACCTCGGTGA TCGCGCGCCC ATCGGCATGG TTCGCCAGCG CATGCACCGG GTGCGCGCCA AACGTGTGGT GCTGGCTGCC GGCACGCACG AGCGGCCGCT GGTCTACGGC AACAACGATG TGCCGGGCAA CATGCTTGCG GGTGCAATCT CGACGTATGT ACGTCGTTAC GGCGTCGCGC CGGGCAAAAA ACTCGTGCTG TCCACCAACA ACGACCACGC CTACCGCGTC GCGCTGGACT GGCTCGACGC CAGCCTGCAC GTGGTGGCGA TTGCCGATGC GCGGCACAAT CCACGCGGCC CGCTGGTTGA AGAGGCTCGC GCCAAAGGCA TTCGTATTCT GACCGGCAGC GCCGTGATCG AAGCGCGTGG CAGCAAGCGG GTGAGCGCTG CACGCGTTGC CGCCATCGAC CTCAAAACGC ACAGCGTCAC CAGCCCCGGC GAGTGGCTTG AGTGCGATCT GGTGGCAAGT TCCGGCGGTT ACAGCCCCAT CGTCCATCTG GCCTCGCACC TGGGCGGCAA GCCGGTCTGG CGTGAAGACA TCCTCGGTTT CGTGCCGGGC GAAGCGCCGC AGAAACGTAT CTGTGTGGGC GGCGTGAACG GCGTCTACAG CCTGGCAGAC AGCCTGGCCG ATGGTTTCGA GGGTGGCGTG CGCGCTGCCA GCGAAGCAGG CTTCAAGATT GTCGAGGGCG TGATGCCCAA AGCCCTCAGC CGCGCCGAAG AACCGACGCT GGCGCTGTTC CAGGTGCCCC ATGAAAAAGG CACTGCACGG GCGCCCAAGC AATTCGTCGA TTTCCAGAAC GACGTGACCG CCGCCGCCAT TGAACTGGCG ACCCGTGAGG GTTTCGAGTC GGTCGAGCAC GTCAAACGCT ACACCGCGCT GGGCTTCGGC ACCGATCAGG GCAAGCTGGG CAACGTCAAC GGGCTGGCCA TCGCAGCCCG CTCGTTGAAC GTGTCGATCC CGGAAATGGG CACCACCATG TTCCGCCCCA ACTACACACC GGTCACCTTC GGCGCGATTG CCGGGCGTCA TTGCAAGCAC ATCTTCGAGC CGGTGCGCTT CACCGCGCTG CATGCCTGGC ACATCAGAAA CGGTGCCGAA TTCGAAGACG TCGGCCAGTG GAAACGCCCG TGGTACTTCC CTAAAAACGG TGAAGACCTG CCCGCTGCCG TGGCGCGTGA ATGCAAGGCG GTGCGTGACA GCGTCGGCCT GCTGGACGCC TCGACGCTGG GCAAGATCGA CATTCAAGGG CCGGATGCGC GCGAATTCCT CAACCGCATC TATACCAACG CCTGGACCAA ACTGGACGTG GGCAAGGCCC GTTACGGCCT GATGTGCAAG GAAGACGGCA TGGTCTTCGA TGACGGTGTG ACTGCCTGCC TCGCCGACAA CCACTTCCTG ATGACCACCA CGACGGGCGG TGCTGCACGC GTCCTGCAAT GGCTGGAAAT CTATCAGCAA ACCGAATGGC CGGACCTGAA AGTGTATTTC ACCTCAGTGA CCGATCACTG GGCGACCCTG ACGCTGTCTG GCCCCAACAG CCGCAAGCTG CTCAGCGAAG TGACCGATAT CGACCTGGGC CGTGAAGCGT TCCCGTTCAT GACCTGGAAA GAAGGCTTGG TCGCAGGCGT GCCGGCGCGG GTGTTCCGTA TCTCGTTTAC CGGCGAGCTG TCCTATGAGG TCAACATTCA GGCCGACTAC GCGATGGGTG TTCTGGAAAA GATCGCCGAG GCCGGCAAGC AGTACAACCT GACGCCTTAT GGCACTGAAA CCATGCACGT ACTGCGCGCC GAAAAGGGTT TCATCATCGT CGGTCAAGAC ACCGACGGCT CGATGACCCC GGACGATCTG AACATGGGCT GGTGTGTCGG GCGGACCAAA CCGTTCTCGT GGATTGGCTG GCGCGGCATG AACCGCGAAG ACTGTGTGCG TGAACAACGC AAGCAATTGG TCGGCCTCAA ACCCATAGAC TCGACCCAAT GGCTACCGGA AGGCGCTCAG TTAGTGTTCG ACACCAGGCA GGCGATCCCG ATGAGCATGG TCGGCCACGT GACCTCCAGC TACGCGCACA ACTCGCTGGG CTATTCGTTC GCGATGGGCG TGGTCAAAGG CGGCCTGAAC CGGATCGGCG AGCACGTGTT CGCGCCGCTG GCCGATGGCA GCGTGATCGA AGCCGAGATC GTCTCGTCGG TGTTCTTCGA CCCGAAGGGC GAGCGTCAGA ATGTTGAGTA A
|
Protein sequence | MSQSNRLPNG GRINRSKVLN FTFNGQTYQG FEGDSLASAL LANGVEIIGR SFKYSRPRGI FAAGSEEPNA VLQIGATEAT QIPNVRATQQ ALYSGLVATS TNGWPSVNTD VMGILGKVGG KLMPPGFYYK TFMYPQSFWM TYEKYIRKAA GLGRSPTEND PDSYDAMNQH CDVLIVGAGP AGLAAALAAS RSGARVIIAD EQEEFGGSLL DSRESLDGKP AAEWVATVVA ELKSLRNVTL LPRATVNGYH DHNFLTIHER LTDHLGDRAP IGMVRQRMHR VRAKRVVLAA GTHERPLVYG NNDVPGNMLA GAISTYVRRY GVAPGKKLVL STNNDHAYRV ALDWLDASLH VVAIADARHN PRGPLVEEAR AKGIRILTGS AVIEARGSKR VSAARVAAID LKTHSVTSPG EWLECDLVAS SGGYSPIVHL ASHLGGKPVW REDILGFVPG EAPQKRICVG GVNGVYSLAD SLADGFEGGV RAASEAGFKI VEGVMPKALS RAEEPTLALF QVPHEKGTAR APKQFVDFQN DVTAAAIELA TREGFESVEH VKRYTALGFG TDQGKLGNVN GLAIAARSLN VSIPEMGTTM FRPNYTPVTF GAIAGRHCKH IFEPVRFTAL HAWHIRNGAE FEDVGQWKRP WYFPKNGEDL PAAVARECKA VRDSVGLLDA STLGKIDIQG PDAREFLNRI YTNAWTKLDV GKARYGLMCK EDGMVFDDGV TACLADNHFL MTTTTGGAAR VLQWLEIYQQ TEWPDLKVYF TSVTDHWATL TLSGPNSRKL LSEVTDIDLG REAFPFMTWK EGLVAGVPAR VFRISFTGEL SYEVNIQADY AMGVLEKIAE AGKQYNLTPY GTETMHVLRA EKGFIIVGQD TDGSMTPDDL NMGWCVGRTK PFSWIGWRGM NREDCVREQR KQLVGLKPID STQWLPEGAQ LVFDTRQAIP MSMVGHVTSS YAHNSLGYSF AMGVVKGGLN RIGEHVFAPL ADGSVIEAEI VSSVFFDPKG ERQNVE
|
| |