Gene Information |
Locus tag | PSPTO_2450 |
Symbol | soxA-2 |
ID | 1184102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas syringae pv. tomato str. DC3000 |
Kingdom | Bacteria |
Replicon accession | NC_004578 |
Strand | - |
Start bp | 2705595 |
End bp | 2708501 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637393825 |
Product | sarcosine oxidase, alpha subunit |
Protein accession | NP_792264 |
Protein GI | 28869645 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | |
|
|
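The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2705595
end_bp = 2708501

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 2907 968
```

Under these assumptions the result agrees with the listed Gene Length (2907 bp) and Protein Length (968 aa).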
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0353624 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACCC CCGCCCGCCT GCCCGCGCCC ATGGGCCTGC TGATCGATCG CAACCAGCCG CTCACGTTCA GTTTCGACGG TAGAACCTAT CAGGGCCTAC AAGGCGACAG CATTGCCAGC GCCCTGTTGG CCAACGGTCG TTTTCTGCTT TCGCGCTCGT TCAAATACCG CCGCCCGCGT GGCCCGCTGA CCATGGCCGG ACAGGACGCC AACAGCCTGA TCCAGTTGCC TCACGAGCCC AACGTGCTGG CCGATACCTT CGCGCTGCAC GAAGGCCTGC AGGCCAACGG GCAGAACTTC AACGGCTCGC TGGACAACGA CAAGGATGCG TACCTGGGCA AGTTCTCAAA ATTCATGCCG GTGGGCTTCT ATTACCGATC CTTCTACAAG CCCAAAGGCA TGTGGAAAAT CTGGGAGCCG ATCATTCGCA AAAAGGCCGG TCTGGGCGTG CTGGACCTGA ATTTCCAGCC TGAGTATTAC GACAAGGCCT ATCTGTTCAC TGACCTGGCG GTGATCGGCG CAGGGCCTGC CGGGTTGCAG GCAGCGCTGA CTGCCGCCAA CGCCGGGGCC CAGGTGTTGC TGATCGAGCA GCAGCCCATT CTCGGCGGCT CGCTGACCTA CGCGCGCTTC GATATCGACG GCCAGCGTGC CGAACAACTG CGCCTCGAAC TGGTCGCTGC GGTAGAAGCA CACGACAATA TTCAGGTGCT CAAACAGGCC ACGTGCAATG CCTGGTTTAC CGACAATTAC CTGCCGGTGA TTCAAGGCAA GCGCCTGTAC AAAGTGCGCG CCACGCAATG CCTGGTGTGC AGCGGTTCTC TGGATCAGCC GGTGATTTTT CGTAACAACG ATTTGCCCGG CGTGATGCTG ACCAGCGCCG TGCAGCGCCT GATGAAGCTC TACGCAGTCA AGCCGGGCAA ACGCGCGGTG GTGCTGACCG GCAATGATGA CGGCTACCTC GCCGCCCTGG ATTTGCACGA TCAGGGCGTG GAGGTGGTCG CCGTTGCCGA CATGCGCACC AACCCAGCGG ATCGAGAATT ACAGCTCGCC CTGGAGAAGC GCGGCATCGC GTGCCACATG AGCACCACGG TCTACGAGGC GCTGCACGAA AAAGGCATGC GCCATGTCAG CGGTGCCGAA CTGCGCAAGA TCACCGGTCA AGGCCAAGTG GCCAATCATG GCTTGAGTGT CGAGTGCGAC CTGCTGTGCA TGTCGGGCGG CTACATGCCG GTCTATCAGT TGCTTTGTCA GGCGGGCGGC AAGCTTGCCT ACGACGATCA ACAGGCCGAA TTCACCCTCA GCGGCCTGCC GCAGAACCTG GGCATTGCAG GCTCGGTCAA TGGCTATCAC GTGCTGGATA ACGTGCTGGC AGACGCTACC CATGCCGCCG GCGGAATGCT CTCGGCGCCA GGCCTGGAAC CCAACCAAAA TCCGCCTGCG TTGCGCCCTG AAGCAAAGGT CAATTTCAGC TGGCCGATCT TCCCGCACCC CAAGGGCAAG GATTTCGTCG ATTTCGATGA AGACCTGCAA GTGCGCGACA TCGTCAACGC CACGCGCATC GGCTACCGCG ACATCCAACT GGTCAAACGC TACTCCACGG TCGGCATGGG CCCGTCGCAA GGCCGCCACT CTGCGCTGCC GACTGCACGC CTGGTGGCGG CGTCAACCCA GCGCAGCATC AGCGAAACCG GCGTGACTAC GGCGCGTCCG CCCTTCGAGG CCGAGAAACT GGCGCACGTG GCGGGTCGCG CCTTCGACCC GTACCGCCAG ACGCCCATGC ATCAACGGCA CGTGCAGGCA 
GGCGCGAAAA TGATGCCCGC CGGGATCTGG CAGCGCCCGG CGTTCTACGG CAAGCCGTCG CGACGTGATG CCTGCATGCA GCAGGAAGCC CTGCACGTGC GTAACAAGGT CGGCATCATC GACGTCTCGA CCCTCGGCGG CCTGGACATA CGCGGCCCGG ACGCCGCCGA GCTACTCAAC CGTATGTATA CCTTCGCCTT CCTCAAGCAG CCAGTCGGGC GTTCGCGCTA TGCATTGATG ACCAACGAAC AGGGCGTGGT GATCGATGAC GGCGTGTGTG CACGGTTTGC CGAACAGCAT TTCTACGTCA CCGCCACCAC CAGCGGCGTG GATCGTATCT ATCAGCAGAT GCTCAAGTGG AACGCGCAAT GGCGTCTGAA CGTGGACATC ACCAACGTCA CGGCGGCCAT TGCGGCGGTC AACGTGGCAG GCCCGGATTC ACGCAAGGTG CTGGCGCAGG TGTGCAGCGA TCTTGACCTG TCTACCGAGG GTTTTCCCTA TCTTGGCGTG CGGCAAGGCA CTGTTGCAGG CATCAAGGCA CGCTTGCTGC GAGTCGGCTT CGTCGGCGAA CTGGGCTACG AGATCCACGT TGCGGCGCGG CATGCCCTCA AGCTGTGGGA TGCGCTGAGC GAAGCGGGCA AGGCGTTCGA CATGCGCCCT TTCGGCGTCG AGACCCAGCG TCTGTTACGC CTTGAAAAAG GCCATGTGAT CATCAGTCAG GACACCGATG GCATGACGCA TCCCGGCGAG ATCGACATGG GCTGGGCGGT CAGTCGAACC AAGCCGTTCT TCGTCGGCCG CCGCGCGGTG GACATCCTCG AAGCCTTGCC GCAAAAGCGC AAACTGGTCG GTTTCACCCT GCCCAAAGCC AGCCCGCTGC CGCTTGAGGG CCATCTGGTG CTCAAAGGCG CGGACATCAG CGGCAATGTC ACCTCCTGCG AGTACTCGCA AACCCTCGAC ATGATTATCG GCATGGCCTA CGCCGCGTTC GATCAGAGCA CGCCCGGCCA GCAGATTCCG ATCCGCGTCG AAGACGGCGT GGTGGTTCAG GCCACCGTCG TGAAACTGCC CTTTTTCGAC CCTGAAAACC AGCGCCAGGA GCTCTGA
|
Protein sequence | MNTPARLPAP MGLLIDRNQP LTFSFDGRTY QGLQGDSIAS ALLANGRFLL SRSFKYRRPR GPLTMAGQDA NSLIQLPHEP NVLADTFALH EGLQANGQNF NGSLDNDKDA YLGKFSKFMP VGFYYRSFYK PKGMWKIWEP IIRKKAGLGV LDLNFQPEYY DKAYLFTDLA VIGAGPAGLQ AALTAANAGA QVLLIEQQPI LGGSLTYARF DIDGQRAEQL RLELVAAVEA HDNIQVLKQA TCNAWFTDNY LPVIQGKRLY KVRATQCLVC SGSLDQPVIF RNNDLPGVML TSAVQRLMKL YAVKPGKRAV VLTGNDDGYL AALDLHDQGV EVVAVADMRT NPADRELQLA LEKRGIACHM STTVYEALHE KGMRHVSGAE LRKITGQGQV ANHGLSVECD LLCMSGGYMP VYQLLCQAGG KLAYDDQQAE FTLSGLPQNL GIAGSVNGYH VLDNVLADAT HAAGGMLSAP GLEPNQNPPA LRPEAKVNFS WPIFPHPKGK DFVDFDEDLQ VRDIVNATRI GYRDIQLVKR YSTVGMGPSQ GRHSALPTAR LVAASTQRSI SETGVTTARP PFEAEKLAHV AGRAFDPYRQ TPMHQRHVQA GAKMMPAGIW QRPAFYGKPS RRDACMQQEA LHVRNKVGII DVSTLGGLDI RGPDAAELLN RMYTFAFLKQ PVGRSRYALM TNEQGVVIDD GVCARFAEQH FYVTATTSGV DRIYQQMLKW NAQWRLNVDI TNVTAAIAAV NVAGPDSRKV LAQVCSDLDL STEGFPYLGV RQGTVAGIKA RLLRVGFVGE LGYEIHVAAR HALKLWDALS EAGKAFDMRP FGVETQRLLR LEKGHVIISQ DTDGMTHPGE IDMGWAVSRT KPFFVGRRAV DILEALPQKR KLVGFTLPKA SPLPLEGHLV LKGADISGNV TSCEYSQTLD MIIGMAYAAF DQSTPGQQIP IRVEDGVVVQ ATVVKLPFFD PENQRQEL
|
| |
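The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the gene sequence is given on the coding strand (it starts with ATG even though the gene lies on the minus strand of the replicon) and uses the standard codon assignments, which translation table 11 shares. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same codon-to-amino-acid mapping (it differs only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block separators used in the record)
    is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence listed above.
first_codons = translate("ATGAACACCC CCGCCCGCCT GCCCGCGCCC")
last_codons = translate("CAGGAGCTCT GA")
print(first_codons)  # MNTPARLPAP
print(last_codons)   # QEL
```

The two fragments reproduce the first ten residues (MNTPARLPAP) and the last three residues (QEL) of the listed protein sequence, with the terminal TGA read as a stop codon.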