Gene Information |
Locus tag | PP_0325 |
Symbol | soxA |
ID | 1044023 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas putida KT2440 |
Kingdom | Bacteria |
Replicon accession | NC_002947 |
Strand | + |
Start bp | 389725 |
End bp | 392739 |
Gene Length | 3015 bp |
Protein Length | 1004 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637143704 |
Product | sarcosine oxidase, alpha subunit family |
Protein accession | NP_742492 |
Protein GI | 26987067 |
COG category | [E] Amino acid transport and metabolism; [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase); [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
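The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are both inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 389725
END_BP = 392739


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end are both inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS ending in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp")  # 3015 bp
print(length_aa, "aa")  # 1004 aa
```

Both results agree with the Gene Length and Protein Length fields listed in the record.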
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.684006 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0259905 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGA CCTATCGCCT CGCCAGCGGC GGCCGTATCG ACCGCAGCAA GGTCCTGAAC TTCACCTTCA ACGGCAAGAC CTACCAGGGT TATGCCGGTG ACAGCCTGGC CGCCGCGTTG CTGGCCAACG GCGTCGACAT TGTCGGCCGT AGCTTCAAGT ACTCGCGCCC ACGCGGCATC ATCGCCGCCG GTACCGAAGA GCCGAACGCC ATCCTGCAGA TCGGCTCCAG CGAAGCTACC CAGATCCCCA ACGTGCGCGC CACCCAACAG GCACTGTACG CGGGCCTTGT CGCCACCAGC ACCAACGGTT GGCCGAACGT CAACAACGAC GTCATGGGCA TCCTCGGCAA GGTTGGCGGC AGCATGATGC CGCCGGGCTT CTACTACAAA ACCTTCATGT ACCCCAAATC GTTCTGGATG ACTTACGAGA AGTACATCCG TAAAGCCGCC GGCCTGGGCC GTGCGCCGCT GCAGAACGAT CCTGACAGCT ACGACTACAT GAACCGGCAC TGCGACGTGC TGATCGTCGG CGCCGGCCCT GCTGGCCTGG CTGCCGCACT GGCCGCTGCG CGCAGTGGTG CCCGCGTGAT CCTGGCTGAC GAGCAGGAAG AGTTCGGCGG CAGCCTGCTC GACACCCGCG AAACCCTCGA CGGCAAGCCT GCTGCCGACT GGGTCAACGC CGTGGTCAAA GAGCTGGAAG GCCTGCCGGA AGTGACCCTG CTGCCACGTG CCACGGTCAA CGGCTACCAC GACCATAACT TCCTGACCAT TCACGAGCGC CTCACCGACC ACCTCGGCGA TCGCGCCCCG ATCGGTCAGG TTCGCCACCG CGTGCACCGC GTTCGCGCCA AGCGCGTGGT ACTGGCTGCC GGCGCCCACG AGCGCCCGCT GGTGTACGGC AACAACGACG TGCCGGGCAA CATGCTGGCC GGTGCTGTAT CCACCTATGT TCGCCGCTAT GGCGTGGCGC CGGGTCGCAA GTTGGTACTG TCGACCAACA ACGACCACGC TTATCGCGCC GCGCTGGACT GGCACGACGC AGGCCTGCAA GTGGTCGCCA TCGCCGACGC CCGCCACAAC CCACGTGGCT CGCTGGTTGA AGAAGCGCGT GCCAAAGGCA TTCGCATACT CACCTCCAGC GCCGTGATCG AGGCCAAAGG CAGCAAGCAC GTCACCGGCG CCCGTGTGGC GGCGATCGAT GTGCAGGCGC ACAAAGTCAC CAGCCCAGGC GAAGTCCTTG AGTGCGACCT GATCGCCTCC TCGGGCGGTT ACAGCCCGAT CGTGCACCTG GCTTCGCACC TGGGCGGTCG CCCGGTATGG CGTGACGACA TCCTTGGCTT CGTGCCGGGC GATGCGCCGC AGAAGCGTGA GTGCGTCGGT GGTATCAACG GCGTGTATGC CTTGGGCGAT GTCATTGCCG ATGGCTTCGA AGGCGGCGTC CGCGCAGCCA CCGAGGCCGG TTTCAAGGCC ACTGTCGGCA CCCTGCCAAA AACAGTGGCG CGCAAGGAAG AGGCCACTGT GGCACTGTTC CTGGTGCCGC ACGACAAAGG CACCAAGGGG CCGAAGCAGT TCGTCGACCA GCAGAACGAC GTGACCGCAG CCGGTATCGA GCTGGCCACC CGTGAAGGCT TCGAGTCGGT CGAGCACGTC AAGCGCTACA CCGCGCTGGG CTTCGGTACC GACCAGGGCA AACTGGGCAA CATCAACGGC CTGGCCATCG CCGCCCGTTC GATCGGCATC ACCATCCCGG AAATGGGTAC CACCATGTTC CGCCCCAACT ACACGCCGGT TACTTTCGGC 
GCGGTAGCGG GCCGTCACTG TGGTCACCTG TTCGAGCCCG TGCGCTTCAC TGCCCTGCAT GCCTGGCACG TGAAGAACGG CGCCGAGTTC GAAGACGTCG GCCAGTGGAA GCGCCCGTGG TACTTCCCGA AAGCCGGTGA AGACATCCAT GCTGCCGTGA CTCGCGAATG CAAGGCCGTG CGCGACAGCG TGGGCCTGCT GGACGCCTCG ACCCTGGGCA AGATCGACAT CCAGGGCCCG GACGCGCGCG AGTTCCTCAA CCGCATCTAC ACCAACGCCT GGACCAAGCT CGACGTGGGC AAGGCCCGCT ACGGCCTGAT GTGCAAGGAA GACGGCATGG TCTTCGACGA CGGCGTAACC GCCTGCGTCG GCGACAACCA CTTCATCATG ACCACCACCA CCGGCGGCGC TGCCCGCGTA TTGCAGTGGA TGGAGCTGTA TCACCAGACC GAATGGCCAG AGCTGAAGGT GTACTTCACT TCGGTCACCG ACCACTGGGC CACCATGACC CTGTCCGGCC CTAACAGCCG CAAGCTGCTG AGCGAGCTGA CCGACATCGA CATGGACAAG GAAGCCTTCC CGTTCATGAC CTGGAAGGAA GGCAACGTCG GTGGCGTGCC GGCCCGCGTG TTCCGTATCT CGTTCACCGG CGAGCTGTCG TACGAAGTGA ACGTGCAGGC CAACTACGCC ATGGGCGTGC TGGAACAGAT CATCGAGGCC GGCAAGAAGT ACAACCTGAC CCCGTACGGC ACCGAGACCA TGCACGTACT GCGTGCAGAG AAGGGCTTCA TCATCGTGGG CCAGGACACC GATGGTTCGA TGAACCCGGA CGACCTGAAC ATGAGCTGGT GCGTTGGCCG CAACAAGCCG TTCTCGTGGA TCGGCCTGCG TGGCATGAAC CGCGAAGACT GCGTGCGTGA GAACCGCAAG CAGCTGGTGG GCCTGAAACC GGTCGACCCG ACCAAGTGGC TGCCGGAAGG CGCCCAACTG GTGTTCGACC CGAAACAGCC GATCCCGATG GACATGGTCG GCCACGTCAC CTCCAGCTAC GCGTCCAACT CCCTGGGCTA CTCGTTCGCC ATGGGTGTGG TCAAAGGCGG CCTCAAGCGC ATGGGCGAGC GTGTCTACTC GCCGCAGGCG GATGGCAGCG TGATCGAGGC GGAAATCGTG TCTTCGGTGT TCTTCGATCC GAAGGGTGAG CGGCAGAACG TCTAA
|
Protein sequence | MSQTYRLASG GRIDRSKVLN FTFNGKTYQG YAGDSLAAAL LANGVDIVGR SFKYSRPRGI IAAGTEEPNA ILQIGSSEAT QIPNVRATQQ ALYAGLVATS TNGWPNVNND VMGILGKVGG SMMPPGFYYK TFMYPKSFWM TYEKYIRKAA GLGRAPLQND PDSYDYMNRH CDVLIVGAGP AGLAAALAAA RSGARVILAD EQEEFGGSLL DTRETLDGKP AADWVNAVVK ELEGLPEVTL LPRATVNGYH DHNFLTIHER LTDHLGDRAP IGQVRHRVHR VRAKRVVLAA GAHERPLVYG NNDVPGNMLA GAVSTYVRRY GVAPGRKLVL STNNDHAYRA ALDWHDAGLQ VVAIADARHN PRGSLVEEAR AKGIRILTSS AVIEAKGSKH VTGARVAAID VQAHKVTSPG EVLECDLIAS SGGYSPIVHL ASHLGGRPVW RDDILGFVPG DAPQKRECVG GINGVYALGD VIADGFEGGV RAATEAGFKA TVGTLPKTVA RKEEATVALF LVPHDKGTKG PKQFVDQQND VTAAGIELAT REGFESVEHV KRYTALGFGT DQGKLGNING LAIAARSIGI TIPEMGTTMF RPNYTPVTFG AVAGRHCGHL FEPVRFTALH AWHVKNGAEF EDVGQWKRPW YFPKAGEDIH AAVTRECKAV RDSVGLLDAS TLGKIDIQGP DAREFLNRIY TNAWTKLDVG KARYGLMCKE DGMVFDDGVT ACVGDNHFIM TTTTGGAARV LQWMELYHQT EWPELKVYFT SVTDHWATMT LSGPNSRKLL SELTDIDMDK EAFPFMTWKE GNVGGVPARV FRISFTGELS YEVNVQANYA MGVLEQIIEA GKKYNLTPYG TETMHVLRAE KGFIIVGQDT DGSMNPDDLN MSWCVGRNKP FSWIGLRGMN REDCVRENRK QLVGLKPVDP TKWLPEGAQL VFDPKQPIPM DMVGHVTSSY ASNSLGYSFA MGVVKGGLKR MGERVYSPQA DGSVIEAEIV SSVFFDPKGE RQNV
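The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It relies on the fact that translation table 11 assigns amino acids to codons exactly as the standard code does, and differs only in the permitted start codons. The helper names `clean` and `translate` are hypothetical. The sketch assumes an unambiguous A/C/G/T sequence and will raise `KeyError` on any other symbol.

```python
from itertools import product

# Amino-acid assignments with codons ordered T, C, A, G at each position.
# Table 11 shares these assignments with the standard code (table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons and last five codons of the gene sequence above.
print(translate("ATGAGCCAGA CCTATCGCCT C"))  # MSQTYRL
print(translate("CGGCAGAACG TCTAA"))         # RQNV
```

The two fragments reproduce the first seven and last four residues of the protein sequence shown in the record. Passing the full gene sequence to `translate` would be the natural next check.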