Gene Information |
Locus tag | PA14_71500 |
Symbol | soxA |
ID | 4380602 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas aeruginosa UCBPP-PA14 |
Kingdom | Bacteria |
Replicon accession | NC_008463 |
Strand | + |
Start bp | 6369948 |
End bp | 6372965 |
Gene Length | 3018 bp |
Protein Length | 1005 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639328364 |
Product | sarcosine oxidase alpha subunit |
Protein accession | YP_793893 |
Protein GI | 116053566 |
COG category | [E] Amino acid transport and metabolism; [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase); [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
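The coordinate and length fields in this table are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the gene length includes the stop codon. The function names are hypothetical, introduced only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 6369948
END_BP = 6372965


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3018
print(protein_length(length_bp))  # 1005
```

Both results agree with the Gene Length (3018 bp) and Protein Length (1005 aa) fields.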
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCCAGA TCAATCGCCT GTCCAGCGGC GGCCGCATCG ACCGCAACCG CCCGCTGACC TTCAGCTTCA ACGGCCAGCA CTACCAGGGC TATGCCGGCG ACACCCTGGC CGCCGCGCTG CTGGCCAACG GCGTCGACAT CGTCGGCCGC AGCTTCAAGT ACTCGCGGGC GCGCGGCATC GTCGCCGCCG GCGCCGAGGA GCCCAACGCG ATCCTGCAGA TCGGCTCCCG CGAAGCCACC CAGATCCCCA ACGTGCGCGC CACCCAGCAG GCGCTCTACG GCGGCCTGGT GGCCACCAGC ACCAACGGCT GGCCGAATGT GCAGAACGAC CTGATGGGAA TCTTCGGCAA GGTCGGCGGC AAGCTGATGC CGCCCGGTTT CTACTACAAG ACCTTCATGT ACCCGCAGTC GATGTGGATG ACCTACGAGA AGTACATCCG CAAGGCCGCC GGCCTGGGCC GCGCGCCGAC CGAGGTCGAT CCGGACAGCT ACGACTGGAT GAACCACCAC TGCGACGTGC TGGTGGTCGG CGCCGGCCCG GCCGGCCTGG CCGCCGCCCT CGCCGCCGCT CGCAGCGGCG CGCGGGTGAT TCTCGCCGAC GAGCAGGAAG AGTTCGGCGG CAGCCTGCTG GACACCCGCG AGACCCTCGA CGGCAAGCCC GCCGCCGAGT GGGTCGCGGA CGCCGTGGCC GAGCTGCAAG GCCTGCCGGA AGTCATCCTG CTACCGCGCT CCACGGTCAA CGGCTACCAC GACCACAACT TCCTCACCAT CCACGAACGG CGCACCGACC ACCTCGGCGA GGTCGCCCCG CTCGGCCAGG TCCGCCAGCG CGTGCACCGC GTACGCGCCA AGCGCGTGGT ACTGGCCGCC GGCGCCCATG AGCGGCCGCT GGTCTACGGC AACAACGACC TGCCCGGCAA CATGCTCGCC GGCGCGGTTT CCACCTATGT ACGGCGCTAC GGCGTGGCAC CGGGCAAGAA ACTGGTGCTG GCCACCAACA ACGACTACGC CTACCGCGTC GCCCTCGACT GGCAGGAAGC CGGCCTGCAA GTGGTGGCCA TCGCCGACGC CAGGGCCAAT CCGCGCGGCG AGTGGGTCGA GGAAGCACGC CAACGCGGTA TGCGGGTGAT CACCGGCAGT TCGGTGATCG AGGCTCGCGG CGGCAAGCGG GTCAGCGGCG CCAAGGTCGC CCGGATCGAC TTGCAGGCCA TGCGCGCCAG CGGCGGCGAA TGGCTGGACT GCGACCTGAT CGCCAGCTCC GGCGGCTACA GCCCGGTCGT GCACCTGGCC TCGCACCTGG GCGGCAAGCC GGAATGGCGC GAGGAAATCC TCGCCTTCGT CCCCGGCGAA GGCCTGCAGA AACGCATCTG CGCCGGCGCC GTGAACGGCG TGTTCGGGCT CGCCGAGGTG CTCGCCGACG GCTACCAGGC CGGGAGCCGC GCCGCCCTCG ACGCCGGCTA CAAGACCGCC GCCGGCAGCC TGCCGAAGGT CCAGCCGCGC CGCGAGGAAG CGTCCGTCGC GCTGTTCCAG GTGCCCCACG AGAAGCCCAC GGCACGCGCA CCGAAGCAGT TCGTCGATCC GCAGAACGAC GTCACCGCCG CCGCCATCGA ACTGGCCTGC CGCGAGGGCT TCGAGTCCAT CGAGCACGTC AAGCGCTACA CCGCGCTGGG CTTCGGCACC GACCAGGGCA AGCTGGGCAA CATCAACGGC CTGGCGATCG CCGCGCGGGC CCAGGGCAAG AGCATCGCCG ACACCGGCAC CACCATGTTC CGTCCGAACT ACACCCCGGT GACCTTCGGC 
GCCGTCGCCG GCCGCCACTG CGGGCACCTG TTCGAACCGG TGCGCTTCAC CGCCCTGCAC GCCTGGCACG TGAAGAACGG CGCCGAGTTC GAAGACGTCG GCCAGTGGAA GCGGCCGTGG TACTTCCCGC GCCGCGGCGA GGACATGCAC GCCGCCGTGG CCCGCGAATG CCGCGCGGTG CGCGAGGCGG TCGGCCTGCT CGACGCCTCG ACCCTGGGCA AGATCGACAT CCAGGGCCCG GACGCGCGGG AGTTCCTCAA CCGGGTCTAC ACCAACGCCT GGACCAAGCT CGACGTCGGC AAGGCGCGCT ACGGCCTGAT GTGCAAGGAA GACGGGATGG TCTTCGACGA CGGCGTGACC GCCTGCCTGG CCGACAACCA CTTCGTCATG ACCACCACCA CCGGCGGCGC CGCCCGCGTA CTGGAGTGGC TGGAGCTGTA CCACCAGACC GAATGGCCGG AGCTGAAGGT GTACTTCACC TCGGTCACCG ACCACTACGC GACCCTCACC CTGTCCGGCC CGAACAGCCG CAAGCTGCTC GCCGAAGTCA CCGACATCGA CCTGGACAAG GACGCCTTCC CCTTCATGAC CTGGAAGGAA GGCAAGGTCG CCGGGGTGCC GGCGCGGGTG TTCCGCATCT CCTTCACCGG CGAGCTGAGC TACGAGGTGA ACGTCCAGGC CGACTACGCC ATGGGCGTGC TCGAAGCGCT CGCCGAGCAC GGTGCGAAGT ACGGCCTGAC ACCCTACGGC ACCGAGACCA TGCACGTCCT GCGCGCCGAG AAGGGCTTCA TCATCGTCGG CCAGGACACC GATGCCTCGG TCACCCCGGA CGATCTCAAC ATGGGCTGGG CGGTGGGCCG CAGCAAGCCG TTCTCCTGGA TCGGCTGGCG CGGCATGAAC CGCGCCGACT GCCTGCGCGA GGATCGCAAG CAACTGGTCG GACTCAGGCC GAGCAACCCG CAGGAGGTAC TGCCCGAAGG CGCGCAACTG GTGTTCGACA CCCAGCAGGC GATCCCGATG AAGATGGTCG GCCACGTCAC CTCCAGCTAC ATGAGCGCCA GCCTCGGCCA CGGCTTCGCC CTGGCGGTGG TGAAAGGCGG CCTCAAGCGC ATGGGCCAGA AGGTCTACGC GCCGCTGGCG GACGGCCGCT TCATCGAGGC GGAGATCTGC TCCTCGGTGT TCTACGACCC CAAAGGGGAG CGGCAGAACG TGGATTGA
Protein sequence | MSQINRLSSG GRIDRNRPLT FSFNGQHYQG YAGDTLAAAL LANGVDIVGR SFKYSRARGI VAAGAEEPNA ILQIGSREAT QIPNVRATQQ ALYGGLVATS TNGWPNVQND LMGIFGKVGG KLMPPGFYYK TFMYPQSMWM TYEKYIRKAA GLGRAPTEVD PDSYDWMNHH CDVLVVGAGP AGLAAALAAA RSGARVILAD EQEEFGGSLL DTRETLDGKP AAEWVADAVA ELQGLPEVIL LPRSTVNGYH DHNFLTIHER RTDHLGEVAP LGQVRQRVHR VRAKRVVLAA GAHERPLVYG NNDLPGNMLA GAVSTYVRRY GVAPGKKLVL ATNNDYAYRV ALDWQEAGLQ VVAIADARAN PRGEWVEEAR QRGMRVITGS SVIEARGGKR VSGAKVARID LQAMRASGGE WLDCDLIASS GGYSPVVHLA SHLGGKPEWR EEILAFVPGE GLQKRICAGA VNGVFGLAEV LADGYQAGSR AALDAGYKTA AGSLPKVQPR REEASVALFQ VPHEKPTARA PKQFVDPQND VTAAAIELAC REGFESIEHV KRYTALGFGT DQGKLGNING LAIAARAQGK SIADTGTTMF RPNYTPVTFG AVAGRHCGHL FEPVRFTALH AWHVKNGAEF EDVGQWKRPW YFPRRGEDMH AAVARECRAV REAVGLLDAS TLGKIDIQGP DAREFLNRVY TNAWTKLDVG KARYGLMCKE DGMVFDDGVT ACLADNHFVM TTTTGGAARV LEWLELYHQT EWPELKVYFT SVTDHYATLT LSGPNSRKLL AEVTDIDLDK DAFPFMTWKE GKVAGVPARV FRISFTGELS YEVNVQADYA MGVLEALAEH GAKYGLTPYG TETMHVLRAE KGFIIVGQDT DASVTPDDLN MGWAVGRSKP FSWIGWRGMN RADCLREDRK QLVGLRPSNP QEVLPEGAQL VFDTQQAIPM KMVGHVTSSY MSASLGHGFA LAVVKGGLKR MGQKVYAPLA DGRFIEAEIC SSVFYDPKGE RQNVD
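The relationship between the gene sequence and the protein sequence can be spot-checked with a short script. This is a sketch using only the Python standard library. It assumes the sequences are pasted in the space-separated 10-character blocks shown in this record, and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard table (the two differ only in which codons may act as start codons, which does not matter here because this gene starts with ATG). The helper names are hypothetical.

```python
from itertools import product

# Codon order T, C, A, G at each position, as in the NCBI genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame nucleotide sequence; '*' marks a stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks and final 18 nucleotides of the gene sequence above.
first_30 = "ATGAGCCAGA TCAATCGCCT GTCCAGCGGC"
last_18 = "CGGCAGAACG TGGATTGA"

print(translate(first_30))  # MSQINRLSSG
print(translate(last_18))   # RQNVD*
```

The two fragments reproduce the first ten and last five residues of the protein sequence. Passing the full gene sequence to `gc_fraction` should give a value comparable to the GC content field in the Gene Information table.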