Gene Information |
Locus tag | CPS_2480 |
Symbol | soxA1 |
ID | 3522790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Colwellia psychrerythraea 34H |
Kingdom | Bacteria |
Replicon accession | NC_003910 |
Strand | + |
Start bp | 2596916 |
End bp | 2599939 |
Gene Length | 3024 bp |
Protein Length | 1007 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637284935 |
Product | sarcosine oxidase, alpha subunit |
Protein accession | YP_269196 |
Protein GI | 71282317 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
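The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with one untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the Gene Information table.
start_bp = 2596916
end_bp = 2599939

# Assumption: both coordinates are 1-based and inclusive,
# so the span covers (end - start + 1) bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (3024 bp) and Protein Length (1007 aa).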
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAAG TTAATCGAAT CGCTGGAAGC AGCAAACGCA TTAATCGCAA CCGCACCTTA ACCTTTAGCT TTAACGGCAA AGAATATACA GGTTTTGAAG GCGATACCGT CGCATCAGCC TTGTTAGCTA ATGGTGTTGA TGTCGTTGGG CGTAGTTTTA AGTACTCACG TCCTCGCGGT ATTATTACCA GTGACTCGCA AGAGCCGAAC GCCATTTTTC AAATTGGCTC GACGCAAGCG ACCACTATTC CTAACCCACG CGCGACACAA ACCGACTTGT ACCAAGGATT AACCGCAAGC TCAACCAACG GTTGGCCTAA TGTTGATTTC GATTTAATGG GCACCGTGGG CAAATTAGGT GGCTCGATGA TGCCGCCCGG GTTTTATTAC AAAACCTTTA TGTTTCCACA ATCGTTATGG ATGTCATACG AGCACTTAAT TCGCAAAGGC GCTGGTTTAG GGGCAAGTCC TCAGCAAAAT GACCCGGACA GTTATGACAA AATGCACCAT CATTGTGATG TGATGATTGT CGGTGGTGGT CCTGCGGGCT TAGCCGCAGC GTTATCTGCT GCGCAAACAG GCGCACGCGT TATCATCAGT GATGAGCAAA ATGAATTTGG CGGCAGTTTA TTATGCTCAA CGCAGCAAAT AGATGGCCAA TTGCCGAGTC AATGGGTAGA AAAAACCGTG GCACAGCTTA GCGAGATGGA TAACGTGATG TTACTTCCTC GCAGCACGGT GTTTGGTTAT TACGACCATA ACCTAGTGGG CATTAATGAA CGTCGCACCG ACCATTTAGG TGAGCATCAA CTGCAAAGCA CCCGTCAACG CGTGCATAAA GTGCGCGCTA AACAAGTGAT TTTAGCCACC GGTGCTCATG AGCGTCCGCT TGTTTATGGT AACAATGACG TGCCAGGTTG TATGTTAGCC AATGCAATTT CTACCTACAT TAATCGCTAT GATGTAGTAC CAGGCAAGCA ATTGGTGTTA ATGACCACCA ATGATAATGC CTACAAAACC GCGATTGATT GGCATCAAGC CGGTCGTAAA GTGGTCGCTA TCGTTGATAC GCGAAGCACC TCAAATGGCG ACTTGGTCAA TAAGGTCAAA AAACTGGGCA TCGATATCAT CTTTGGCCAT GGCGTGATTG AAGTCAAAGG CAGCAAACGC GTCAAAGGCG TTGAGGTTGC GCCAATCAAT GCGAGTAATC ACAGTGTTAC TGGTCCAGCG AAACATATTG TCTGTGATAC GGTTGCCAGC TCAGGTGGTT GGAGCCCTGT TATTCATTTG TCATCACACA CAGGCTCGCG TCCGGTGTGG AACGACGACA TTGCGGGGTT TGTACCCGGT GATACCGTGC AAAAGCAACA CAGTTGCGGT GGACTGGAAG GCGTTTACGC GTTATCAAAA GTCATCAGTG ATGGTTTCAC CACTGGCGCT GTCGCAGCAG AGGCCGCAGG CAAAGGTGAT GGACGTTATG CGGGGAACTC GCCAACAACC AGCGACCCAC AAGAAGATGC GTCCATGGCG CTGTTTCACA TACCGCACAG TAAAAAAACC AGTCGCGCGC CAAAACAGTT TGTTGATTAT CAAAATGATG TCACCGCCGC AGGTATTGAA CTGGCAAACC GTGAAGGCTT TGAATCGATT GAGCATGTCA AACGCTACAC CGCGTTAGGT TTTGGTACGG ACCAAGGCAA GTTAGGTAAT ATCAACGGCA TGGCAATTAC CGCTAAATCG TTAGGTAAAA CTATCCCTGA AACGGGCACC ACTATCTTCC GCCCTATGTA TACCCCCACC 
ACGTTTGGCG CCTTAGCGGG TGCGGATGTG AAGCACTTGT TCGACCCAGC ACGTTTTAGC GCTATGCATA AATGGCATTT AGAAAATGGC GCTGAGTTTG AAGATGTTGG CCAATGGAAA CGCCCGTGGT ACTTCCCACA GCCAGGCGAA ACCATGCAGC AATCACTCGA GCGTGAATGT TTAGCAACAC GTAACAGTGT CGGTATTTTA GATGCTTCGA CCTTAGGTAA AATTGATATT CAAGGCAAAG ATGCACGCGA ATTTTTAAAC CGCGTCTATA CCAACCCATG GAGCAAGTTA GGCGTAGGCA AATGTCGCTA CGGCGTTATG TGTAAAGAAG ACGGTATGGT CTTTGATGAT GGGGTTACCG TCTGTCTTGA CGATAATCGT TTTATCATGA CCACCACCAC TGGCGGGGCG GCGGGCGTAT TGCAATGGTT AGAGCTATGG CATCAAACCG AGTGGCCTGA GCTGGAGGTG TATTTCTCAA CCGTGACTGA CCATTGGTCA ACCATGACTA TCTCAGGACC TAACTCTCGT AAAGTCTTGG AGAAAATCTG TGATATTGAT GTCAGTAATG ACAGTTTCAA GTACATGGAT TGGCGCGCAG CGACGGTTGC GGGGGTTAAA GCACGCATTT TCCGTATCTC GTTTACCGGC GAGCTGTCGT TTGAAATTAA CGTGCAAGCA AACTATGGCA TGCATGCCTG GAAAGCGGTG ATGGCGGCGG GTGAAGAATT TAATATCACC CCGTATGGCA CCGAAACCAT GCATATTTTA CGTGCAGAAA AAGGCTTTAT CATTGTCGGA CAAGACACCG ATGGCTCGGT GACACCACAA GATTTAGACA TGGACTGGGT TGTGGGTAAG AAAAAAGACT TTAGCTTTAT TGGTAAACGC TCTTGGACGC GCTTTGACAA TAAACGTGAC GATCGTAAAC AAATGGTGGG CTTGAAACCG AAAGACCCTA CTTTTGTACT GCCTGAAGGC GCACAAATTG TCTTTGAGAA AAACCAATCC ATCCCAATGA AAATGGTGGG TCACGTTACC TCAAGTTATT ACAGTGCTTG TATGGGCTAC TCGTTTGCCT TAGCAGTCGT TAAAGGCGGT ATTAGCCGCA AAGGTGAGAG TGTCTATTTG CCATTAAGTG ATGGCACCAC CGTGGAAGCT GAAATTTGCA GCCCAGTATT TTATGATCCA AAGGGAGACC GTCAAAATGT CTAA
|
Protein sequence | MSQVNRIAGS SKRINRNRTL TFSFNGKEYT GFEGDTVASA LLANGVDVVG RSFKYSRPRG IITSDSQEPN AIFQIGSTQA TTIPNPRATQ TDLYQGLTAS STNGWPNVDF DLMGTVGKLG GSMMPPGFYY KTFMFPQSLW MSYEHLIRKG AGLGASPQQN DPDSYDKMHH HCDVMIVGGG PAGLAAALSA AQTGARVIIS DEQNEFGGSL LCSTQQIDGQ LPSQWVEKTV AQLSEMDNVM LLPRSTVFGY YDHNLVGINE RRTDHLGEHQ LQSTRQRVHK VRAKQVILAT GAHERPLVYG NNDVPGCMLA NAISTYINRY DVVPGKQLVL MTTNDNAYKT AIDWHQAGRK VVAIVDTRST SNGDLVNKVK KLGIDIIFGH GVIEVKGSKR VKGVEVAPIN ASNHSVTGPA KHIVCDTVAS SGGWSPVIHL SSHTGSRPVW NDDIAGFVPG DTVQKQHSCG GLEGVYALSK VISDGFTTGA VAAEAAGKGD GRYAGNSPTT SDPQEDASMA LFHIPHSKKT SRAPKQFVDY QNDVTAAGIE LANREGFESI EHVKRYTALG FGTDQGKLGN INGMAITAKS LGKTIPETGT TIFRPMYTPT TFGALAGADV KHLFDPARFS AMHKWHLENG AEFEDVGQWK RPWYFPQPGE TMQQSLEREC LATRNSVGIL DASTLGKIDI QGKDAREFLN RVYTNPWSKL GVGKCRYGVM CKEDGMVFDD GVTVCLDDNR FIMTTTTGGA AGVLQWLELW HQTEWPELEV YFSTVTDHWS TMTISGPNSR KVLEKICDID VSNDSFKYMD WRAATVAGVK ARIFRISFTG ELSFEINVQA NYGMHAWKAV MAAGEEFNIT PYGTETMHIL RAEKGFIIVG QDTDGSVTPQ DLDMDWVVGK KKDFSFIGKR SWTRFDNKRD DRKQMVGLKP KDPTFVLPEG AQIVFEKNQS IPMKMVGHVT SSYYSACMGY SFALAVVKGG ISRKGESVYL PLSDGTTVEA EICSPVFYDP KGDRQNV
|
| |
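The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, estimate GC content, and translate a fragment. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard code, and it uses only the first 30 bases and the last 6 bases of the gene sequence as sample input. The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table built from the NCBI ordering of bases (T, C, A, G).
# Assumption: translation table 11 assigns the same amino acids as the
# standard code; the two tables differ only in permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace used to group the sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence, as printed in the record.
head = clean_sequence("ATGAGCCAAG TTAATCGAAT CGCTGGAAGC")
# Last six bases of the gene sequence (final sense codon plus stop codon).
tail = clean_sequence("GT CTAA")

print(translate(head))
print(translate(tail))
print(round(gc_fraction(head), 3))
```

Running the same functions over the full gene sequence, after joining its lines, would allow the listed GC content and protein sequence to be compared with the nucleotide data.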