Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_3777 |
Symbol | soxA |
ID | 7388231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | + |
Start bp | 3142233 |
End bp | 3145226 |
Gene Length | 2994 bp |
Protein Length | 997 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643652547 |
Product | sarcosine oxidase alpha subunit |
Protein accession | YP_002550728 |
Protein GI | 222149771 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.152645 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGTG CCAATCGTAT TCCCGGCAAG GGCCGTCTGA CGCCCGCCAG AACCGCGCGT TTCACCGTGG ATGGCCGCAT TCTCACCGCC ATTGAGGGTG ATAGCGTGGC ATCAGCCCTG CTGGCCAATG GCATTCATCT GGTCGGTCGC TCGTTCAAAT ATCACCGTCC ACGCGGCATT CTCACCGCTG GCCCGGAAGA GCCAAACGCG CTGCTGAACA TTTCGCGCGA CAGTGCGCGG CGTCAGCCAA ACGTGCGTGC GCCCGTGCAG GAAGTGTTTG ATGGCATGCG CGTGCAGACG CAAAACCGCT GGCCGTCGCT GTCGATGGAT ATTGGCGCGG TCAACGACCT GCTCTCGCCC TTCTTTGCCG CAGGCTTCTA CTACAAGACC TTCATGTGGC CGAAGAGCTT CTGGCATAAG ATTTATGAGC CGATCATTCG CCGCGCCGCA GGCCTTGGCG TGGCCCCAAC CGAAGACGAC ACCGACCATT ACGCCAACCG CTACGCCCAT TGCGACGTGA TGGTGGTGGG CGGCGGCCTG GCTGGTCTGA CAGCTGCGCT TGCTGCGGCA GAAACTGGCG CTTCCGTGAT TATCTGCGAT GAACAGCCCG AAATGGGCGG TGCCTTCCAT TATGATACGG GAGCCGTGAT TGACGGCCAA AACGGCTATG ACTGGGCGCA AGCCACCGTG GCCAAGCTAA AAGCCATGGA CAATGTGACC GTGCTGACCC GCACCACGGC ATTTGGCTAT TACAATCACA ATTTCCTCGG TCTTGCCGAG CGCGTCACCG ACCATATCGC CAAACCCGCC AAGGGCTTGC CGCGCGAACG GCTGTGGCAG GTACGGGCGA AAAAGGTGGT GCTGGCCACG GGCTCCATTG AGCGCCACAT GGTGTTTGCC AACAACGACC GCCCCGGCAT TATGCTGGCA TCGGCGGCTC GCACCTATCT CAACCATTTT GGCGTGGCCG TGGGCGCAAA AGTGGCAGTC TACACCGCCC ATGATTCGGC CTATGAAGCG GCGATTGACC TGAAAAAAGC GGGCGTGCAG GTGGTGGCCA TCATCGACTG CCGCCGCAAT TCGGGAGCAA GCGTCTTGGC TGACGCCAAG GCGGCAGGCA TTGAGGTTTT GAGCGAACAT TGCGTTCTCG ATACCGCTGG GCGCCTACGG CTAAAGTCCA TCACTATTGC CAGCAAGGGC GGTTTTGATC GCCGCAAGCT GGCCGTCGAT GCGCTGCTGA TGAGCGGCGG CTGGACGCCT TCGGTGCATC TGTTCTCGCA GTCACGCGGC AAGCTGCGTT TCGACGCCGC CAACCAGCGC TTCCTACCAG ACATTTACGT GCAAGATAGT ATCTGCGTTG GAGCCTGCAA CGGCACGGAT GATCTCAGCG AACTGCTGAC TGAGGCCTAT GCCAGCGGCG CAAGGCTGGC AAAAGAAGCC GGCGCAGAAG GCGAAAGCGG CAGCGCACCA AGCGGTGCCA ACGCCTTTGC TTGGACCGGC GGCATGATTG GCGCTGCCGA AGGCGCTGGC CCGGATGATG CAGTCAAAGC CTTTATCGAC TTCCAGCACG ATGTCTGCGC CAAGGATATT CGCTTGGCGG TGCGCGAGGG CATGCATTCC ATCGAGCATA TCAAACGCTT TACCACCAAT GGCATGGCGT CTGATCAGGG CAAGCTGTCC AATATGCACG GGCTGGCAAT TGCCGCCGAA ATGCTGGGAA AAGAAATCCC GCAAGTGGGC CTCACCACCT TCCGCGCCCC CTATACGCCC GTAACATTCG GCACGCTGAT CAACCATTCG CGTGGCGAAT TGTTTGACCC AACCCGCAAA ACTCCGATGC ATGATCTGGA AACGGCGCTG GGTGCCGATT TTGAAGATGT CGGCAACTGG AAACGCGCCT GGTATTACCC CAAGCCCGGC GAGGATATGC ACGCCGCCGT CAACCGCGAG TGCAAAACCG TGCGCGAGGT TGCGGGCGTC TTCAATGCCT CAACGCTGGG CAAAATCGAG GTGGTTGGCC CGGATGCGGC CAAATTCCTC AACCTGATCT ACACCAACCC GTGGGATAGC CTCAAACCCG GCAAATGCCG CTATGGCATC ATGACCCGCG ACGACGGGTT TATCTATGAC GACGGCGTGG TTGGACGGCT GGCCGAAGAC CGCTTCCACG TCACCACCAC CACTGGCGGT GCCGCCCGTG TGCTGAACCA TATGGAAGAT TATCTGCAAA CGGAATTCCC GGAGCTAAAC GTCTGGCTCA CCTCCGCTTC CGAGCAATGG GCGGTGATTG CCGTGCAGGG ACCAAAGGCT AGAGAGATTA TCGAACCCTT CGTGGAAGGC ATTGATATTT CCAACGAGGC CTTCCCGCAT ATGAGCGTGG CGGAAGGGAA ATTCTGCGGC GTGCCAACCC GGCTGTTCCG CGTATCATTT ACAGGCGAAG TGGGCTTTGA AATCAACGTG CCCTCGGATT ACGGCGCATC CGTATTCGAA GCCGTGTGGA AGCGTGCGGA AACCATGGGC GCCTGCCTCT ATGGCACCGA AACCATGCAC GTGTTGCGCG CCGAAAAGGG CTATATCATC GTTGGCCAAG ACACCGATGG CACGCTGACG CCCGATGATG CCAATTACGG CTGGGCAGTG TCGAAGAAAA AGACCGATTT CGTTGGTATT CGCGGCCTCA AACGCCCGGA TCTGGTCCAG GAAGGCCGCA AGCAACTGGT CGGCCTCAAG ACCAAAGACC CCATTGAGGT GCTGGAAGAA GGCGCACAGA TTGTTGCCAA CCCCAACCAG CCCAAGCCGA TGACCATGCT GGGCCATGTC ACCTCGTCCT ACTGGTCGGA AAATCTTGGC CAATCAATTG CCATTGCCAC GGTGGCCGGT GGCCGGGCGC GGATGGGCGA AACGCTCTAT GTGCCGATGC CTGACAAGAC AATCGCCGTG GAAGTGACCG ACATGGTCTT TTACGACAAG GAAGGAAGCC GCATCCATGG TTGA
|
Protein sequence | MSGANRIPGK GRLTPARTAR FTVDGRILTA IEGDSVASAL LANGIHLVGR SFKYHRPRGI LTAGPEEPNA LLNISRDSAR RQPNVRAPVQ EVFDGMRVQT QNRWPSLSMD IGAVNDLLSP FFAAGFYYKT FMWPKSFWHK IYEPIIRRAA GLGVAPTEDD TDHYANRYAH CDVMVVGGGL AGLTAALAAA ETGASVIICD EQPEMGGAFH YDTGAVIDGQ NGYDWAQATV AKLKAMDNVT VLTRTTAFGY YNHNFLGLAE RVTDHIAKPA KGLPRERLWQ VRAKKVVLAT GSIERHMVFA NNDRPGIMLA SAARTYLNHF GVAVGAKVAV YTAHDSAYEA AIDLKKAGVQ VVAIIDCRRN SGASVLADAK AAGIEVLSEH CVLDTAGRLR LKSITIASKG GFDRRKLAVD ALLMSGGWTP SVHLFSQSRG KLRFDAANQR FLPDIYVQDS ICVGACNGTD DLSELLTEAY ASGARLAKEA GAEGESGSAP SGANAFAWTG GMIGAAEGAG PDDAVKAFID FQHDVCAKDI RLAVREGMHS IEHIKRFTTN GMASDQGKLS NMHGLAIAAE MLGKEIPQVG LTTFRAPYTP VTFGTLINHS RGELFDPTRK TPMHDLETAL GADFEDVGNW KRAWYYPKPG EDMHAAVNRE CKTVREVAGV FNASTLGKIE VVGPDAAKFL NLIYTNPWDS LKPGKCRYGI MTRDDGFIYD DGVVGRLAED RFHVTTTTGG AARVLNHMED YLQTEFPELN VWLTSASEQW AVIAVQGPKA REIIEPFVEG IDISNEAFPH MSVAEGKFCG VPTRLFRVSF TGEVGFEINV PSDYGASVFE AVWKRAETMG ACLYGTETMH VLRAEKGYII VGQDTDGTLT PDDANYGWAV SKKKTDFVGI RGLKRPDLVQ EGRKQLVGLK TKDPIEVLEE GAQIVANPNQ PKPMTMLGHV TSSYWSENLG QSIAIATVAG GRARMGETLY VPMPDKTIAV EVTDMVFYDK EGSRIHG
|
| |