Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_2689 |
Symbol | soxA |
ID | 3720380 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | + |
Start bp | 1333021 |
End bp | 1336002 |
Gene Length | 2982 bp |
Protein Length | 993 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640070865 |
Product | putative sarcosine oxidase, alpha subunit |
Protein accession | YP_352745 |
Protein GI | 77463241 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACGC GTCTCGCACG GGGCGGACGG CTGATCGACC GCAGCCGCGC CATCGACTTC ACCTTCAACG GCAAGCGGAT GCGCGGCTTC GCGGGCGACA CGCTGGCCGC GGCGCTTCTC GCCAACGACC AGATGCTGGT CGGGCGCAGC TTCAAGTATC ACCGTCCGCG CGGCATCGTG GCGGCGGGCG CCGAAGAGCC GAACGCGCTG GTCCAGCTCG GCACCGGCGG CCGGTCGGAG CCGAACCAGC GCACCACCAC GACCGAGCTT TTCGCGGGGC TGACGGCGGC CAGCCAGAAC CACTGGCCGA GCCTCGAATT CGACGTGGGC GCGGTCAATG CGGCAGCGTC GCGCTTCCTG CCCGCAGGCT TCTACTACAA GACCTTCCTG CAGCCGCGCG CCGCGTGGAA ACACCTGTTC GAGCCGGTCA TCCGCCGCTC GGCGGGCCTC GGACGGCCGC CGGAAGAACC CGATGCGGAC CGCTACGAGC AGGCCTATGC CTTCTGCGAC CTCCTTGTGG TGGGCGGAGG CATCGCGGGG CTTCAGGCAG CGGTCAGCGC CTCGGCCTCG GGCCGGAAGG TGATGCTCCT CGAGCAGACG CCGCACTGGG GCGGACGCGC CCCTGTCGAT GACGTGCTGA TCGACGGGCG CCCGGCGGCG GACTGGGTGG CGGACACGGT GGCGGCACTC GAGGCCGCGC CCAATGTCAC GCTCCGCACC CGCTGCATGG CGGCCGGCGT CTACGACCAC GGCTATGTGC TGGCCGAGGA GCGCGTGGCC GATCATACGC CGGGCGACGG ACGGCCGAAG AAGCGCCTCT GGCGCATCCG CGCGGGCAAG GTCCTCACGG CGACGGGAGC CATCGAGCGT CCGCTGCCCT TCGCCGGCAA CGATATTCCG GGCGTCATGC TCGCCTCGGC GGTGCGCGAC TATCTGGTGA ACTGGGCCGT CTCTCCCGGC GACCGGGTGG TGATCGTCAC GAACAACGAC GACGCCTACC GCACCGGCAT CGCCGTTCAC CGCGCGGGGC TGACCGTGCC GGCCGTGCTC GATGCGCGGG CCGAGGCCCA CGGCGCGCTG CCCGAAGAGG TGCGCAGCCT CGGCATCCCG GTCCTCACCA ATCGCGCGGT GGCCAAGGTC AAGGGCGGCA AGCGCGTGAC CGGCGTCACC GTCTGCGCGC AGGCGGGCGA GGGGGCGGTG CTCGACGATT TCGACTGCGA TGCGGTCGCC ATGTCGGGCG GCTGGTCGCC GGTCGTCCAT CTCTGGAGCC ACTGCGGCGG CAAGCTGATC TGGGACGAGG CGCAGGCGGC CTTCCGGCCC GACCCGGCCC GCCCGCCGAT CACCCATGAC GGCTCGCCGA TGGTGGCGGT TGCGGGCTCG GCCAATGGCG AGCTGCTCTC GGCCGATGTG CTGGCCGATG CCCTCCGCGC CGCGGGCGGC GAGGGCGAGG CGCCCCGGGC GCAGAGCCCC GAGGAGGCGC CGACCGAGCC GGTCTGGATC ATGCCGCAGG GCGCCCCGCC GGCGCTCCGC TCGAAGATGT GGCTCGACTA TCAGAACGAC GTGAAAGTGT CGGACGTGCA GCTCGCCGCC CGCGAGGGCT ACGAGTCGGT CGAGCACACC AAGCGCTATA CGACGCTCGG CATGGCCACC GATCAGGGCA AGCTCAGCAA CATCAACGGG CTGGCCGTGC TCGCAGGCTC GCTCAATGCG CCGATCCCCG CGGTGGGCAC CACCACCTTC CGCCCGCCCT ACACGCCCGT CACCCTCGGC GCGCTGGTGG GCGAGGCGCG GGGCGAGATC TTCCAGCCGC TGCGCCGTAC GCCGATGCAC GACTGGCACG AGGCCCATGG CGCCTACTGG GAGCCGGTGG GCCTCTGGCG CCGGCCCTAC TGCTACAGCC GTCCGGGCGA GAGCCATGGC GATGCGGTGG CCCGCGAGGT CACCAACGCG CGCACGAAGC TCGGTCTCCT CGACGCCTCG ACGCTGGGCA AGATCCTCGT GAAGGGGCCC GATGCGGGCC GCTTCCTCGA CATGCTCTAC ACCAACGTCA TGTCGAGCCT GCCGGTCGGG CGCTGCCGCT ACGGCCTCAT GTGCAACGAG AACGGCTTCC TGATGGACGA TGGCGTGGTG GTCCGTCTCT CCGAGGACAG CTGGCTCTGC CACACGACCT CGGGCGGGGC GGACCGGATC CATGCCCATA TGGAGGACTG GCTCCAGTGC GAATGGTGGG ACTGGCAGGT CTATACCGCC AATCTCACCG AGCAGTTCGC GCAGGTGGCC ATCGTCGGCC CCAACGCGCG CCTGCTGCTG GAAAAGCTCG GCGGCATGGA TGTCTCGAAG GAGGCGCTGC CCTTCATGCA CTGGGCGGAA GGCACCATCG CGGGCATTCC CGCGCGGGTG TTCCGCATCA GCTTCTCGGG CGAGCTCTCC TACGAGGTGG CGGTGCCCGC GGGGCAGGGG CTGGCCTTCT GGCAGGCCTG CCTCGAGGCG GGCGCCGAAT TCGGCCTCAT GCCCTACGGC ACCGAGGCGC TGCATGTGAT GCGGGCCGAG AAGGGCTTCA TCATGATCGG CGACGAGACC GACGGCACGG TGGTGCCGCA GGACCTGAAC CTCGGCTGGG CGATCTCGAA GAAGAAGGCG GATTTCATCG GCAAGCGCGG CATGGAGCGA ACCTTCCTGT CGAGCCCCGA CCGCTGGAAG CTCGTGGGTC TCGAGACGCT CGACGGCTCG GTCCTGCCCG ACGGCGCCAT CGCGCCCGCG GCGGGCTCGA ATGGGAACGG GCAGCGCAAC ACTCAGGGCC GCGTGACCTC GACCTACTGG TCGCCCACGC TGAAGAAGGG CATCGCCATG GGCCTCGTCC ATCGCGGGAC GGACCGGATG GGCGAGGTGA TCGAGTTCCC GAAGATCTGG GGTGGCGTGG TGCAGGCGCG GATCGTCGAT CCGGTGTTCT ACGACAAGGC GGGAGAGAAG CAGGATGTCT GA
|
Protein sequence | MSTRLARGGR LIDRSRAIDF TFNGKRMRGF AGDTLAAALL ANDQMLVGRS FKYHRPRGIV AAGAEEPNAL VQLGTGGRSE PNQRTTTTEL FAGLTAASQN HWPSLEFDVG AVNAAASRFL PAGFYYKTFL QPRAAWKHLF EPVIRRSAGL GRPPEEPDAD RYEQAYAFCD LLVVGGGIAG LQAAVSASAS GRKVMLLEQT PHWGGRAPVD DVLIDGRPAA DWVADTVAAL EAAPNVTLRT RCMAAGVYDH GYVLAEERVA DHTPGDGRPK KRLWRIRAGK VLTATGAIER PLPFAGNDIP GVMLASAVRD YLVNWAVSPG DRVVIVTNND DAYRTGIAVH RAGLTVPAVL DARAEAHGAL PEEVRSLGIP VLTNRAVAKV KGGKRVTGVT VCAQAGEGAV LDDFDCDAVA MSGGWSPVVH LWSHCGGKLI WDEAQAAFRP DPARPPITHD GSPMVAVAGS ANGELLSADV LADALRAAGG EGEAPRAQSP EEAPTEPVWI MPQGAPPALR SKMWLDYQND VKVSDVQLAA REGYESVEHT KRYTTLGMAT DQGKLSNING LAVLAGSLNA PIPAVGTTTF RPPYTPVTLG ALVGEARGEI FQPLRRTPMH DWHEAHGAYW EPVGLWRRPY CYSRPGESHG DAVAREVTNA RTKLGLLDAS TLGKILVKGP DAGRFLDMLY TNVMSSLPVG RCRYGLMCNE NGFLMDDGVV VRLSEDSWLC HTTSGGADRI HAHMEDWLQC EWWDWQVYTA NLTEQFAQVA IVGPNARLLL EKLGGMDVSK EALPFMHWAE GTIAGIPARV FRISFSGELS YEVAVPAGQG LAFWQACLEA GAEFGLMPYG TEALHVMRAE KGFIMIGDET DGTVVPQDLN LGWAISKKKA DFIGKRGMER TFLSSPDRWK LVGLETLDGS VLPDGAIAPA AGSNGNGQRN TQGRVTSTYW SPTLKKGIAM GLVHRGTDRM GEVIEFPKIW GGVVQARIVD PVFYDKAGEK QDV
|
| |