Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_1347 |
Symbol | |
ID | 4896761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 1395639 |
End bp | 1398620 |
Gene Length | 2982 bp |
Protein Length | 993 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640111934 |
Product | sarcosine oxidase alpha subunit family protein |
Protein accession | YP_001043229 |
Protein GI | 126462115 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0394007 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACGC GTCTCGCACG GGGCGGACGG CTGATCGACC GCAGCCGCGC CATCGACTTC ACCTTCAACG GCAGGCGGAT GCGCGGTTTC GCGGGCGACA CGCTCGCCGC GGCGCTTCTC GCCAACGACC AGATGCTGGT CGGGCGCAGC TTCAAGTATC ACCGTCCGCG CGGCATCGTG GCGGCGGGCG CCGAAGAGCC GAACGCGCTC GTCCAGCTCG GCACCGGCGG CCGGTCGGAG CCGAACCAGC GCACCACCAC GACCGAGCTC TTTGCGGGGC TGACGGCGGC CAGCCAGAAC CACTGGCCGA GCCTCGAATT CGACGTGGGC GCGGTCAATG CGGCGGCGTC GCGCTTCCTG CCCGCAGGCT TCTACTACAA GACCTTCCTG CAGCCGCGCG CCGCGTGGAA ACACCTGTTC GAGCCGGTCA TCCGCCGCTC GGCGGGCCTC GGACGGCCGC CGGAAGCGCC CGATGCGGAC CGCTACGAGC AGGCCTATGC CTTCTGCGAC CTTCTCGTGG TGGGCGGCGG CATCGCGGGG CTTCAGGCGG CGCTCAGCGC CTCGGCCTCG GGCCGGAAGG TGATGCTGCT CGAGCAGACG CCGCACTGGG GCGGACGGGC CCCGGTGGAT GACGTGCTGA TCGACGGGCG CCCGGCGGCG GACTGGGTGG CGGACACGGT GGCAGCGCTC GAGGCCGCGC CCAATGTCAC GCTCCGCACC CGCTGCATGG CGGCCGGCGT CTACGACCAC GGCTATGTGC TGGCCGAGGA GCGCGTGGCC GACCATACGC CGGGCGACGG ACGGCCGAAG AAGCGGCTCT GGCGCATCCG CGCGGGCAAG GTCATCACGG CGACCGGCGC CATCGAGCGG CCGCTGCCCT TCGCGGGCAA CGACATTCCG GGCGTCATGC TCGCCTCGGC GGTGCGCGAC TATCTGGTGA ACTGGGCCGT CTCGCCCGGC GACCGGGTGG TGATCGTCAC GAACAACGAC GACGCCTACC GCACCGCCAT CGCCGTGCAC CGTGCGGGGC TGACCGTGCC GGCCGTGCTT GATGCGCGGG CCGAGGCCCA CGGCGCGCTG CCCGAAGAGG TGCGCAGCCT CGGCATCCCG GTCCTCACCA ACCGCGCGGT GGCCAAGGTC AAGGGCAGCA AGCGGGTGAC GGGCGTCACC GTCTGCGCCC AAGCGGGCGA GGGGGCGGTG CTCGACGATT TCGACTGCGA TGCGGTCGCC ATGTCGGGCG GCTGGTCGCC GGTCGTCCAT CTCTGGAGCC ACTGCGGCGG CAAGCTGATC TGGGACGAGG CGCAGGCGGC CTTCCGGCCC GACCCGGCCC GCCCGCCGAT CACCCATGAC GGCTCGCCGA TGGTGGCGGT TGCGGGCTCG GCCAATGGCG AGATGCTCTC GGCCGACGTG CTGGCCGATG CCCTCCGCGC CGTGGGCGGC GAGGGCGAGG CGCCCCGGGC GCAGAGCCCC GAGGAGGCGC CGACCGAGCC GGTCTGGATC ATGCCGCAGG GCGCCCCGCC GGCGCTGCGC TCGAAGATGT GGCTCGACTA TCAGAACGAC GTGAAAGTGT CGGACGTGCA GCTCGCCGCC CGCGAGGGCT ACGAGTCGGT CGAGCATACC AAGCGCTACA CGACGCTCGG CATGGCCACC GATCAGGGCA AGCTCAGCAA CATCAACGGG CTGGCCGTGC TCGCAGGCTC GCTCAATGCG CCGATCCCCG CGGTGGGCAC CACCACCTTC CGCCCGCCCT ACACGCCCGT CACCCTCGGC GCGCTGGTGG GCGAGGCGCG GGGCGAGATC TTCCAGCCGC TGCGCCGCAC GCCCATGCAC GACTGGCACG AGGCCAACGG CGCCTATTGG GAGCCGGTGG GCCTCTGGCG CCGGCCCTAC TGCTACAGCC GTCCGGGCGA GAGCCATGGC GATGCGGTGG CCCGCGAGGT CACCAACGCG CGCACGAAGC TCGGGCTGCT CGACGCCTCG ACGCTGGGCA AGATCCTCGT GAAGGGGCCC GATGCGGGCC GCTTCCTCGA CATGCTCTAC ACCAACGTCA TGTCGAGCCT GCCGGTCGGG CGCTGCCGCT ATGGCCTCAT GTGCAACGAG AACGGCTTCC TGATGGACGA TGGCGTGGTG GTCCGTCTCT CCGAGGACAG CTGGCTCTGC CACACGACCT CGGGCGGGGC GGACCGGATC CATGCCCATA TGGAGGACTG GCTCCAGTGC GAATGGTGGG ACTGGCAGGT CTATACCGCC AATCTCACCG AGCAGTTCGC GCAGGTGGCC ATCGTCGGCC CCAACGCGCG CCTGCTGCTG GAAAAGCTCG GCGGCATGGA TGTCTCGAAG GAGGCGCTGC CCTTCATGCA CTGGGCGGAA GGCACCCTCG CGGGCATTCC CGCGCGGGTG TTCCGCATCA GCTTCTCAGG CGAGCTCTCC TACGAGGTGG CGGTGCCCGC GGGGCAGGGG CTGGCCTTCT GGCAGGCCTG CCTCGAGGCG GGCGCCGAAT TCGGCCTCAT GCCCTACGGC ACCGAGGCGC TGCATGTGAT GCGGGCCGAG AAGGGCTTCA TCATGATCGG CGACGAGACC GACGGTACGG TGGTGCCGCA GGACCTGAAC CTCGGCTGGG CGATCTCGAA GAAGAAGGCG GACTTCATCG GCAAGCGCGG CATGGAGCGG ACCTTCCTGT CGAGCCCCGA CCGCTGGAAG CTCGTGGGTC TCGAGACGCT CGACGGCTCG GTCCTGCCCG ATGGCGCCAT CGCGCCCGCG GCGGGCTCGA ATGGGAACGG GCAGCGCAAC ACCCAGGGCC GCGTGACCTC GACCTACTGG TCGCCCACGC TGAAGAAGGG GATCGCCATG GGCCTCGTCC ATCGCGGGAC GGACCGGATG GGCGAGGTGA TCGAGTTCCC GAAGATCTGG GGCGGCGTGG TGCAGGCGCG GATCGTCGAT CCGGTGTTCT ACGACAAGGC GGGAGAGAAG CAGGATGTCT GA
|
Protein sequence | MSTRLARGGR LIDRSRAIDF TFNGRRMRGF AGDTLAAALL ANDQMLVGRS FKYHRPRGIV AAGAEEPNAL VQLGTGGRSE PNQRTTTTEL FAGLTAASQN HWPSLEFDVG AVNAAASRFL PAGFYYKTFL QPRAAWKHLF EPVIRRSAGL GRPPEAPDAD RYEQAYAFCD LLVVGGGIAG LQAALSASAS GRKVMLLEQT PHWGGRAPVD DVLIDGRPAA DWVADTVAAL EAAPNVTLRT RCMAAGVYDH GYVLAEERVA DHTPGDGRPK KRLWRIRAGK VITATGAIER PLPFAGNDIP GVMLASAVRD YLVNWAVSPG DRVVIVTNND DAYRTAIAVH RAGLTVPAVL DARAEAHGAL PEEVRSLGIP VLTNRAVAKV KGSKRVTGVT VCAQAGEGAV LDDFDCDAVA MSGGWSPVVH LWSHCGGKLI WDEAQAAFRP DPARPPITHD GSPMVAVAGS ANGEMLSADV LADALRAVGG EGEAPRAQSP EEAPTEPVWI MPQGAPPALR SKMWLDYQND VKVSDVQLAA REGYESVEHT KRYTTLGMAT DQGKLSNING LAVLAGSLNA PIPAVGTTTF RPPYTPVTLG ALVGEARGEI FQPLRRTPMH DWHEANGAYW EPVGLWRRPY CYSRPGESHG DAVAREVTNA RTKLGLLDAS TLGKILVKGP DAGRFLDMLY TNVMSSLPVG RCRYGLMCNE NGFLMDDGVV VRLSEDSWLC HTTSGGADRI HAHMEDWLQC EWWDWQVYTA NLTEQFAQVA IVGPNARLLL EKLGGMDVSK EALPFMHWAE GTLAGIPARV FRISFSGELS YEVAVPAGQG LAFWQACLEA GAEFGLMPYG TEALHVMRAE KGFIMIGDET DGTVVPQDLN LGWAISKKKA DFIGKRGMER TFLSSPDRWK LVGLETLDGS VLPDGAIAPA AGSNGNGQRN TQGRVTSTYW SPTLKKGIAM GLVHRGTDRM GEVIEFPKIW GGVVQARIVD PVFYDKAGEK QDV
|
| |