Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_1874 |
Symbol | |
ID | 5084299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | - |
Start bp | 1923262 |
End bp | 1926243 |
Gene Length | 2982 bp |
Protein Length | 993 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640483435 |
Product | sarcosine oxidase alpha subunit family protein |
Protein accession | YP_001168070 |
Protein GI | 146277911 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACGC GTCTGGCCCG GGGCGGGCGG CTGATCGACC GCAATCACCC TCTGGGTTTC ACCTTCAACG GCAAGCGGAT GCGCGGCTTT GCCGGGGATA CGCTTGCGGC GGCCCTGCTG GCCAACGACC AGATGCTGGT CGGGCGCAGC TTCAAGTATC ACCGCCCGCG CGGCATCGTG GCGGCGGGCG CCGAAGAGCC GAACGCGCTT GTCCAGCTTG GCACCGGCGG CCGGTCGGAA CCCAACCAGC GCACCACCAC GACCGAGCTG TTCGCGGGCC TCTCGGCCGC GAGCCAGAAC CACTGGCCGA GCCTCGAGTT CGACGTGGGC GCGGTGAACG CGGCGGCGGG GCGCTTCCTG CCCGCGGGCT TCTACTACAA GACCTTCCTT CAGCCCCGGC TGGCGTGGAA GCACCTGTTC GAGCCGGTGA TCCGCCGCTC GGCGGGCCTC GGGCGGCCGC CGGAAGAGCC CGATGCCGAC CGTTACGAGC AGGCCTACGC CTTCTGCGAC CTCCTCGTGG TGGGCGGCGG CATCGCGGGG TTGCAGGCGG CGCTGAGTGC CTCGGCCTCG GGGCAAAAGG TGATGCTGCT CGAGCAGACG CCGCACTGGG GCGGGCGCGC CCCGGTCGAT GACGTGCTGA TCGAGGGCCG GCCGGTGGCG GACTGGGTGG CGGCCACGGT GGCTTCGCTC GAGGCCGCGC CGAACGTCAC GCTGCGCACC CGCTGCATGG CGGCCGGGGT GCACGACCAC GGCTATGTGC TGGCCGAAGA GCGGGTGGCC GATCATACGC CGGGCGACGG GCGGCCGAAG AAGCGGCTCT GGCGCATCCG CGCGGGCAAG GTGGTGACGG CGACGGGCGC CATCGAACGG CCGCTGCCCT TCGCGGGCAA CGACATTCCG GGCGTGATGC TCGCCTCGGC GGTGCGCGAC TATCTGGTGA ACTGGGCCGT CTCGCCCGGC GACCGGGTGG TGATCGTGAC GAACAACGAC GACGCCTACC GCACCGCCAT CGCCGTCCAC CGCGCGGGGC TGACGGTGCC GGCCGTGCTC GATGCGCGGG CCGAGGCCGA CGGCGCGCTG CCCGAAGAGG TGCGCAGCCT CGGCATCCCG GTCCTGACCA ACCGCGCGGT GGCGAAGGTC AAGGGCGGCA AGCGCGTGAC CGGCGTCGCC GTCTGCGCCC AGGCGGGCGA GGGGGCGGTG CTCGACGAGT TTGCCTGCGA TGCGGTCGCC ATGTCGGGCG GCTGGTCGCC GGTCGTCCAT CTCTGGAGCC ACTGCGGCGG CAAGCTGATC TGGGACGAGG CGCAGGCGGC CTTCCGTCCC GATCCTGCCC GCCCGCCGAT CACCCACGAC GGCTCGGCGA TGGTGGCGGC CGCGGGTTCG GCCAATGGTG AGCTGCTCTC GGCCGATGTG CTGGCCGATG CCATCCGCGC CGTGGGCGGC GAGGGCCCGG CCCCCCGGGC GCAGAGCCCC GAGGAGGCTC CGACCGAGCC GGTCTGGATC ATGCCGCAGG GGGCCACGCC CGCGCTGCGC TCGAAGATGT GGCTCGACTA CCAGAACGAC GTGAAGGTGT CGGACGTGCA GCTTGCCGCC CGCGAGGGCT ACGAGTCGGT CGAGCATACC AAGCGCTACA CGACGCTCGG CATGGCCACC GATCAGGGCA AGCTCAGCAA CATCAACGGG CTTGCGGTGC TGGCGGGATC GCTCAACGCG CCGATCCCCG CGGTTGGCAC CACCACCTTC CGCCCGCCCT ACACGCCCGT CACCTTCGGC GCGCTGGTGG GCGAGGCTCG GGGCGAGATC TTCCAGCCGC TGCGCCGCAC GCCGATGCAC GACTGGCACG AGGCCCATGG CGCCTACTGG GAGCCGGTGG GCCTCTGGCG TCGGCCCTAC TGCTACAGCC GTCCCGGCGA GAGCCACGGC GACGCGGTGG CCCGCGAGGT CACCAACGCG CGCACCAAGC TCGGGCTGCT CGACGCCTCG ACGCTGGGCA AGATCCTCGT GAAGGGGCCC GATGCGGGCC GCTTCCTCGA CATGCTCTAC ACCAACGTCA TGTCGAGCCT GCCCGTGGGC CGCTGCCGCT ACGGCCTCAT GTGCAACGAG AACGGCTTCC TGATGGATGA CGGGGTCGTG GCGCGGATCT CCGAGGACAG CTGGCTCTGC CACACGACCT CGGGCGGGGC CGACCGGATC CACGCCCACA TGGAGGATTG GCTCCAGTGC GAATGGTGGG ACTGGCAGGT CCATACCGCC AACCTGACCG AGCAGTTCGC GCAGGTGGCC ATCGTCGGCC CCAACGCGCG CAGGCTGCTG GAAAAGCTCG GCGGGATGGA CGTCTCGAAG GAGGCGCTGC CCTTCATGCA CTGGGCGGAA GGCACGATCG CGGGCATCCC CGCGCGCGTG TTCCGCATCA GCTTCTCGGG TGAGCTGTCC TACGAGGTGG CGGTTCCTGC GGGGCAGGGG CTGGCCTTCT GGCAGGCCTG CCACGAGGCG GGGGCCGAGT TCGGCGCCAT GCCCTACGGC ACCGAGGCGC TGCATGTGAT GCGGGCCGAG AAGGGCTTCA TCATGATCGG CGACGAGACC GACGGGACGG TGATCCCGCA GGACCTGAAC CTCGGCTGGG CCATCTCGAA GAAGAAGGCC GACTTCATCG GCAAGCGCGG GATGGAGCGG GCCTTCCTCG CCAGCCCCGA CCGCTGGAAG CTCGTGGGGC TCGAGACGCT CGACGGCTCG GTGCTGCCGG ATGGCGCCAT CGCGCCCGCG CCCGGCTCGA ACGCGAATGG CCAGCGCAAC ACGCAAGGCC GCGTGACCTC GACCTACTGG TCGCCGACGC TGAAGAAGGG GATCGCCATG GGCCTCGTCC ATCGTGGCCC CGAGCGGATG GGCGAGGTGA TCGAGTTCCC GAAGATCTGG GGCGGCGTGG TGCAGGCGCG GATCGTGGAT CCGGTGTTCT ACGACAAGGC GGGAGAGAAG CAGGATGTCT GA
|
Protein sequence | MSTRLARGGR LIDRNHPLGF TFNGKRMRGF AGDTLAAALL ANDQMLVGRS FKYHRPRGIV AAGAEEPNAL VQLGTGGRSE PNQRTTTTEL FAGLSAASQN HWPSLEFDVG AVNAAAGRFL PAGFYYKTFL QPRLAWKHLF EPVIRRSAGL GRPPEEPDAD RYEQAYAFCD LLVVGGGIAG LQAALSASAS GQKVMLLEQT PHWGGRAPVD DVLIEGRPVA DWVAATVASL EAAPNVTLRT RCMAAGVHDH GYVLAEERVA DHTPGDGRPK KRLWRIRAGK VVTATGAIER PLPFAGNDIP GVMLASAVRD YLVNWAVSPG DRVVIVTNND DAYRTAIAVH RAGLTVPAVL DARAEADGAL PEEVRSLGIP VLTNRAVAKV KGGKRVTGVA VCAQAGEGAV LDEFACDAVA MSGGWSPVVH LWSHCGGKLI WDEAQAAFRP DPARPPITHD GSAMVAAAGS ANGELLSADV LADAIRAVGG EGPAPRAQSP EEAPTEPVWI MPQGATPALR SKMWLDYQND VKVSDVQLAA REGYESVEHT KRYTTLGMAT DQGKLSNING LAVLAGSLNA PIPAVGTTTF RPPYTPVTFG ALVGEARGEI FQPLRRTPMH DWHEAHGAYW EPVGLWRRPY CYSRPGESHG DAVAREVTNA RTKLGLLDAS TLGKILVKGP DAGRFLDMLY TNVMSSLPVG RCRYGLMCNE NGFLMDDGVV ARISEDSWLC HTTSGGADRI HAHMEDWLQC EWWDWQVHTA NLTEQFAQVA IVGPNARRLL EKLGGMDVSK EALPFMHWAE GTIAGIPARV FRISFSGELS YEVAVPAGQG LAFWQACHEA GAEFGAMPYG TEALHVMRAE KGFIMIGDET DGTVIPQDLN LGWAISKKKA DFIGKRGMER AFLASPDRWK LVGLETLDGS VLPDGAIAPA PGSNANGQRN TQGRVTSTYW SPTLKKGIAM GLVHRGPERM GEVIEFPKIW GGVVQARIVD PVFYDKAGEK QDV
|
| |