Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_1344 |
Symbol | |
ID | 4896771 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 1393578 |
End bp | 1394825 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640111931 |
Product | sarcosine oxidase beta subunit family protein |
Protein accession | YP_001043226 |
Protein GI | 126462112 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0947006 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.025984 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGCT ATTCGGTCTT CGCCATCGCC CGCGAGGCGC TGCGCCATCA TACGGGGTGG GAGCGCGCCT GGGCCTCGCC CGAACCCAAG GCGTCCTATG ACGTGATCGT GATCGGCGCC GGCGGCCACG GCCTCGCCAC GGCCTATTAC CTCGGGAAGA ATTTCGGGAT CACCAATGTT GCGGTGATCG AGAAGGGCTG GCTCGGCGGC GGCAATACCG GGCGCAACAC GACGATCATC CGCTCGAACT ACCTGCAGGA TCCGTCGGCC GCCATCTACG AGAAGGCGCG CAGTCTCTAC GAGACCCTGA GCCAGGATCT GAACTACAAC GTGATGTTCA GCCCGCGCGG GCTCCTCATG CTGGCGCAGA CCCATCACGA GGTGCGCGGC TACATGCGCA CGGTGCATGC CAACCTTCTG CAGGGCGTCG AGACCGAATG GATCGGCCCC GAGCAGGTCA AGCGTCTGGT CCCGATCATG AACATCCACG GGCCGCGCTA TCCGGTGCTG GGGGCGCTCC TTCAGAAGCG GGGCGGCACG GCGCGCCACG ATGCGGTGGC CTGGGGCTAT GCGCGGGCCT GTTCGGCGAT GGGCATGGAC ATCATCCAGC AGTGCGAGGT GAAGGGCGTC CGCTCCGAGG GCGGCGTCGT CACCGGGGTC GAGACGACGA AGGGCTTCAT CGGCACGAAG AAGCTCGCCA TCGTGGTGGC GGGACATTCG GGGCAGGTGG CCGAGATGGC AGGCTTCCGG CTGCCGGTCG AGGCGGTGGC GCTGCAGGCG CTGGTCTCCG AGCCGGTCAA GCCCTGCATC GACGTGGTGG TGATGGCCAA CACGGTGCAC GGCTACATGA GCCAGTCCGA CAAGGGCGAG CTGGTGATCG GCGGCGGCAC CGACAGTTTC AACAACTTCA CCCAGCGCGG CAGCTTCCAT CATATCGAGG AGACGCTGCG CGCGCTGGTG GAGACCTTCC CGATCATCTC GCGCCTCAAG ATGCTGCGGC AGTGGGGCGG GATCGTGGAC ATGACCGGCG ACCGTTCGCC GATCCTCTCG AAGACGCCGC TGGGGAACTG CTTCATCAAC TGCGGCTGGG GCACGGGGGG CTTCAAGGCC ATCCCCGGCT CCGGCTGGGC AATGGCGGAG CTGGTGGCCA AGGGCGAACC GGGCGCGCTT GCCGCGGATT TCGGCATGGA CCGTTTCCGC GAGGGCCGTT TCATCGACGA ATCGGTCGCG GCGGGGGTCG CGCACTGA
|
Protein sequence | MKRYSVFAIA REALRHHTGW ERAWASPEPK ASYDVIVIGA GGHGLATAYY LGKNFGITNV AVIEKGWLGG GNTGRNTTII RSNYLQDPSA AIYEKARSLY ETLSQDLNYN VMFSPRGLLM LAQTHHEVRG YMRTVHANLL QGVETEWIGP EQVKRLVPIM NIHGPRYPVL GALLQKRGGT ARHDAVAWGY ARACSAMGMD IIQQCEVKGV RSEGGVVTGV ETTKGFIGTK KLAIVVAGHS GQVAEMAGFR LPVEAVALQA LVSEPVKPCI DVVVMANTVH GYMSQSDKGE LVIGGGTDSF NNFTQRGSFH HIEETLRALV ETFPIISRLK MLRQWGGIVD MTGDRSPILS KTPLGNCFIN CGWGTGGFKA IPGSGWAMAE LVAKGEPGAL AADFGMDRFR EGRFIDESVA AGVAH
|
| |