Gene Information |
Locus tag | Dshi_2264 |
Symbol | |
ID | 5713917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2386296 |
End bp | 2389238 |
Gene Length | 2943 bp |
Protein Length | 980 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641268186 |
Product | putative sarcosine oxidase |
Protein accession | YP_001533601 |
Protein GI | 159044807 |
COG category | [E] Amino acid transport and metabolism; [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase); [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
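The length fields in the table above are consistent with the listed coordinates. The sketch below reproduces that arithmetic. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(2386296, 2389238)
print(length)                     # 2943
print(protein_length_aa(length))  # 980
```

Under those assumptions, the result matches the Gene Length (2943 bp) and Protein Length (980 aa) fields of this record.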
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.710125 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCTCG ATGGTGCGGG CCTGATCGAC CGGGACACGC CGCTGCGCTT TCAGTTCGAC GACCGTTGGA TGGGCGGGTT TGCGGGGGAC ACGCTGGCCT CGGCTTTGAT GGCGAACGGG GTAAAGCTGG TCGCTCGGTC CTTCAAGTAC CACCGCCCCC GGGGCATCTG GTCGGCGGGT TCCGAAGAGC CCAACGCTTT GGTGCAGGTG GGCAAATTCG CGCGCCAGGT GCCGAATGTG CGTGCGACGC AGGCCGAACT TTTTGAGGGT CTCTACGCCA CCAGCCAGAA CCGGTGGCCG TCGCTGACCC TCGACCTGCT GTCGTTGAAT GATCTGGCGG CGCCCTTCCT CGGGGCGGGG TTCTATTACA AGACCTTCAT GTGGCCGCGC AGCTTCTGGG AAAAGCTCTA TGAGCCCGCA ATTCGGCGGG CGGCAGGGTT GGGTGCCCTG TCGGGTCAGA CCAACACCGA GCCCTACGAG AAGGCCTATG CCCATTGCGA CCTGCTGGTG ATCGGCGCCG GGCCTGCGGG GCTGATGGCG GCCCTGACCG CGGCCCGGGC GGGTGCGGAT GTGATCTTGG CAGAGGAGGG TGACCGTTGT GGCGGGCGGC TCCTTTCCGA GCAGGAGGAA GTCGACGGGG CCTCAGGGCA GGATTGGGTG AGCCGGATAG TGGCGGAGTT GGTGACGAGT GGGCGGGTCC GGGTCATGAC CCGCACCACG GTGACCGGGG CCTATGACGG CGGCATTTAC GGAGCGCTGG AGCGGGTGGG CCTGCACCTG GCCGACCCGG CGCCCGCCCT GCCGCGGGAG TGTTTCTGGC GCATCTCGGC GCGGCGCGCG ATCCTGTGCG CCGGCGCGTC CGAACGGCCC ATCGCCTTTC CGATGAACGA CCGTCCCGGC ATCCTGTCAG CCGCAGCCCT GCGAACCTAC GCCAACCGTT ATCGCACTGT GCCGGGGCCG CGCGTGGCGC TCTTTTGCAA CAATGACGAT GCGTACCGCA CGGCTCGGGA CCTTGTCGCC AGCGGGGTCG AGGTGGCTGC CGTGGTGGAT ACCCGCGCCG AGACGCCCGA GGGCGACTAC CCGGTTTTCC GGGAGGCACG CGTGGTTGAC ACCAGGGGCC GCCATGGCCT GCGCCGGATC ACCGTCGATG GCCCGGACGG ACGGCGGCAG ATCGAGGTGG ACGCCCTTGG CGTCTCTGGC GGCTGGAACC CGAATCTGCA TCTGACCTGC CACATGTACG GGCGCCCCAC CTGGAACCCT GCGCTGCATG CTTTCGTGCC TACGCCTGGG GCGGTGCCAG GTCTCGATGT GGCCGGGGCG GCCGCGGGGG TGTTTTCCAC CCATGGAGCC TTGAAATCGG GAGTGTTCCG GGCGCGCAAG GCGTTGGAGT CGCTGGAGCT GGCACCGTCC TCCATGCCCA TCCCGCAGGC AGATGACGCG CCTTACGTCC TCTCGCCGAT CTATGCGGTC GACGGTGCCA AGGGCGCACG GGGCCGAGCC TGGCTCGACT TCCAGAACGA TGTGACCGTC AAGGATGTCC GACAGGCCGC GTCCGAGAAC ATGACCAGCG TCGAGCACAT GAAGCGTTAC ACCACCCAAG GCATGGCCAC GGATCAGGGC AGGGTGTCGA ATGTGCTGGC GCTTTCGGTG CTCGCCGATG CCACGGGGCG GGGCATCCCA GAGACCGGGA CAACCACCTT CCGCCCGCCC TATACGCCCG TTTCCCTCGC CGCGCTGGGC GCAGGGGCGC AGGGCAAGGG GTTCGCGCCG GAGCGGCTGA CCACGTCTCA TGCAGGCAGC 
CTCGCGCGCG GTGCACCCAT GGTCGAAGCC GGTCTGTGGT ATCGCCCGAG CTACTTCCCC AGAGACGGCG AGACGACCTG GCGCGAAGCT TGCGACCGGG AGGTCGGCCA TGTTCGCAGT GCGGTCGGGG TGTGCGATGT CTCCACCCTC GGCAAGATCG AGTTGCAGGG CCCCGATGCG GGGGCGTTTC TCGACCTGGT CTATGCCAAC ACCTTCTCGA CCCTGAAACC GGGGCGCGTG CGCTACGGGG TGATGCTGCG CGAGGACGGG CATGTGATGG ATGACGGCAC GACAGCCTGT CTCGGGCCGG GGCACTATGT GATGACAACA ACAACCGTCG CAGCGCACAA GGTCATGCAG CATCTTGAAT TCGTGCAGCA ATGCCTGGTG CCGACGATGC GGGTGCGCTT TACCTCCGTC ACCGAGAATT GGGCGCAATT CGCCGTGGCG GGGCCGCAAG CCCGCGCGTT GCTCAACGAT CTGCTGGACG CGCCGGTGAG GAATGCCAAC TTTCCGTTCA TGGCCTGTGG GGCGGTGCGG TTGGGGGGTG TGGGTGGTCG TTTGTTCCGC ATCTCGTTCT CGGGAGAACA CGCCTATGAA GTCGCTATCC CGGCCCGCTA CGGTGCTGCG CTGTTCGAGC AACTGGTTGC CCGGGCCGAG GCCATGGGCG GCGGAGCCTA CGGGATGGAG GCGCTCAACG TGCTGCGGAT CGAAAAGGGG TTCGTGACCC ATGCGGAGAT CCACGGCCGC GTCACCGCGG ACGATGTCGG GCTGGGGCGG ATGGTGTCAG CCAAGAAGGA CTGCATCGGC AAGACCGCCA GTCAGCGCCC GGGCCTGACC GACCCAGACC GGGAGCAACT TGTGGGCCTG CGGCCCTGCG ACGGGCGGAC CGCTTTGTTG GCCGGGGCGC ACCTGTTCGA GCCCGGGGCA GAGGCGGTGC GGGTCACCGA CCAGGGCTAC GTGACTAGCG TGTGTTTCTC ACCGACGCTA CAGACGCCTA TTGGGCTCGG CTTCCTGCGC CGAGGTCGGG CACGGCATGG GCAGCGGGTG ATGATGATCG ATCACTTGCG GGGGGTGACC ACTGAGTGTG AGGTGACGGA TCCCGTGTTC TTTGATCCCG ACGGAGGGCG CGCCCGTGGC TGA
|
Protein sequence | MRLDGAGLID RDTPLRFQFD DRWMGGFAGD TLASALMANG VKLVARSFKY HRPRGIWSAG SEEPNALVQV GKFARQVPNV RATQAELFEG LYATSQNRWP SLTLDLLSLN DLAAPFLGAG FYYKTFMWPR SFWEKLYEPA IRRAAGLGAL SGQTNTEPYE KAYAHCDLLV IGAGPAGLMA ALTAARAGAD VILAEEGDRC GGRLLSEQEE VDGASGQDWV SRIVAELVTS GRVRVMTRTT VTGAYDGGIY GALERVGLHL ADPAPALPRE CFWRISARRA ILCAGASERP IAFPMNDRPG ILSAAALRTY ANRYRTVPGP RVALFCNNDD AYRTARDLVA SGVEVAAVVD TRAETPEGDY PVFREARVVD TRGRHGLRRI TVDGPDGRRQ IEVDALGVSG GWNPNLHLTC HMYGRPTWNP ALHAFVPTPG AVPGLDVAGA AAGVFSTHGA LKSGVFRARK ALESLELAPS SMPIPQADDA PYVLSPIYAV DGAKGARGRA WLDFQNDVTV KDVRQAASEN MTSVEHMKRY TTQGMATDQG RVSNVLALSV LADATGRGIP ETGTTTFRPP YTPVSLAALG AGAQGKGFAP ERLTTSHAGS LARGAPMVEA GLWYRPSYFP RDGETTWREA CDREVGHVRS AVGVCDVSTL GKIELQGPDA GAFLDLVYAN TFSTLKPGRV RYGVMLREDG HVMDDGTTAC LGPGHYVMTT TTVAAHKVMQ HLEFVQQCLV PTMRVRFTSV TENWAQFAVA GPQARALLND LLDAPVRNAN FPFMACGAVR LGGVGGRLFR ISFSGEHAYE VAIPARYGAA LFEQLVARAE AMGGGAYGME ALNVLRIEKG FVTHAEIHGR VTADDVGLGR MVSAKKDCIG KTASQRPGLT DPDREQLVGL RPCDGRTALL AGAHLFEPGA EAVRVTDQGY VTSVCFSPTL QTPIGLGFLR RGRARHGQRV MMIDHLRGVT TECEVTDPVF FDPDGGRARG
|
| |
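The sequence fields can be checked against the GC content and protein sequence reported in this record. The following is a minimal sketch, not the method the database used. It assumes the gene sequence is pasted in as text with the 10-base blocks separated by whitespace, and that it is already in coding orientation (the listed sequence begins with ATG and ends with TGA even though the strand is "-"). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so a standard codon table is used here. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
prefix = "ATGAGGCTCG ATGGTGCGGG CCTGATCGAC"
print(translate(prefix))  # MRLDGAGLID
```

The translated prefix agrees with the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein Length fields.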