Gene Dshi_2264 details

Gene Information

Locus tag: Dshi_2264
Symbol:
ID: 5713917
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 2386296
End bp: 2389238
Gene Length: 2943 bp
Protein Length: 980 aa
Translation table: 11
GC content: 67%
IMG OID: 641268186
Product: putative sarcosine oxidase
Protein accession: YP_001533601
Protein GI: 159044807
COG category: [E] Amino acid transport and metabolism
    [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0404] Glycine cleavage system T protein (aminomethyltransferase)
    [COG0492] Thioredoxin reductase
TIGRFAM ID: [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form
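
The coordinate and length fields above can be cross-checked against each other. The sketch below is only a consistency check, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are illustrative.

```python
# Values copied from the record above.
START_BP = 2386296
END_BP = 2389238


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under those assumptions the results agree with the Gene Length (2943 bp) and Protein Length (980 aa) fields.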


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.710125
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCTCG ATGGTGCGGG CCTGATCGAC CGGGACACGC CGCTGCGCTT TCAGTTCGAC 
GACCGTTGGA TGGGCGGGTT TGCGGGGGAC ACGCTGGCCT CGGCTTTGAT GGCGAACGGG
GTAAAGCTGG TCGCTCGGTC CTTCAAGTAC CACCGCCCCC GGGGCATCTG GTCGGCGGGT
TCCGAAGAGC CCAACGCTTT GGTGCAGGTG GGCAAATTCG CGCGCCAGGT GCCGAATGTG
CGTGCGACGC AGGCCGAACT TTTTGAGGGT CTCTACGCCA CCAGCCAGAA CCGGTGGCCG
TCGCTGACCC TCGACCTGCT GTCGTTGAAT GATCTGGCGG CGCCCTTCCT CGGGGCGGGG
TTCTATTACA AGACCTTCAT GTGGCCGCGC AGCTTCTGGG AAAAGCTCTA TGAGCCCGCA
ATTCGGCGGG CGGCAGGGTT GGGTGCCCTG TCGGGTCAGA CCAACACCGA GCCCTACGAG
AAGGCCTATG CCCATTGCGA CCTGCTGGTG ATCGGCGCCG GGCCTGCGGG GCTGATGGCG
GCCCTGACCG CGGCCCGGGC GGGTGCGGAT GTGATCTTGG CAGAGGAGGG TGACCGTTGT
GGCGGGCGGC TCCTTTCCGA GCAGGAGGAA GTCGACGGGG CCTCAGGGCA GGATTGGGTG
AGCCGGATAG TGGCGGAGTT GGTGACGAGT GGGCGGGTCC GGGTCATGAC CCGCACCACG
GTGACCGGGG CCTATGACGG CGGCATTTAC GGAGCGCTGG AGCGGGTGGG CCTGCACCTG
GCCGACCCGG CGCCCGCCCT GCCGCGGGAG TGTTTCTGGC GCATCTCGGC GCGGCGCGCG
ATCCTGTGCG CCGGCGCGTC CGAACGGCCC ATCGCCTTTC CGATGAACGA CCGTCCCGGC
ATCCTGTCAG CCGCAGCCCT GCGAACCTAC GCCAACCGTT ATCGCACTGT GCCGGGGCCG
CGCGTGGCGC TCTTTTGCAA CAATGACGAT GCGTACCGCA CGGCTCGGGA CCTTGTCGCC
AGCGGGGTCG AGGTGGCTGC CGTGGTGGAT ACCCGCGCCG AGACGCCCGA GGGCGACTAC
CCGGTTTTCC GGGAGGCACG CGTGGTTGAC ACCAGGGGCC GCCATGGCCT GCGCCGGATC
ACCGTCGATG GCCCGGACGG ACGGCGGCAG ATCGAGGTGG ACGCCCTTGG CGTCTCTGGC
GGCTGGAACC CGAATCTGCA TCTGACCTGC CACATGTACG GGCGCCCCAC CTGGAACCCT
GCGCTGCATG CTTTCGTGCC TACGCCTGGG GCGGTGCCAG GTCTCGATGT GGCCGGGGCG
GCCGCGGGGG TGTTTTCCAC CCATGGAGCC TTGAAATCGG GAGTGTTCCG GGCGCGCAAG
GCGTTGGAGT CGCTGGAGCT GGCACCGTCC TCCATGCCCA TCCCGCAGGC AGATGACGCG
CCTTACGTCC TCTCGCCGAT CTATGCGGTC GACGGTGCCA AGGGCGCACG GGGCCGAGCC
TGGCTCGACT TCCAGAACGA TGTGACCGTC AAGGATGTCC GACAGGCCGC GTCCGAGAAC
ATGACCAGCG TCGAGCACAT GAAGCGTTAC ACCACCCAAG GCATGGCCAC GGATCAGGGC
AGGGTGTCGA ATGTGCTGGC GCTTTCGGTG CTCGCCGATG CCACGGGGCG GGGCATCCCA
GAGACCGGGA CAACCACCTT CCGCCCGCCC TATACGCCCG TTTCCCTCGC CGCGCTGGGC
GCAGGGGCGC AGGGCAAGGG GTTCGCGCCG GAGCGGCTGA CCACGTCTCA TGCAGGCAGC
CTCGCGCGCG GTGCACCCAT GGTCGAAGCC GGTCTGTGGT ATCGCCCGAG CTACTTCCCC
AGAGACGGCG AGACGACCTG GCGCGAAGCT TGCGACCGGG AGGTCGGCCA TGTTCGCAGT
GCGGTCGGGG TGTGCGATGT CTCCACCCTC GGCAAGATCG AGTTGCAGGG CCCCGATGCG
GGGGCGTTTC TCGACCTGGT CTATGCCAAC ACCTTCTCGA CCCTGAAACC GGGGCGCGTG
CGCTACGGGG TGATGCTGCG CGAGGACGGG CATGTGATGG ATGACGGCAC GACAGCCTGT
CTCGGGCCGG GGCACTATGT GATGACAACA ACAACCGTCG CAGCGCACAA GGTCATGCAG
CATCTTGAAT TCGTGCAGCA ATGCCTGGTG CCGACGATGC GGGTGCGCTT TACCTCCGTC
ACCGAGAATT GGGCGCAATT CGCCGTGGCG GGGCCGCAAG CCCGCGCGTT GCTCAACGAT
CTGCTGGACG CGCCGGTGAG GAATGCCAAC TTTCCGTTCA TGGCCTGTGG GGCGGTGCGG
TTGGGGGGTG TGGGTGGTCG TTTGTTCCGC ATCTCGTTCT CGGGAGAACA CGCCTATGAA
GTCGCTATCC CGGCCCGCTA CGGTGCTGCG CTGTTCGAGC AACTGGTTGC CCGGGCCGAG
GCCATGGGCG GCGGAGCCTA CGGGATGGAG GCGCTCAACG TGCTGCGGAT CGAAAAGGGG
TTCGTGACCC ATGCGGAGAT CCACGGCCGC GTCACCGCGG ACGATGTCGG GCTGGGGCGG
ATGGTGTCAG CCAAGAAGGA CTGCATCGGC AAGACCGCCA GTCAGCGCCC GGGCCTGACC
GACCCAGACC GGGAGCAACT TGTGGGCCTG CGGCCCTGCG ACGGGCGGAC CGCTTTGTTG
GCCGGGGCGC ACCTGTTCGA GCCCGGGGCA GAGGCGGTGC GGGTCACCGA CCAGGGCTAC
GTGACTAGCG TGTGTTTCTC ACCGACGCTA CAGACGCCTA TTGGGCTCGG CTTCCTGCGC
CGAGGTCGGG CACGGCATGG GCAGCGGGTG ATGATGATCG ATCACTTGCG GGGGGTGACC
ACTGAGTGTG AGGTGACGGA TCCCGTGTTC TTTGATCCCG ACGGAGGGCG CGCCCGTGGC
TGA
 
Protein sequence
MRLDGAGLID RDTPLRFQFD DRWMGGFAGD TLASALMANG VKLVARSFKY HRPRGIWSAG 
SEEPNALVQV GKFARQVPNV RATQAELFEG LYATSQNRWP SLTLDLLSLN DLAAPFLGAG
FYYKTFMWPR SFWEKLYEPA IRRAAGLGAL SGQTNTEPYE KAYAHCDLLV IGAGPAGLMA
ALTAARAGAD VILAEEGDRC GGRLLSEQEE VDGASGQDWV SRIVAELVTS GRVRVMTRTT
VTGAYDGGIY GALERVGLHL ADPAPALPRE CFWRISARRA ILCAGASERP IAFPMNDRPG
ILSAAALRTY ANRYRTVPGP RVALFCNNDD AYRTARDLVA SGVEVAAVVD TRAETPEGDY
PVFREARVVD TRGRHGLRRI TVDGPDGRRQ IEVDALGVSG GWNPNLHLTC HMYGRPTWNP
ALHAFVPTPG AVPGLDVAGA AAGVFSTHGA LKSGVFRARK ALESLELAPS SMPIPQADDA
PYVLSPIYAV DGAKGARGRA WLDFQNDVTV KDVRQAASEN MTSVEHMKRY TTQGMATDQG
RVSNVLALSV LADATGRGIP ETGTTTFRPP YTPVSLAALG AGAQGKGFAP ERLTTSHAGS
LARGAPMVEA GLWYRPSYFP RDGETTWREA CDREVGHVRS AVGVCDVSTL GKIELQGPDA
GAFLDLVYAN TFSTLKPGRV RYGVMLREDG HVMDDGTTAC LGPGHYVMTT TTVAAHKVMQ
HLEFVQQCLV PTMRVRFTSV TENWAQFAVA GPQARALLND LLDAPVRNAN FPFMACGAVR
LGGVGGRLFR ISFSGEHAYE VAIPARYGAA LFEQLVARAE AMGGGAYGME ALNVLRIEKG
FVTHAEIHGR VTADDVGLGR MVSAKKDCIG KTASQRPGLT DPDREQLVGL RPCDGRTALL
AGAHLFEPGA EAVRVTDQGY VTSVCFSPTL QTPIGLGFLR RGRARHGQRV MMIDHLRGVT
TECEVTDPVF FDPDGGRARG
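
The gene and protein sequences can be checked against each other by translating codon by codon. The sketch below is a minimal illustration, not the database's own method: it builds the standard codon table, whose amino acid assignments translation table 11 shares (table 11 differs in which codons may act as starts, which does not matter here because this gene begins with ATG). For brevity it translates only the first 30 and last 33 nucleotides copied from the gene sequence above. The names `translate`, `FIRST_30` and `LAST_33` are illustrative.

```python
from itertools import product

# Standard codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 nt and last 33 nt of the gene sequence, copied from the record.
FIRST_30 = "ATGAGGCTCG ATGGTGCGGG CCTGATCGAC"
LAST_33 = "TTTGATCCCG ACGGAGGGCG CGCCCGTGGC TGA"

print(translate(FIRST_30))
print(translate(LAST_33))
```

The first fragment should reproduce the opening ten residues of the protein sequence, and the second its final ten residues followed by the stop codon.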