Gene Dshi_0804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_0804 
Symbol 
ID5711240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp810647 
End bp812269 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content69% 
IMG OID641266713 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_001532150 
Protein GI159043356 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.271201 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGCGG ACTATGTGGT CATCGGTGCC GGCTCCGCCG GCTGTGTCGT CGCCAATCGC 
CTCAGCACGG ATGCGCGCAA CAAGGTCGTC CTCCTGGAGG CGGGCCCGCC CGACACGAAC
CCCTGGATCC ACATCCCCGT CGGCTATTTC AAGACCATGC ACAATCCGAC CGTGGACTGG
TGCTACAAGA CCCAGGCCGA CCCGGGCCTC AACGGGCGCT CCATCGACTG GCCGCGCGGC
AAGGTGCTGG GGGGCTCGTC CTCGCTCAAC GGGCTGCTCT ACGTGCGCGG CCAGCCCGAG
GATTACGACC GCTGGCGCCA GATGGGCAAT GCGGGCTGGG GCTGGGACGA TGTGCTGCCG
CTCTTCCGGC GGGCCGAGGC GAACGAGCGC GGCGCCGATC CCTGGCACGG CGATGACGGC
CCGCTGGCGG TCTCGAACAT GCGCATCCAG CGCCCGATCT GCGATGCCTG GGTCGCCGCG
GCGCAAGCCA TGGGCTACCC GTTCAACCCC GACTACAACG GCGCCAGCCA GGAGGGGGTG
GGCTATTTCC AGCTCACCAC CCGCAACGGC CGCCGCTGCA GCGCCGCGGT CGCCTACCTC
AAACCCGCCA GGAAACGCCC GAACCTGAGC ATCATCACCC GCGCGCTCGT CACCCGGATC
GAGATGGAGG GCAAGCGCGT CACCGGTGTG ACCTATACCG ACGCGGGCGG GCGCGCCCAC
ACCGTCAGCG CCCGGCGCGA GGTGATCCTG TCCGGCGGTG CGATCAACTC GCCCCATATC
CTGATGCTGT CGGGCATCGG CGACCCCGAC CAGCTCCAGG CCCATGGCAT CACGCCGCGC
CACGCGCTCC CCGGTGTCGG CAAGAACCTG CAGGACCACC TGCAGGCGCG TCTGGTGTTC
AAATGCAACG AGCCGACCCT GAACGATGAG GTCCGCAGCC TCGTCAATCA GGCGCGCATC
GCCCTGAAAT ACGCGCTTTT CCGCGCGGGG CCAATGACCA TGGCCGCCAG CCTCGCCACG
GGCTTTCTGA AAACCCGGCC CGACATCGCC ACGCCGGACA TCCAGTTCCA CGTCCAGCCC
TGGTCCGCCG ACAGCCCCGG CGAAGGCGTG CACCCGTTCT CGGCCTTCAC CATGTCCGTG
TGCCAGCTGC GCCCCGAAAG CCGCGGCGAG ATCCGCCTCG CCGGGCCGGA CCCGCGCACC
TATCCCACGA TCCACCCCAA CTACCTGTCG ACCGAAACCG ACTGCGCCAC CCTCACCGAA
GGCGTCAAGA TCGCGCGCCG GATCGCGCGG GCCGACCCTC TGGCGGGCAA GATCGCCGAG
GAATTCCGCC CCCCCGCCAA TCTCGCGCTC GACGACGATG CGGCCACGCT GGATTGGGCG
CGGAGCAACT CGGTCTCGAT CTACCACCCC ACGGGCACCT GCAAGATGGG CACCGGCCCC
GGCGCCGTGG TGGACGCCCG GCTGCGGGTC CACGGGCTGT CGGGCCTGCG CGTGGCGGAT
TGCTCGATCA TGCCCGAGAT CGTCTCGGGC AACACCAACG CCCCGGCCAT CATGATCGGC
GAGAAGCTCT CCGACATGGT GCTCGAAGAC GCCAGGGACA CCGCCCAAGC GGTCCCGGCC
TGA
 
Protein sequence
MEADYVVIGA GSAGCVVANR LSTDARNKVV LLEAGPPDTN PWIHIPVGYF KTMHNPTVDW 
CYKTQADPGL NGRSIDWPRG KVLGGSSSLN GLLYVRGQPE DYDRWRQMGN AGWGWDDVLP
LFRRAEANER GADPWHGDDG PLAVSNMRIQ RPICDAWVAA AQAMGYPFNP DYNGASQEGV
GYFQLTTRNG RRCSAAVAYL KPARKRPNLS IITRALVTRI EMEGKRVTGV TYTDAGGRAH
TVSARREVIL SGGAINSPHI LMLSGIGDPD QLQAHGITPR HALPGVGKNL QDHLQARLVF
KCNEPTLNDE VRSLVNQARI ALKYALFRAG PMTMAASLAT GFLKTRPDIA TPDIQFHVQP
WSADSPGEGV HPFSAFTMSV CQLRPESRGE IRLAGPDPRT YPTIHPNYLS TETDCATLTE
GVKIARRIAR ADPLAGKIAE EFRPPANLAL DDDAATLDWA RSNSVSIYHP TGTCKMGTGP
GAVVDARLRV HGLSGLRVAD CSIMPEIVSG NTNAPAIMIG EKLSDMVLED ARDTAQAVPA