Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_0804 |
Symbol | |
ID | 5711240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 810647 |
End bp | 812269 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641266713 |
Product | glucose-methanol-choline oxidoreductase |
Protein accession | YP_001532150 |
Protein GI | 159043356 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.271201 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGCGG ACTATGTGGT CATCGGTGCC GGCTCCGCCG GCTGTGTCGT CGCCAATCGC CTCAGCACGG ATGCGCGCAA CAAGGTCGTC CTCCTGGAGG CGGGCCCGCC CGACACGAAC CCCTGGATCC ACATCCCCGT CGGCTATTTC AAGACCATGC ACAATCCGAC CGTGGACTGG TGCTACAAGA CCCAGGCCGA CCCGGGCCTC AACGGGCGCT CCATCGACTG GCCGCGCGGC AAGGTGCTGG GGGGCTCGTC CTCGCTCAAC GGGCTGCTCT ACGTGCGCGG CCAGCCCGAG GATTACGACC GCTGGCGCCA GATGGGCAAT GCGGGCTGGG GCTGGGACGA TGTGCTGCCG CTCTTCCGGC GGGCCGAGGC GAACGAGCGC GGCGCCGATC CCTGGCACGG CGATGACGGC CCGCTGGCGG TCTCGAACAT GCGCATCCAG CGCCCGATCT GCGATGCCTG GGTCGCCGCG GCGCAAGCCA TGGGCTACCC GTTCAACCCC GACTACAACG GCGCCAGCCA GGAGGGGGTG GGCTATTTCC AGCTCACCAC CCGCAACGGC CGCCGCTGCA GCGCCGCGGT CGCCTACCTC AAACCCGCCA GGAAACGCCC GAACCTGAGC ATCATCACCC GCGCGCTCGT CACCCGGATC GAGATGGAGG GCAAGCGCGT CACCGGTGTG ACCTATACCG ACGCGGGCGG GCGCGCCCAC ACCGTCAGCG CCCGGCGCGA GGTGATCCTG TCCGGCGGTG CGATCAACTC GCCCCATATC CTGATGCTGT CGGGCATCGG CGACCCCGAC CAGCTCCAGG CCCATGGCAT CACGCCGCGC CACGCGCTCC CCGGTGTCGG CAAGAACCTG CAGGACCACC TGCAGGCGCG TCTGGTGTTC AAATGCAACG AGCCGACCCT GAACGATGAG GTCCGCAGCC TCGTCAATCA GGCGCGCATC GCCCTGAAAT ACGCGCTTTT CCGCGCGGGG CCAATGACCA TGGCCGCCAG CCTCGCCACG GGCTTTCTGA AAACCCGGCC CGACATCGCC ACGCCGGACA TCCAGTTCCA CGTCCAGCCC TGGTCCGCCG ACAGCCCCGG CGAAGGCGTG CACCCGTTCT CGGCCTTCAC CATGTCCGTG TGCCAGCTGC GCCCCGAAAG CCGCGGCGAG ATCCGCCTCG CCGGGCCGGA CCCGCGCACC TATCCCACGA TCCACCCCAA CTACCTGTCG ACCGAAACCG ACTGCGCCAC CCTCACCGAA GGCGTCAAGA TCGCGCGCCG GATCGCGCGG GCCGACCCTC TGGCGGGCAA GATCGCCGAG GAATTCCGCC CCCCCGCCAA TCTCGCGCTC GACGACGATG CGGCCACGCT GGATTGGGCG CGGAGCAACT CGGTCTCGAT CTACCACCCC ACGGGCACCT GCAAGATGGG CACCGGCCCC GGCGCCGTGG TGGACGCCCG GCTGCGGGTC CACGGGCTGT CGGGCCTGCG CGTGGCGGAT TGCTCGATCA TGCCCGAGAT CGTCTCGGGC AACACCAACG CCCCGGCCAT CATGATCGGC GAGAAGCTCT CCGACATGGT GCTCGAAGAC GCCAGGGACA CCGCCCAAGC GGTCCCGGCC TGA
|
Protein sequence | MEADYVVIGA GSAGCVVANR LSTDARNKVV LLEAGPPDTN PWIHIPVGYF KTMHNPTVDW CYKTQADPGL NGRSIDWPRG KVLGGSSSLN GLLYVRGQPE DYDRWRQMGN AGWGWDDVLP LFRRAEANER GADPWHGDDG PLAVSNMRIQ RPICDAWVAA AQAMGYPFNP DYNGASQEGV GYFQLTTRNG RRCSAAVAYL KPARKRPNLS IITRALVTRI EMEGKRVTGV TYTDAGGRAH TVSARREVIL SGGAINSPHI LMLSGIGDPD QLQAHGITPR HALPGVGKNL QDHLQARLVF KCNEPTLNDE VRSLVNQARI ALKYALFRAG PMTMAASLAT GFLKTRPDIA TPDIQFHVQP WSADSPGEGV HPFSAFTMSV CQLRPESRGE IRLAGPDPRT YPTIHPNYLS TETDCATLTE GVKIARRIAR ADPLAGKIAE EFRPPANLAL DDDAATLDWA RSNSVSIYHP TGTCKMGTGP GAVVDARLRV HGLSGLRVAD CSIMPEIVSG NTNAPAIMIG EKLSDMVLED ARDTAQAVPA
|
| |