Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_0562 |
Symbol | |
ID | 5321398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 608815 |
End bp | 610464 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640789498 |
Product | choline dehydrogenase |
Protein accession | YP_001326253 |
Protein GI | 150395786 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | [TIGR01810] choline dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.55111 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGCAG ATTTCGTCAT CATCGGTTCC GGCTCGGCGG GCTCGGCCCT CGCCTATCGC CTGTCGGAAG ACGGCGCGAA TTCGGTCGTC GTGCTCGAAT TCGGCGGCTC GGACGTCGGC CCGTTCATTC AGATGCCGGC GGCGCTGGCC TGGCCGATGA GCATGAACCG TTATAATTGG GGCTACCTCT CCGAACCCGA GCCGAACCTC AACAACCGGC GCATCACCGC GCCGCGCGGC AAGGTGATCG GCGGCTCCTC TTCGATCAAC GGCATGGTCT ATGTCCGCGG GCACTCGGAA GACTTCGACC GGTGGGAAGA ACTCGGCGCA AAAGGCTGGG CCTATGCGGA CGTGCTGCCC TATTACAAGC GGATGGAGCA TTCGCACGGG GGCGAGGAGG GTTGGCGCGG CACCGACGGA CCGCTGCACG TGCAGCGCGG CCCGGTCAAG AATCCCCTTT TCCACGCCTT CATCGAGGCC GGAAAGCAGG CCGGCTTCGA GGTCACGGAG GACTACAACG GCTCGAAGCA GGAAGGGTTC GGGTTGATGG AGCAGACGAC ATGGCGGGGC CGCCGCTGGT CGGCCGCATC CGCCTATTTG AGGCCAGCGC TCAAGCGCCC GAATGTCGAG CTCGTCCGGT GCTTCGCCCG CAAGATCGTT ATCGAGAACG GCCGGGCGAC CGGCGTGGAG ATCGAGCGCG GCGGCCGCAC CGAGGTCGTC AGGGCCAATC GCGAGGTGAT CGTCTCCGCC TCCTCCTTCA ACTCGCCGAA GCTCCTGATG CTCTCCGGCA TCGGCCCCGC CGCGCATTTG CAGGAGATGG GCATCGACGT GAAGGCCGAC CGGCCCGGCG TCGGCCAGAA CCTGCAGGAC CACATGGAAT TCTATTTCCA GCAGGTGAGC ACCAAGCCGG TTTCGCTATA TTCTTGGCTG CCATGGTTCT GGCAGGGCGT TGCCGGGGCA CAATGGCTCT TCTTCAAAAG AGGCCTCGGC ATTTCCAACC AGTTCGAGTC CTGCGCCTTC CTGCGCTCGG CGCCCGGCGT CAAACAGCCG GACATCCAGT ATCATTTCCT TCCCGTCGCC ATCAGTTATG ACGGCAAGGC GGCAGCGAAG TCGCACGGCT TCCAGGTGCA TGTCGGCTAC AATCTCTCCA AGTCGCGCGG CGACGTCACG CTTCGCTCGT CCGATCCGAA AGCCGACCCG GTGATCCGCT TCAACTATAT GAGCCATCCC GAGGACTGGG AGAAGTTTCG CCATTGCGTG CGGCTGACCC GCGAGATTTT CGGCCAGAAG GCGTTCGACC TCTATCGTGG CCCGGAAATC CAGCCGGGCG AGAAGGTCCG GACCGACGAG GAGATCGACG CCTTTCTGCG CGAGCATCTC GAAAGCGCCT ATCACCCCTG CGGCACCTGC AAGATGGGCG CGAAGGACGA CCCGATGGCC GTGGTCGACC CGGAAACCCG CGTCATCGGT GTCGATGGCC TTCGCGTTGC CGATTCCTCG ATTTTCCCGC ATATCACCTA TGGCAATCTG AACGCTCCCT CGATCATGAC CGGCGAAAAG GCCGCCGACC ACATCCTCGG GAAGCAGCCT CTCGCCCGTT CCAACCAGGA ACCCTGGATC AATCCGCGCT GGGCGGTGAG CGATCGGTAG
|
Protein sequence | MQADFVIIGS GSAGSALAYR LSEDGANSVV VLEFGGSDVG PFIQMPAALA WPMSMNRYNW GYLSEPEPNL NNRRITAPRG KVIGGSSSIN GMVYVRGHSE DFDRWEELGA KGWAYADVLP YYKRMEHSHG GEEGWRGTDG PLHVQRGPVK NPLFHAFIEA GKQAGFEVTE DYNGSKQEGF GLMEQTTWRG RRWSAASAYL RPALKRPNVE LVRCFARKIV IENGRATGVE IERGGRTEVV RANREVIVSA SSFNSPKLLM LSGIGPAAHL QEMGIDVKAD RPGVGQNLQD HMEFYFQQVS TKPVSLYSWL PWFWQGVAGA QWLFFKRGLG ISNQFESCAF LRSAPGVKQP DIQYHFLPVA ISYDGKAAAK SHGFQVHVGY NLSKSRGDVT LRSSDPKADP VIRFNYMSHP EDWEKFRHCV RLTREIFGQK AFDLYRGPEI QPGEKVRTDE EIDAFLREHL ESAYHPCGTC KMGAKDDPMA VVDPETRVIG VDGLRVADSS IFPHITYGNL NAPSIMTGEK AADHILGKQP LARSNQEPWI NPRWAVSDR
|
| |