Gene Smed_0562 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0562 
Symbol 
ID5321398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp608815 
End bp610464 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content64% 
IMG OID640789498 
Productcholine dehydrogenase 
Protein accessionYP_001326253 
Protein GI150395786 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID[TIGR01810] choline dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.55111 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGCAG ATTTCGTCAT CATCGGTTCC GGCTCGGCGG GCTCGGCCCT CGCCTATCGC 
CTGTCGGAAG ACGGCGCGAA TTCGGTCGTC GTGCTCGAAT TCGGCGGCTC GGACGTCGGC
CCGTTCATTC AGATGCCGGC GGCGCTGGCC TGGCCGATGA GCATGAACCG TTATAATTGG
GGCTACCTCT CCGAACCCGA GCCGAACCTC AACAACCGGC GCATCACCGC GCCGCGCGGC
AAGGTGATCG GCGGCTCCTC TTCGATCAAC GGCATGGTCT ATGTCCGCGG GCACTCGGAA
GACTTCGACC GGTGGGAAGA ACTCGGCGCA AAAGGCTGGG CCTATGCGGA CGTGCTGCCC
TATTACAAGC GGATGGAGCA TTCGCACGGG GGCGAGGAGG GTTGGCGCGG CACCGACGGA
CCGCTGCACG TGCAGCGCGG CCCGGTCAAG AATCCCCTTT TCCACGCCTT CATCGAGGCC
GGAAAGCAGG CCGGCTTCGA GGTCACGGAG GACTACAACG GCTCGAAGCA GGAAGGGTTC
GGGTTGATGG AGCAGACGAC ATGGCGGGGC CGCCGCTGGT CGGCCGCATC CGCCTATTTG
AGGCCAGCGC TCAAGCGCCC GAATGTCGAG CTCGTCCGGT GCTTCGCCCG CAAGATCGTT
ATCGAGAACG GCCGGGCGAC CGGCGTGGAG ATCGAGCGCG GCGGCCGCAC CGAGGTCGTC
AGGGCCAATC GCGAGGTGAT CGTCTCCGCC TCCTCCTTCA ACTCGCCGAA GCTCCTGATG
CTCTCCGGCA TCGGCCCCGC CGCGCATTTG CAGGAGATGG GCATCGACGT GAAGGCCGAC
CGGCCCGGCG TCGGCCAGAA CCTGCAGGAC CACATGGAAT TCTATTTCCA GCAGGTGAGC
ACCAAGCCGG TTTCGCTATA TTCTTGGCTG CCATGGTTCT GGCAGGGCGT TGCCGGGGCA
CAATGGCTCT TCTTCAAAAG AGGCCTCGGC ATTTCCAACC AGTTCGAGTC CTGCGCCTTC
CTGCGCTCGG CGCCCGGCGT CAAACAGCCG GACATCCAGT ATCATTTCCT TCCCGTCGCC
ATCAGTTATG ACGGCAAGGC GGCAGCGAAG TCGCACGGCT TCCAGGTGCA TGTCGGCTAC
AATCTCTCCA AGTCGCGCGG CGACGTCACG CTTCGCTCGT CCGATCCGAA AGCCGACCCG
GTGATCCGCT TCAACTATAT GAGCCATCCC GAGGACTGGG AGAAGTTTCG CCATTGCGTG
CGGCTGACCC GCGAGATTTT CGGCCAGAAG GCGTTCGACC TCTATCGTGG CCCGGAAATC
CAGCCGGGCG AGAAGGTCCG GACCGACGAG GAGATCGACG CCTTTCTGCG CGAGCATCTC
GAAAGCGCCT ATCACCCCTG CGGCACCTGC AAGATGGGCG CGAAGGACGA CCCGATGGCC
GTGGTCGACC CGGAAACCCG CGTCATCGGT GTCGATGGCC TTCGCGTTGC CGATTCCTCG
ATTTTCCCGC ATATCACCTA TGGCAATCTG AACGCTCCCT CGATCATGAC CGGCGAAAAG
GCCGCCGACC ACATCCTCGG GAAGCAGCCT CTCGCCCGTT CCAACCAGGA ACCCTGGATC
AATCCGCGCT GGGCGGTGAG CGATCGGTAG
 
Protein sequence
MQADFVIIGS GSAGSALAYR LSEDGANSVV VLEFGGSDVG PFIQMPAALA WPMSMNRYNW 
GYLSEPEPNL NNRRITAPRG KVIGGSSSIN GMVYVRGHSE DFDRWEELGA KGWAYADVLP
YYKRMEHSHG GEEGWRGTDG PLHVQRGPVK NPLFHAFIEA GKQAGFEVTE DYNGSKQEGF
GLMEQTTWRG RRWSAASAYL RPALKRPNVE LVRCFARKIV IENGRATGVE IERGGRTEVV
RANREVIVSA SSFNSPKLLM LSGIGPAAHL QEMGIDVKAD RPGVGQNLQD HMEFYFQQVS
TKPVSLYSWL PWFWQGVAGA QWLFFKRGLG ISNQFESCAF LRSAPGVKQP DIQYHFLPVA
ISYDGKAAAK SHGFQVHVGY NLSKSRGDVT LRSSDPKADP VIRFNYMSHP EDWEKFRHCV
RLTREIFGQK AFDLYRGPEI QPGEKVRTDE EIDAFLREHL ESAYHPCGTC KMGAKDDPMA
VVDPETRVIG VDGLRVADSS IFPHITYGNL NAPSIMTGEK AADHILGKQP LARSNQEPWI
NPRWAVSDR