Gene Rleg_0889 details

Gene Information

Locus tag: Rleg_0889
Symbol:
ID: 8015494
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 879322
End bp: 880971
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 64%
IMG OID: 644823474
Product: choline dehydrogenase
Protein accession: YP_002974725
Protein GI: 241203629
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID: [TIGR01810] choline dehydrogenase
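
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not; the record does not state these conventions. The function name `check_cds_lengths` is hypothetical.

```python
def check_cds_lengths(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Cross-check CDS coordinates against the reported lengths.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the stop codon (assumptions, not stated in the record).
    """
    span = end_bp - start_bp + 1   # bases covered by the coordinates
    codons = gene_len_bp // 3      # total codons, stop codon included
    return {
        "span_matches_gene_length": span == gene_len_bp,
        "length_is_multiple_of_three": gene_len_bp % 3 == 0,
        "protein_length_matches": codons - 1 == protein_len_aa,
    }

# Values taken from the record for Rleg_0889.
result = check_cds_lengths(879322, 880971, 1650, 549)
print(result)
```

Under these assumptions all three checks hold for this record: the coordinates span 1650 bp, which is 550 codons, or 549 amino acids plus one stop codon.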


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGCAG ATTTCGTCAT CATCGGCTCG GGCTCCGCCG GCTCCGCCCT CGCCTACCGC 
TTGTCGGAAG ACGGCAAGAA CAGTGTCATC GTCATCGAGG CCGGTGGCAG CGATTTCGGG
CCGTTCATCC AGATGCCGGC AGCCCTTGCC TGGCCGATGA GCATGAAGAG ATATAATTGG
GGTTATCTGT CCGAGCCCGA GGCGAACCTC AACAACCGGC GCATCACCGC GCCGCGCGGC
AAGGTGATCG GCGGCTCCTC GTCGATCAAC GGCATGGTCT ATGTGCGCGG CCATGCCGAG
GACTTCAACC GCTGGGAGGA GCTCGGCGCC AGCGGCTGGG CCTATGCCGA TGTACTTCCC
TATTTCAAGC GGATGGAACA TTCGCATGGC GGCGAAGAGG GCTGGCGCGG CACCGATGGG
CCGCTGCATG TCCAGCGCGG CGGCTTCACC AATCCGCTCT TCCGCGCCTT CGTCGAGGCC
GGCAAACAGG CGGGCTTCGA GACGACGGAG GATTACAACG GCAGCAAGCA GGAAGGCTTC
GGTCTGATGG AGCAGACCAT CTTCGGCGGC CGCCGCTGGT CTGCCGCCAA CGCCTATCTG
AGACCGGCGC TGAAGCGTGA CAATGTCAGG ATCGTCTATG GCTTTGCGCA GAAGATCGTG
ATCGAGGACG GGCGGGCGAC CGGCGTCGAG ATTGAACGCA ACGGCAGGAT CGAGGTGCTG
AAGGCGAACC GCGAGGTGAT CGTCTCGGCC TCCTCTTTCA ATTCGCCGAA GCTCTTGATG
CTGTCGGGCA TCGGTCCCGG CCAACATCTG CAGGACATGG GCATTACGGT GAAGGCCGAC
CGGCCGGGCG TCGGCGCCAA CTTGCAGGAC CATATGGAAT TCTACTTCCA GCAGGTGAGC
ACCAAGCCGG TGTCGCTCTA TTCCTGGCTG CCGTGGTTCT GGCAGGGGGT GGCGGGCGCC
CAATGGCTGC TCTCGCGCGG CGGGCTCGGC GCCTCCAACC AGTTCGAGGC CTGCGCCTTC
CTGCGCTCGG CGCCGGGGCT GAAGCAGCCC GACATCCAGT ATCATTTCCT GCCGGTGGCG
ATCAGCTATG ACGGCAAGGC GGCCGCGAAA AGCCACGGCT TCCAGGTTCA TGTCGGCTAT
AACCTGTCGA AATCGCGCGG CAGCGTGAGC TTGCGCTCCG CCGATCCCAA GGCCGACCCG
GTGCTGCGCT TCAACTATAT GAGCCATGCC GAGGATTGGG AGAAATTCCG CCACTGCGTG
CGCCTCACCC GCGAAATCTT CGGGCAGACG GCCTTCAACG ACTATCGCGG CCCGGAGATC
CAGCCGGGCG AAAGCGTGCA AAGCGACGAA GAGATCGACG CCTTCCTGCG CGAACATCTG
GAAAGCGCCT ATCACCCCTG CGGCACCTGC CGGATGGGCG CCAAGGACGA TCCGATGGCG
GTGGTCGATC CGCAAACGCG GGTGATCGGC ATCGATGGCC TGCGCGTCGC CGACAGCTCG
ATCTTCCCGC ACGTCACTTA TGGCAATTTG AACGGCCCCT CGATCATGAC CGGAGAGAAG
GCCGCCGACC ATATCCTCGG CAAACAACCG CTGGCGCGTT CGAACCAGGA ACCCTGGGTC
AACCCGCGCG CGGCCGTCAG CGATCGATAA
 
Protein sequence
MQADFVIIGS GSAGSALAYR LSEDGKNSVI VIEAGGSDFG PFIQMPAALA WPMSMKRYNW 
GYLSEPEANL NNRRITAPRG KVIGGSSSIN GMVYVRGHAE DFNRWEELGA SGWAYADVLP
YFKRMEHSHG GEEGWRGTDG PLHVQRGGFT NPLFRAFVEA GKQAGFETTE DYNGSKQEGF
GLMEQTIFGG RRWSAANAYL RPALKRDNVR IVYGFAQKIV IEDGRATGVE IERNGRIEVL
KANREVIVSA SSFNSPKLLM LSGIGPGQHL QDMGITVKAD RPGVGANLQD HMEFYFQQVS
TKPVSLYSWL PWFWQGVAGA QWLLSRGGLG ASNQFEACAF LRSAPGLKQP DIQYHFLPVA
ISYDGKAAAK SHGFQVHVGY NLSKSRGSVS LRSADPKADP VLRFNYMSHA EDWEKFRHCV
RLTREIFGQT AFNDYRGPEI QPGESVQSDE EIDAFLREHL ESAYHPCGTC RMGAKDDPMA
VVDPQTRVIG IDGLRVADSS IFPHVTYGNL NGPSIMTGEK AADHILGKQP LARSNQEPWV
NPRAAVSDR
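
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own method. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation (the two differ in permitted start codons). The helpers `clean` and `translate` are hypothetical names. It is applied only to the first 30 bases and to the final row of the gene sequence as printed above.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses
# the same amino-acid assignments and differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used for 10-base blocks."""
    return "".join(seq.split()).upper()

def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":   # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First three blocks and the final row of the gene sequence in this record.
first_blocks = "ATGCAAGCAG ATTTCGTCAT CATCGGCTCG"
last_row = "AACCCGCGCG CGGCCGTCAG CGATCGATAA"

print(translate(first_blocks))  # MQADFVIIGS
print(translate(last_row))      # NPRAAVSDR
```

Both fragments match the corresponding ends of the protein sequence. Each full row of the gene sequence holds 60 bases (20 codons), so every row starts in frame and can be translated on its own in the same way.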