Gene Rleg2_0776 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_0776 
Symbol 
ID6979494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp793929 
End bp795578 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content64% 
IMG OID643395488 
Productcholine dehydrogenase 
Protein accessionYP_002280297 
Protein GI209548380 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID[TIGR01810] choline dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0802775 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGCAG ATTTCGTCAT CATCGGCTCA GGCTCGGCCG GCTCGGCGCT TGCCTATCGC 
CTCTCCGAAG ACGGCAAGAA CAGCGTCCTC GTCATCGAGG CGGGCGGCAG CGATTTCGGG
CCGTTTATCC AGATGCCGGC GGCCCTTGCC TGGCCGATGA GCATGAAGCG CTACAATTGG
GGCTATTTGT CCGAGCCGGA GCCGAACCTC AACAACCGGC GCATCACCGC GCCGCGCGGC
AAGGTGATCG GCGGGTCCTC CTCGATCAAC GGCATGGTCT ATGTGCGCGG CCATGCCGAG
GATTTCAACC GCTGGGAGGA GCTCGGCGCC GGCGGCTGGG CCTATGCCGA TGTCCTCCCC
TATTTCAAGC GGATGGAGCA TTCGCATGGT GGCGAGGAGG GTTGGCGCGG CACCGACGGG
CCGCTGCATG TCCAGCGCGG CGGTTTCACC AATCCGCTGT TTCGGGCCTT TGTCGAGGCC
GGCAAACAGG CGGGCTTCGA GACGACCGAG GATTATAACG GCAGCAAGCA GGAAGGCTTC
GGGCTGATGG AGCAGACCAT CTTTTCCGGC CGCCGCTGGT CTGCCGCCAA CGCCTATCTG
AAACCGGCGC TGAAGCGGAA AAATGTCGAG ATCATCTACG GCTTTGCGCA AAGGATCGTG
ATCGAAGACG GCCGGGCGAC CGGCGTCGAG ATCGAGCGCG GCGGCAAAAT AGAGGTGGTC
AAGGCCAACC GCGAGGTGAT CGTCTCGGCC TCCTCGTTCA ATTCGCCGAA GCTGTTGATG
TTGTCGGGCA TCGGCCCGGG CGAGCATCTC AAGGAAATGG GCATCGAGGT GAAGGCCGAC
CGGCCGGGGG TCGGCGCCAA CCTGCAGGAC CATATGGAAT TCTACTTTCA GCAGGTGAGC
ACCAAGCCGG TGTCGCTCTA TTCCTGGCTG CCGTGGTTCT GGCAGGGAGT GGCGGGCGCG
CAATGGCTGC TGTCCAAGGG CGGGCTCGGC GCCTCCAACC AGTTCGAGGC CTGCGCCTTC
CTGCGCTCGG CGCCCGGGCT GAAGCAGCCC GACATCCAAT ACCATTTCCT GCCGGTGGCA
ATCAGCTATG ACGGCAAGGC GGCAGCCAAA AGCCACGGTT TCCAGGTGCA TGTCGGCTAC
AACCTGTCGA AATCGCGCGG CAGCGTGACG CTGCGCTCGC CCGACCCGAA GGCCGACCCG
GTGCTGCGCT TCAACTATAT GAGCCATGCC GAGGACTGGG AGAAATTCCG CCACTGCGTG
CGGCTGACCC GCGAGCTCTT CGGGCAAGCG GCCTTCGACG ACTACCGCGG GGCAGAGATC
CAGCCCGGAG AAAGTGTGCA AAGCGACGAA GAGATCGACG CCTTCCTGCG CGAGCATCTG
GAAAGCGCCT ATCACCCCTG CGGCACCTGC AAGATGGGCG CCAAGGACGA TCCAATGGCG
GTGGTCGATC CCCAGACCCG GGTGATCGGG GTCGAGGCCT TGCGCGTCGC CGACAGCTCG
ATCTTCCCGC ACGTCACCTA CGGCAACCTC AACGGCCCGT CGATCATGAC CGGCGAAAAG
GCCGCCGACC ACATCCTCGG CAAACAGCCG CTGGCGCGCT CGAACCAGGA ACCCTGGGTC
AATCCGCGGG CCGCCGTCAG CGATCGGTGA
 
Protein sequence
MQADFVIIGS GSAGSALAYR LSEDGKNSVL VIEAGGSDFG PFIQMPAALA WPMSMKRYNW 
GYLSEPEPNL NNRRITAPRG KVIGGSSSIN GMVYVRGHAE DFNRWEELGA GGWAYADVLP
YFKRMEHSHG GEEGWRGTDG PLHVQRGGFT NPLFRAFVEA GKQAGFETTE DYNGSKQEGF
GLMEQTIFSG RRWSAANAYL KPALKRKNVE IIYGFAQRIV IEDGRATGVE IERGGKIEVV
KANREVIVSA SSFNSPKLLM LSGIGPGEHL KEMGIEVKAD RPGVGANLQD HMEFYFQQVS
TKPVSLYSWL PWFWQGVAGA QWLLSKGGLG ASNQFEACAF LRSAPGLKQP DIQYHFLPVA
ISYDGKAAAK SHGFQVHVGY NLSKSRGSVT LRSPDPKADP VLRFNYMSHA EDWEKFRHCV
RLTRELFGQA AFDDYRGAEI QPGESVQSDE EIDAFLREHL ESAYHPCGTC KMGAKDDPMA
VVDPQTRVIG VEALRVADSS IFPHVTYGNL NGPSIMTGEK AADHILGKQP LARSNQEPWV
NPRAAVSDR