Gene Rleg2_4456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4456 
Symbol 
ID6977550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp88131 
End bp89726 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content65% 
IMG OID643393634 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_002278452 
Protein GI209546534 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.691807 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0210476 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACTT ACGATTACAT CATCGTCGGA GGCGGATCCT CGGGTTGCGT GTTGGCGAAC 
CGGCTGTCGG AAAATGCGGC GCACAAGGTG CTGCTGATCG AGTCCGGCCG CCGCGACGCC
GATCCGTGGA TCCATATACC CGCGACCTTC TTCAAGGTGC TGGGCAAGGG GATCGATATC
CATCCCTATG CATCGAACCC CGACAAAGGG CTGAACGGGC GGCCGAGCAT CGTGCCGCAG
GGCAATGTGC TTGGCGGCGG CAGCTCGGTG AATGCGATGA TCTATATTCG CGGCCATCGC
AACGACTACG ACACCTGGTC GCAGATGGGC TGCCAGGGCT GGTCCTATGA CGACGTGCTT
CCGGCCTTCC GCTCGCTGGA GCACAACGAG CGGCTGAACG GCCAATTCCA CGGTAGGAAG
GGCGGCCTGC ACGTCTCCGA CCCCCGCCAT CGGCATCCGC TGAGCGAGGC CTTCGTCCAG
GCGGCGACCG AAATCGGCAT TCCCCAGAAC GATGATTTCA ACGGCGCCGA TCAGGCCGGT
GTCGGCTTTT ATCAGAGCAC CACCCATGGC GGCCGGCGCT GGAGTTCGGC CCAGGCTTTC
CTGCGCGAAG CGGAGAAACG GCCGAACCTG ACGGTGCTCA CCGAGCGCAA GGTCGCCCGC
ATCCTATTCG AAGGACAGAA GGCCGTCGGC GTCGAGCTTC TCGACGGCAC GACATTCAAG
GCGTCGCGCG AAATCGCCCT CACAGCAGGC GCGATCGCGA CGCCGAAGAT CCTGCAACAC
TCCGGTATCG GCGACGGCGC GCATCTTTCC TCGCTCGGCA TCAAGGTCGT CGCCGATCTG
CCGGGCGTCG GCGCGAACTA CCAGGACCAT CTGGAAGTGC CGGTGCAGGG CGAGACGCGC
GAGCCGATTT CGATCCTCGG CCACGATACG GGGCTGCGGG CCGTCGGCCA TATGCTGCGC
TATCTCACCT CCCGCCGCGG ATTGCTGGCA TCTAACGTCG TCGAATGCGG CGGCTTCGTC
GATACGGCGG GCACGGGGCA GCCGGACGTT CAGTTCCATG TCCTGCCGGT GCTGATCGGC
TTTGTCGATC GCGAACCGGA GCCCGGCCAC GGCCTCAGCA TCGGGCCGTG TTACCTGCGG
CCGCGGTCGC GCGGCTGGAT CAGGCTGAAG AGCGCCGATC CCAGCGAGCA GACGGATTTC
AACGCCAATC TGCTCTCCGA TCCCGCCGAT ATCGAAACCC TGGTGCGCGG CGTCGAGACG
GCGATCCGCA TCCTCGATGC CCCGGCGCTT GCCAAGCTGG TCAAGCGCCG CGTGCTGCCG
AAGCCCGGCG TCGAAAAGGA TCCGGAGGCT CTGCGCGATT ACATCCGTCA ATCAGCCAAA
ACAGTGTTCC ATCCGGCGGG AACGGCGCGC ATGGGCCGCG CTGACGATCG CATGGCGGTC
GTCGGACCGG ACCTCAAGGT GCGCGGCGTC GAGGGGCTGC GCGTCTGCGA CGCTTCGGTA
ATGCCGACGC TGGTGTCGGG CAATACCAAC GCGCCGACGA TGATGATTGC GGCAAAGGCC
GGTGCGTTCA TGACAGCGAA GGCGATTTCC GGCTGA
 
Protein sequence
MTTYDYIIVG GGSSGCVLAN RLSENAAHKV LLIESGRRDA DPWIHIPATF FKVLGKGIDI 
HPYASNPDKG LNGRPSIVPQ GNVLGGGSSV NAMIYIRGHR NDYDTWSQMG CQGWSYDDVL
PAFRSLEHNE RLNGQFHGRK GGLHVSDPRH RHPLSEAFVQ AATEIGIPQN DDFNGADQAG
VGFYQSTTHG GRRWSSAQAF LREAEKRPNL TVLTERKVAR ILFEGQKAVG VELLDGTTFK
ASREIALTAG AIATPKILQH SGIGDGAHLS SLGIKVVADL PGVGANYQDH LEVPVQGETR
EPISILGHDT GLRAVGHMLR YLTSRRGLLA SNVVECGGFV DTAGTGQPDV QFHVLPVLIG
FVDREPEPGH GLSIGPCYLR PRSRGWIRLK SADPSEQTDF NANLLSDPAD IETLVRGVET
AIRILDAPAL AKLVKRRVLP KPGVEKDPEA LRDYIRQSAK TVFHPAGTAR MGRADDRMAV
VGPDLKVRGV EGLRVCDASV MPTLVSGNTN APTMMIAAKA GAFMTAKAIS G