Gene Rleg2_4996 details


Gene Information

Locus tag: Rleg2_4996
Symbol:
ID: 6978090
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 640672
End bp: 642366
Gene Length: 1695 bp
Protein Length: 564 aa
Translation table: 11
GC content: 65%
IMG OID: 643394142
Product: glucose-methanol-choline oxidoreductase
Protein accession: YP_002278960
Protein GI: 209547042
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
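
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(640672, 642366)
print(length, protein_length(length))
```

Under these assumptions the values agree with the Gene Length and Protein Length fields of the record.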


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.833385
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGTCG ATCATCTGAT CCTTGGGGGA GGCTCGGCGG GCTGCGTGCT GGCGGCTCGG 
CTATCGGCCG ATCCGAACCG GAGGGTCGTT CTGGTCGAGG CGGGACGCAA CATTGCGCCC
GACGACATCC CAGGCGATAT CCGCAGTCGT TATCCCGGCC GGGCCTATCT CGACACGCGC
AATATCTGGT CGCAACTGAC GGCGCTGATG GGTTATGCCC GCTCCAACGT GGCACCGCGT
TCCTCGCGCA GATACGAGCA GGCGCGCCTA CTGGGTGGCG GCTCGGCGAT CAATGCGCTT
ATGGCCAATC GCGGCGCGCC GGCCGATTAT GCCGAATGGG AGGCTCTGGG CGCGGACGGC
TGGGGCTGGG ACGAATGTCT GCCCTATTTC CGCAAGATCG AGTCCGACCG CGATTTCAAT
GGCCCGCTGC ATGGACAGGA TGGACCGCTG ACGATCCGCC GCATTTCGGA CGAGAAGATT
TCACCCTTCG TGGATCGCAC AATGAAGGCG CTCGACCGGC GCGGTCATCC GATCCGGCAG
GATCAGAATG GTGTCTGGGA AGACGGCGCC TTCCGGGCCG CCATCGCCGT CAGCGACCGC
GGTGAACGGC TGCCGACCTC GCTCGCCTAT CTGACATCCG AGGTCCGCAG ACGGCCCAAT
CTGCGTATCG TGACCGAGAG CGTTGCCATT CGGATCCTGT TCGATGGTCG CCGCGCCACG
GGGGCGACCC TTTCGGGTGC TGCGGGAGAG ACGATTCACG CGGCGGAAGT GATCGTTTCT
GCCGGCGCCA TCCATACGCC GGCGCTGCTG CTGCGATCCG GTATCGGTCC GGCCGGCGAT
CTCAATTCTA CCGGGGTGAC GATTGTGGCC GGGTGCGAAG GCGTCGGCCG CAACCTGATG
GAACATCCCT CGATCGCAGT CGCAGCCTAT CTGCCGCCGC ATATGCGGGT GCGCGATCCG
GCCGAGCACC ACGAACAGGC GATCTGGCGT TTCTCATCCG GCCTCGTTGG CACGCCGCAG
GGCGACATGC ATGGCTCCAT CCTGTCGCGC TCGGGTTGGC ATTCGGTCGG CATGCGGCTC
GGTAGCCTGT TCTTCTGGGT CAACAAATCC TATTCGCGCG GCGTCGTGCG GCTCGCCTCG
GCCATCCCGC AGGCCGAGCC GGACGTCGAT TTCCGCATGT TGAGCGACGA GCGAGATCTC
AACCGGCTCA AGCTCGCACT GCGAATGGGA GCAGAGGCTT TGTTGGACCC GTTCCTGGAT
GGCCATCGCG GCACGGTCTT TCCATCGAGC TATTCGCCGC GCGTCGCGAA AGTTGCCGTA
CCGGGCGCAT GGAACGCCCT GCAACGCGGC ATCCTGTCCG GCATGCTCGA CGTGGCCGGA
CCGCTGCGCG CAGCGCTGAT ACATTCGGCG ATCACTCTGG GCACGACGAT GGACGGACTT
TTGGCGGACG ATGTGGCGCT GACGGAATTC GTCCGCCGCC ATGTCGGCGG CACCTGGCAT
GCATCCGGCA CCTGCCGGAT GGGTTCGATA GACGACCCGA TGGCGGTCAC GTCGCCGACC
GGGCGCGTCC ATGGCGTCGA GGGATTGCGC GTCTGTGATG CCTCTTTGAT GCCATCCATT
CCCTGCGCCA ATACGAACAT ACCCACGATC ATGATTGCCG AGCGTATCGC CGATTTCATC
TTGGGCGGCC GCTGA
 
Protein sequence
MIVDHLILGG GSAGCVLAAR LSADPNRRVV LVEAGRNIAP DDIPGDIRSR YPGRAYLDTR 
NIWSQLTALM GYARSNVAPR SSRRYEQARL LGGGSAINAL MANRGAPADY AEWEALGADG
WGWDECLPYF RKIESDRDFN GPLHGQDGPL TIRRISDEKI SPFVDRTMKA LDRRGHPIRQ
DQNGVWEDGA FRAAIAVSDR GERLPTSLAY LTSEVRRRPN LRIVTESVAI RILFDGRRAT
GATLSGAAGE TIHAAEVIVS AGAIHTPALL LRSGIGPAGD LNSTGVTIVA GCEGVGRNLM
EHPSIAVAAY LPPHMRVRDP AEHHEQAIWR FSSGLVGTPQ GDMHGSILSR SGWHSVGMRL
GSLFFWVNKS YSRGVVRLAS AIPQAEPDVD FRMLSDERDL NRLKLALRMG AEALLDPFLD
GHRGTVFPSS YSPRVAKVAV PGAWNALQRG ILSGMLDVAG PLRAALIHSA ITLGTTMDGL
LADDVALTEF VRRHVGGTWH ASGTCRMGSI DDPMAVTSPT GRVHGVEGLR VCDASLMPSI
PCANTNIPTI MIAERIADFI LGGR
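
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the record could be checked, the code below strips that layout, translates a nucleotide sequence, and computes GC content. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate` and `gc_content` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering (first base varies slowest).
# Amino acid assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, copied from the record above.
first_block = clean("ATGATCGTCG ATCATCTGAT CCTTGGGGGA")
print(translate(first_block))  # MIVDHLILGG, the first ten residues of the protein
```

Passing the full gene sequence through `clean` and then `translate` or `gc_content` allows a comparison against the protein sequence and the GC content field reported in the record.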