Gene Rleg2_5818 details


Gene Information

Locus tag: Rleg2_5818
Symbol:
ID: 6977207
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011366
Strand:
Start bp: 228786
End bp: 230381
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 64%
IMG OID: 643393273
Product: glucose-methanol-choline oxidoreductase
Protein accession: YP_002278091
Protein GI: 209546201
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
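
The start, end, gene length, and protein length values above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. Both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(228786, 230381)
print(length)                  # 1596
print(protein_length(length))  # 531
```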


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.673966
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0679471
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCGGT ACGACTATAT CATCATCGGG GCGGGCAGTG CCGGCTGCGT ACTTGCCAAC 
CGGCTGTCGG CCGATGGCAG GAGCCGGGTG CTGCTGTTGG AAGCCGGCGG CAGCGACAAT
TACCACTGGA TCCATATCCC GGTCGGTTAT CTCTATTGCA TCAATAATCC GCGCACCGAC
TGGTGTTTCA CCACGGCGCC GGAAGCCGGA TTGAACGGCC GGGCGCTGAG TTATCCCCGC
GGCAAGGTGC TCGGCGGCTG CTCGTCGATC AACGGCATGA TCTATATGCG CGGCCAGGCG
CGGGACTATG ATCTTTGGCG GCAGATGGGC TGCAGCGGTT GGGGCTGGGA CGATGTTCTG
CCCTTCTTCC GCAAGTCCGA GGATTTCTAT CGCGGCGCCG ACGACATGCA CGGCGCCGGC
GGCGAATGGC GCATCGAAAG GGCGCGCGTG CGCTGGGCCG TGCTCGACGC CTTCCAGCAG
GCGGCGCGAG AGGCAGGCAT TCCAGAGACG GCGGATTTCA ACCGCGGCAG CAATGAAGGG
TCCGGCTATT TCGACGTCAA CCAGCGTTCC GGCATTCGCT GGAACACCTC GAAAGCCTTC
CTGCGCCCGG CGCGGAAACG CTCCAATCTG ACCGTGCTGA TCAAGGCGCA GGTGCGGCGG
TTGCTGGTCG AGGAGGGGGC CGTCGCCGGC GTCGAATACC AGCACAATGG CGTGGCGAAA
CGCGCCTATG CGGGCAAGGA AACCATTCTG TCGGCCGGTT CGATCGGCTC GCCGCATGTT
CTGGAACTCT CGGGCATCGG CAGGGGCGAG GTTCTCCAGC GGGCAGGCGT CGATGTCATC
ACCGAGGTCA AGGGCATCGG CGAGAACCTG CAGGACCATC TGCAACTGCG GCTCGCCTAT
AAGGTGACCG GCGTTCCGAC GCTGAACGAG AAGGCGACGA AGCTGATCGG CAAGGCGGCG
ATCGGGCTCG AATATCTCGT CCGCCGCTCC GGGCCGATGG CGATGGCGCC GAGCCAGCTT
GGCATCTTCA CCCGCTCGGG GCCGGACCGG GAAACGCCCG ACCTGCAATA TCACGTGCAG
CCGGTCTCGC TGGAGAAGTT CGGCGATCCC GTCCATCCTT TCCCGGCAAT CACCGCAAGC
GTCTGCAATC TGAGGCCGGA AAGCCGCGGT TCGGTGCATC TGTCGAGCCC GGATTTTGCC
GCCCAGCCGA CGATCAGCCC GAAATACCTC TCGACGCAGC GCGATCGTGA CATAGCTGTC
CGTTCGATAC GATTGACGCG CAAGATCGTC GCCCAGCCTT CCTTCGCCAG GTTCAAGCCG
GAGGAATTCA AGCCGGGGCC GAGCTATCAG ACCGAGGCCG ATCTGGAGCG GGCGGCGGGC
GAAATCGGCA CGACGATCTT CCATCCCGTC GGCACCTGCC GCATGGGCGC CGACCGGGAC
AGCGTCGTCG ATCCCCGGCT GAAACTGCGG GCGCTCGGCA AGCTCAGGAT CGCCGACGCC
TCGGTGATGC CGTCGATCAC CTCAGGCAAC ACCAATTCGC CGACGATCAT GATCGCCGAA
AAGGCAGCGG CGATGATCCT CGAAGACAAT CGATAG
 
Protein sequence
MDRYDYIIIG AGSAGCVLAN RLSADGRSRV LLLEAGGSDN YHWIHIPVGY LYCINNPRTD 
WCFTTAPEAG LNGRALSYPR GKVLGGCSSI NGMIYMRGQA RDYDLWRQMG CSGWGWDDVL
PFFRKSEDFY RGADDMHGAG GEWRIERARV RWAVLDAFQQ AAREAGIPET ADFNRGSNEG
SGYFDVNQRS GIRWNTSKAF LRPARKRSNL TVLIKAQVRR LLVEEGAVAG VEYQHNGVAK
RAYAGKETIL SAGSIGSPHV LELSGIGRGE VLQRAGVDVI TEVKGIGENL QDHLQLRLAY
KVTGVPTLNE KATKLIGKAA IGLEYLVRRS GPMAMAPSQL GIFTRSGPDR ETPDLQYHVQ
PVSLEKFGDP VHPFPAITAS VCNLRPESRG SVHLSSPDFA AQPTISPKYL STQRDRDIAV
RSIRLTRKIV AQPSFARFKP EEFKPGPSYQ TEADLERAAG EIGTTIFHPV GTCRMGADRD
SVVDPRLKLR ALGKLRIADA SVMPSITSGN TNSPTIMIAE KAAAMILEDN R
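
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the CDS. It is an illustration rather than the method used to produce this record. It uses the amino acid assignments of the standard genetic code, which translation table 11 shares; table 11 differs in its permitted start codons, and this sketch does not handle alternative starts (this gene begins with ATG). All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base, then second, then third, each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(text: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(text)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X for unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence give the first block of the protein.
print(translate("ATGGATCGGT ACGACTATAT CATCATCGGG"))  # MDRYDYIIIG
# Last nine bases of the gene sequence: two residues, then the TAG stop codon.
print(translate("AATCGATAG"))                         # NR
```

Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and protein sequence listed in this record to be checked independently.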