Gene Rleg2_1383 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1383 
Symbol 
ID6980111 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1404724 
End bp1406379 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content63% 
IMG OID643396104 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_002280903 
Protein GI209548986 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00471975 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTTCG ATTATATCAT CACAGGCGCC GGTCCCGCGG GCTGCGTTCT TGCAAGCCGG 
CTGAGCGAAG ATCCCGATAT CCGCGTCCTC CTGCTCGAAG CAGGCGGCGG CGACTGGAAT
CCCCTCTTTC ACATGCCGGC GGGCTTCGCC AAGATGACCA AGGGCGTGGC CAGCTGGGGA
TGGCAAACCG TTCCCCAGAA GCACATGAAA GACCGGGTGC TTCGCTATAC CCAGGCGAAG
GTCATCGGCG GCGGCTCTTC GATCAACGCC CAGCTCTATA CGCGTGGAAA TGCGGCCGAT
TACGACCTCT GGGCCAGTGA AGACGGCTGC GAGGGCTGGG ACTATCGCTC GATCCTGCCC
TATTTCAAAC GCGCCGAGGA CAATCAGCGC TTCGCCGACG ACTACCACGC CTATGGCGGT
CCGCTGGGGG TCTCGATGCC GGCAGCACCC CTGCCGATCT GCGACGCCTA TATCCGTGCA
GGTCAGGAGC TCGGCATTCC CTATAATCAC GACTTCAACG GCCGCCAGCA GGCGGGTGTT
GGATTTTATC AGCTGACGCA GCGCAATCGC CGCCGGTCCT CGGCATCGCT TGCCTATCTC
TCGCCGATCA AGGAGCGGAA GAACCTGACG GTCAGGACAG GCGCGCGTGT TACGCGGATC
ATTGTCGAAG GAGGCCGTGC GACAGGCGTC GAGATCGCCA CCGCGGGCGG CTCGGAGATC
GTGCGCGCCG AGCGTGAGGT CCTGGTGTCG TCAGGCGCGA TCGGATCGCC GAAGCTTCTT
CTGCAGTCCG GCATCGGGCC GGCCGATCAT CTGAAATCGG TGGGCGTCAA GGTGAACCAC
GATCTGCCCG GTGTCGGCGG CAACCTGCAG GATCACCTGG ACCTTTTCGT CATCGCCGAA
TGCACCGGCG ATCACACCTA TGACGGCGTT GCAAAGCTTC ACCGCACACT CTGGGCCGGG
GTTCAATATG TCCTGTTCCG CACCGGTCCT GTGGCGTCGT CGCTCTTCGA GACCGGCGGC
TTCTGGTATG CCGATCCCGA GGCTCGGTCT CCCGATATCC AGTTTCATCT CGGCTTGGGA
TCGGGCATCG AAGCCGGCGT CGAGCGGCTC AAAAATGCGG GCGTGACGCT GAATTCCGCC
TATCTGCATC CGCGCTCCCT CGGGACGGTG CGTCTGTCAT CCGCCGACCC GGCTGCCGCG
CCCCTGATCG ATCCGAACTA CTGGTCCGAT CCGCACGACA GGCAGATGTC GCTGGAGGGG
CTCAAGATTG CCCGCGAGAT CATGCAGCAG GCGGCGCTGA AGCCCTTTGT CATGGCCGAA
AGGCTTCCCG GGCCGAAGGT CATGACCGAC GAGCAGCTGT TCGACTATGG CTGCGCCAAT
GCAAAGACCG ACCATCATCC TGTGGGCACC TGCAAGATGG GAACGGGGCC CGACGCCGTT
GTCGGGCTGG ATCTCAAGGT TCATGGCCTC GAAGGCCTCC GCGTCTGCGA TTCCTCGGTC
ATGCCGCGCG TGCCGTCCTG CAATACCAAC GCGCCGACGA TCATGGTCGG CGAAAAGGGA
TCGGACCTGA TCAGGGGCTT GCCTGCGCTG CCGTCGGCAA TCTTTGCCTA TGAACGCAAC
GATGTCAGGC CTCGGGCCCG CGCCGAAATC CGCTAA
 
Protein sequence
MGFDYIITGA GPAGCVLASR LSEDPDIRVL LLEAGGGDWN PLFHMPAGFA KMTKGVASWG 
WQTVPQKHMK DRVLRYTQAK VIGGGSSINA QLYTRGNAAD YDLWASEDGC EGWDYRSILP
YFKRAEDNQR FADDYHAYGG PLGVSMPAAP LPICDAYIRA GQELGIPYNH DFNGRQQAGV
GFYQLTQRNR RRSSASLAYL SPIKERKNLT VRTGARVTRI IVEGGRATGV EIATAGGSEI
VRAEREVLVS SGAIGSPKLL LQSGIGPADH LKSVGVKVNH DLPGVGGNLQ DHLDLFVIAE
CTGDHTYDGV AKLHRTLWAG VQYVLFRTGP VASSLFETGG FWYADPEARS PDIQFHLGLG
SGIEAGVERL KNAGVTLNSA YLHPRSLGTV RLSSADPAAA PLIDPNYWSD PHDRQMSLEG
LKIAREIMQQ AALKPFVMAE RLPGPKVMTD EQLFDYGCAN AKTDHHPVGT CKMGTGPDAV
VGLDLKVHGL EGLRVCDSSV MPRVPSCNTN APTIMVGEKG SDLIRGLPAL PSAIFAYERN
DVRPRARAEI R