Gene Rleg_3235 details

Gene Information

Locus tag: Rleg_3235
Symbol:
ID: 8014126
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3241325
End bp: 3242653
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 60%
IMG OID: 644825796
Product: nucleotide sugar dehydrogenase
Protein accession: YP_002977023
Protein GI: 241205927
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1004] Predicted UDP-glucose 6-dehydrogenase
TIGRFAM ID: [TIGR03026] nucleotide sugar dehydrogenase
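
The start, end, gene length and protein length fields in this record are tied together by simple arithmetic, which makes a quick consistency check possible. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS whose final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(3241325, 3242653)
print(length)                  # 1329
print(protein_length(length))  # 442
```

Both results agree with the Gene Length and Protein Length values listed in the record.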


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCATCA CGATGATTGG ATCAGGCTAT GTCGGCCTCG TTTCAGGCGT TTGCTTTGCG 
GATTTCGGCC ACGACGTCAT CTGCGTCGAC AAGGATCTGA GTAAGATCGA AGCCCTTCGC
GAAGGCCGCA TTCCGATCTA CGAGCCGGGT CTGGAACAAT TGGTCGCCGA AAATACCAGC
ACCGGCCGAC TGTCGTTTTC GACGGATGTC GGCGAAAGTG TCCGCAGCGC CGATGTCGTG
TTCATCGCAG TCGGCACGCC GTCCCGGCGC GGCGACGGCC ACGCAGACCT TTCCTATGTC
TATGCCGCTG CACGCGAGAT TGCCACCTAT GTGGAAGGCT TCACGGTCAT CGTCACCAAG
TCGACCGTGC CGGTCGGCAC GGGAGACGAG GTCGAGCGCA TCATGCGCGA AACCAATCCT
GCGGCGGATG TCGCCGTCGT TTCCAATCCG GAATTCCTGC GTGAAGGTGC GGCGATCGAA
GACTTCAAGC GGCCCGACCG TATCGTCATC GGGCTGAACG ACGACCGGGC GCGCGAAACC
ATGACCGAGG TCTACCGCCC GCTCTATCTC AACCAGGCCC CCTTGGTCTT CACCACCCGC
CGCACCTCGG AACTGATCAA ATATGCGGCC AATGCCTTCC TCGCAATGAA GATCACCTTC
ATCAACGAGA TCGCCGATCT CTGCGAACGG GTCGACGCAA ACGTCCAGGA CGTTTCGCGC
GGAATCGGTC TCGACGGCCG TATCGGCTCC AAGTTCCTGC ATGCCGGCCC GGGTTACGGC
GGTTCGTGCT TCCCCAAGGA TACGCTTGCC CTTGCCAAGA CGGCGCAGGA TTACGACGCG
CCGATGCGTC TCATCGAGAC GACGATCTCG ATCAATGACA ACCGCAAGCG GGCAATGGGA
CGCAAAGTCA TTTCGGCCGT CGGCGGAGAC ATTCGCGGCA AGAAGATCGC GATCCTCGGC
CTGACCTTCA AGCCGAACAC CGACGATATG CGCGACAGCC CGGCGATCGC AGTCATCCAG
ACCCTGCAGG ACAACGGAGC CGAAGTGGTT GGCTACGATC CCGAGGGCAT GGAAAACGCC
CGTAAGGTGA TCGAGAACAT CGAATATGCG AGCGGCCCTT ATGAAGCAGC CGCTGGTGCG
GATGCGCTTG TCATCGTCAC CGAATGGAAC CAGTTCCGCG CGCTCGATTT CAATCGCTTG
AAGCAGTCGA TGCGCGCTCC GATCCTGGTC GATCTGCGCA ATATCTACCG CAGCGACGAG
GTCCGCAAAC ACGGCTTTAC CTATACCGGC ATCGGCACCA ACCTTTATCA GGACGTGACC
GGCGCCTGA
 
Protein sequence
MRITMIGSGY VGLVSGVCFA DFGHDVICVD KDLSKIEALR EGRIPIYEPG LEQLVAENTS 
TGRLSFSTDV GESVRSADVV FIAVGTPSRR GDGHADLSYV YAAAREIATY VEGFTVIVTK
STVPVGTGDE VERIMRETNP AADVAVVSNP EFLREGAAIE DFKRPDRIVI GLNDDRARET
MTEVYRPLYL NQAPLVFTTR RTSELIKYAA NAFLAMKITF INEIADLCER VDANVQDVSR
GIGLDGRIGS KFLHAGPGYG GSCFPKDTLA LAKTAQDYDA PMRLIETTIS INDNRKRAMG
RKVISAVGGD IRGKKIAILG LTFKPNTDDM RDSPAIAVIQ TLQDNGAEVV GYDPEGMENA
RKVIENIEYA SGPYEAAAGA DALVIVTEWN QFRALDFNRL KQSMRAPILV DLRNIYRSDE
VRKHGFTYTG IGTNLYQDVT GA
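
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by computing its GC fraction. The sketch below is one way to do this in plain Python, not the method used to build this record. It assumes the standard codon assignments, which translation table 11 shares for elongation (table 11 differs mainly in which codons may act as start codons), and it does not handle alternative start codons or ambiguous bases. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence shown above.
first_blocks = "ATGCGCATCA CGATGATTGG ATCAGGCTAT"
print(translate(first_blocks))  # MRITMIGSGY
```

Passing the full gene sequence to `translate` should yield a string that can be compared with the listed protein sequence once its spaces are removed, and `gc_fraction` can be compared with the reported GC content.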