Gene Rleg_3501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3501 
Symbol 
ID8014370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3536583 
End bp3537803 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content62% 
IMG OID644826066 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_002977286 
Protein GI241206190 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.409017 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.142539 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATTG CGGTTATGGG CGGAGACGGT TTCATTGGCT GGCCAACGTC GCTGCATCTC 
TCCGATGCCG GGCACGACAT CCATATCCTC GACAATCTCT CGCGCCGCTG GATCGACACC
GAACTCGGCG TTCAGTCGCT GACTCCGATG GATTCGATCC AGGAGCGCAC CCGCATCTGG
CATGCCGAAA CCGGACGCCG CATCCACTTC AATCTGATCG ATCTCGCCAA GGATTACGAA
CTCCTGAAGA ACTGGCTTTC CGAACATCGC CCGGATGCCG TCATCCATTT CGCCGAGCAG
CGGGCCGCGC CCTATTCGAT GAAGAGCGAC CGCCACAAGA ACTACACCGT CAACAACAAT
GTCAGCGCCA CGCACAACCT GCTGAACGCG CTGACGGAAC TCAATCTCGA TGCCCATCTC
ATCCATCTCG GCACCATGGG CGTCTACGGC TATTCGACGG TCGGCGCCGC GATTCCCGAA
GGTTACCTGC CGGTCGGCAT CGAAACCGCA GGCGGCGAGA CGGTCAACCA GGAGATCCTC
TACCCCTCCA ATCCCGGCTC GATCTATCAC ATGACCAAGT GCCTGGATCA GCTTCTCTTC
CAGTTCTATG CTAGGAATGA CGGCCTGAGG GTCACCGACC TGCACCAGGG CATCGTCTGG
GGCACGCATA CCGAGCAGAC GCGCCGCCAT GCGCAGCTGA TCAACCGTTT CGACTATGAC
GGCGACTACG GCACGGTGCT CAACCGCTTT CTCATCCAGG CGGCGATCGG CTATCCGTTG
ACGGTGCACG GTACCGGCGG CCAGACCCGC GCCTTCATCC ACATCCAGGA TTCGGTGCGC
TGCATCGAGC TGGCGCTGAA AAACCCGCCG GCCCGCGGCG CCCGCGTCGA GATCTTCAAC
CAGATGACCG AAACCCACCG GGTGCGCGAC CTCGCCGAGA TGATCGCCAG GATGAGCGGC
GCCAAGATTG CCTGGCTGCC CAACCCGCGC AAGGAAGCCG CCGAGAACGA GCTGATCGTC
CGGAACGAAA AGTTCCGCGA TCTCGGCCTC GAGCCGATCA CGCTGGAAGC AGGCCTGCTC
GGCGAAATCG TCGACGTCGC CAAGAAATTC GCCTATCGCG TCGACCGCGC GCGCGTTCCG
GCCGTCTCCG CCTGGACTAA GGACATCGCC GCGACGATCA ATCACGATCC GGAAGGCAAG
CGGCTGAAAT CCGTCTCGTG A
 
Protein sequence
MKIAVMGGDG FIGWPTSLHL SDAGHDIHIL DNLSRRWIDT ELGVQSLTPM DSIQERTRIW 
HAETGRRIHF NLIDLAKDYE LLKNWLSEHR PDAVIHFAEQ RAAPYSMKSD RHKNYTVNNN
VSATHNLLNA LTELNLDAHL IHLGTMGVYG YSTVGAAIPE GYLPVGIETA GGETVNQEIL
YPSNPGSIYH MTKCLDQLLF QFYARNDGLR VTDLHQGIVW GTHTEQTRRH AQLINRFDYD
GDYGTVLNRF LIQAAIGYPL TVHGTGGQTR AFIHIQDSVR CIELALKNPP ARGARVEIFN
QMTETHRVRD LAEMIARMSG AKIAWLPNPR KEAAENELIV RNEKFRDLGL EPITLEAGLL
GEIVDVAKKF AYRVDRARVP AVSAWTKDIA ATINHDPEGK RLKSVS