Gene Rleg2_2374 details

Gene Information

Locus tag: Rleg2_2374
Symbol:
ID: 6981113
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 2436380
End bp: 2437360
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 61%
IMG OID: 643397087
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_002281875
Protein GI: 209549958
COG category: [G] Carbohydrate transport and metabolism
              [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
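
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the listed gene length is consistent with it) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 2436380
end_bp = 2437360

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is the stop
# codon, which does not contribute an amino acid to the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 981 326
```

Both results agree with the Gene Length and Protein Length fields of the record.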


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.79798
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.123393
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTTGA AAGTTCTTTT CATCGGCGGC ACCGGCCAGA TCTCCTATCC CTGCGTCGAG 
CGCGCCGTTG CCGAGGGCCA TCAGGTCAGC GTCTACAATC GCGGCTTGAG AAATGCTGGT
TTGCCCGCAG GGGTGACCTC GATCGTCGGC GAATTGGGAT CGGGCGCCTA TGCGGATCTC
GCCAAGGGCA ATTATGACGT CGTCTGCCAG TTCATCGCCT TCACGCCCGA CCAGGTCGCC
CGCGACATCG AGGTGTTCTC GGGCAGTTGC GGCCAGTATA TCTTTATCTC TTCGGCCTCG
GTCTATGAAA AGCCGCCGCG TCACTACGTG ATCACCGAGG AGACGCCGGC GATCAATCCC
CACTGGCCGT ATAGCCAGGC GAAGATCGCC TGCGAGGAAC TGCTCAAACA GTCCGCGAAT
CTCGCGTGCA CGATCGTCCG CCCCAGCCAC ACCGTTCGCA CCGGCCTGCC GATCATGATG
GGCGATAGCG ATGTCATGGC AAGACGCATG CTGGATGGCG AGCCCATCAT CGTGGCGGGC
GACGGCCACA CGCCCTGGAC GCTGACTCGC TCAATCGATT TCGCCGTGCC TTTCGTCGGC
CTGTTCGGCA AGCAGGCGGC GCTGAACAAG ATTTTCCACA TCACCTCCGA CCGTGCGCAT
ATCTGGGACG ATATCCAGAA GACGATCGCA AGGTTGCTCG GCGTCGAGGC GAAGATCGTC
CACGTGCCGA CGGACACGCT GATCCGGTAC AATCCGGAAT GGGTCGGCCC GCTCTTGGGC
GACAAGGCCT GGACGGCGAT CTTCGACAAT TCGAAGGTCA AGCGCGTGGC GGGCGACTTC
ACCTGCGCCG AGAACCTCGA TGAAATCCTC GCCGAGCCGA TCATGCACCT CAAGCAGCGC
CTCGCCAAAA GCCGTCCGCC GAAGGGTGAA GTCGATGCTC TGATCGACCG GATTTGCGCC
GACCAAAGCG CTCTCGGTTA G
 
Protein sequence
MALKVLFIGG TGQISYPCVE RAVAEGHQVS VYNRGLRNAG LPAGVTSIVG ELGSGAYADL 
AKGNYDVVCQ FIAFTPDQVA RDIEVFSGSC GQYIFISSAS VYEKPPRHYV ITEETPAINP
HWPYSQAKIA CEELLKQSAN LACTIVRPSH TVRTGLPIMM GDSDVMARRM LDGEPIIVAG
DGHTPWTLTR SIDFAVPFVG LFGKQAALNK IFHITSDRAH IWDDIQKTIA RLLGVEAKIV
HVPTDTLIRY NPEWVGPLLG DKAWTAIFDN SKVKRVAGDF TCAENLDEIL AEPIMHLKQR
LAKSRPPKGE VDALIDRICA DQSALG
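
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the method used by the database: it uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs mainly in permitting alternative start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each position.
# Assumption: these also apply to internal codons under translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, and its final line (which is in
# frame because the preceding 960 bases are a multiple of three).
first_block = "ATGGCTTTGA AAGTTCTTTT CATCGGCGGC"
last_block = "GACCAAAGCG CTCTCGGTTA G"

print(translate(first_block))  # MALKVLFIGG
print(translate(last_block))   # DQSALG
```

The two outputs match the first ten and last six residues of the protein sequence listed above.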