Gene Rleg2_6423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_6423 
Symbol 
ID6983494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011371 
Strand
Start bp82326 
End bp83366 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content57% 
IMG OID643399420 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_002284176 
Protein GI209552261 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.544479 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGGAC ACAAGAGAAT TATGGTTACC GGGGGCACCG GTTTTTTGGG ATCATTCCTG 
TGCGAAAGGC TTTTGCGAGA GGGCAATGAC GTCCTCTGCG TCGACAATTA CTACACCGGT
TCGCGCGACA ACGTGCTGCA CCTTCTCGAC GATCCACGCT TTGAGATTCT TCGCCACGAC
ATTACCTTCC CGCTGTACGT GGAGGTCGAC GAGATCTACA ACCTCGCCTG CCCGGCATCT
CCGGTCCACT ATCAGCACGA CCCCGTGCAG ACAGTGAAGA CCAATGTGCA CGGCGCCATC
AACATGCTCG GCTTGGCAAA ACGCACCAAG GCCAAGATCT TCCAGGCATC CACCAGCGAA
GTTTATGGTG ATCCGGCTGT CCACCCTCAA CCCGAGGAGT ATCGAGGCAG CGTCAATCCG
ATCGGCCCCC GGGCATGTTA TGACGAAGGC AAACGCTGCG CTGAAACATT GTTCTTCGAC
TATCATCGTC AATACGGTGT GGAAATCCGG GTGGCGCGGA TCTTCAATAC CTATGGACCG
CGCATGCAGA CCAATGATGG CCGCGTCGTC TCGAACTTCA TCGTTCAGGC GCTTCAAAAC
CAACCGATCA CTATCTTCGG CAACGGCACG CAGACGCGCT CCTTCTGCTA TGTAGACGAT
CTGATCGAGG GCTTCATCCG ACTGATGGGG GCGCCGGCCG GCGTTACGGG TCCGATCAAT
CTCGGTAACC CGGGAGAATT CCAGGTCCGG GAACTGGCCG AAATGGTCAT CGAGATGACG
GGATCGAAAT CAAGCATCGT GTACAATCCT CTGCCGATTG ACGATCCGAC ACAGCGCAAG
CCCGACATCA GTCGCGCAAA GCAGGACCTG GGCTGGCAGC CGACGGTGAA CCTGCGAGAG
GGGCTCGAAA AAACGATCGC GTATTTCGAG TGGAAGCTTT CAGCTGGTGC CAAGAGCGCG
CCTGTCCGGT CCTCGCGAAA GGCTTACACC TATCTGCCTA CCCCGGCCGT CGGCCTTCCT
GTTCAGGAAA CCACACGATA G
 
Protein sequence
MHGHKRIMVT GGTGFLGSFL CERLLREGND VLCVDNYYTG SRDNVLHLLD DPRFEILRHD 
ITFPLYVEVD EIYNLACPAS PVHYQHDPVQ TVKTNVHGAI NMLGLAKRTK AKIFQASTSE
VYGDPAVHPQ PEEYRGSVNP IGPRACYDEG KRCAETLFFD YHRQYGVEIR VARIFNTYGP
RMQTNDGRVV SNFIVQALQN QPITIFGNGT QTRSFCYVDD LIEGFIRLMG APAGVTGPIN
LGNPGEFQVR ELAEMVIEMT GSKSSIVYNP LPIDDPTQRK PDISRAKQDL GWQPTVNLRE
GLEKTIAYFE WKLSAGAKSA PVRSSRKAYT YLPTPAVGLP VQETTR