Gene Rleg_4133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4133 
Symbol 
ID8014928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4215702 
End bp4216724 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content60% 
IMG OID644826703 
ProductAldose 1-epimerase 
Protein accessionYP_002977913 
Protein GI241206817 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2017] Galactose mutarotase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.270907 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGATA AGTTGGAGCG GGACGTTTTC GGGCAGACGC AGGCGGGCGA GACCGTCTAT 
CGCGTCGTGA TCAAGGGCGG TGGGCTGACG GCCAAGATCA TCAGCTGGGG CGCGGTCATC
CAGGATCTGC GTCTCGAGGG ACATGATGCG CCGCTGCAGC TCGGCTTTGA CGATTTCGAC
AGCTACCCCC TCTATTCATC CTATTTCGGC GCGACACCCG GCCGCTGCGC CAACCGCGTC
GGCGGCGGCA GGTTTACGCT TGACGACAAG GACTATCAGC TCGAACTGAA TGAAAACGGC
GTGACGCATC TGCATGGCGG CAGCGACAAT ATCGCAAAAC GCAATTGGAC GATTGTCGAG
CATGACGTCG ACCGCGTAGT ACTGAAGATC GTCGATCCCG ATGGCCGCGC CGGCTATCCC
GGCAATTGCA CCATCCAGGC GACTTTCTGG GTGCATGGCA ACGGTGAACT GTCGATCACC
TATGAATCGA CCTGCGACCA GCCGACGCTC GCCAATGTCT GCCAGCACGC CTATTTCAAT
CTCGACGGCC GGGAAGATGC GCTTGGCCAC GATATCATGA TTGCCGCCGA TCGCTATCTG
CCGACCGATG AGAAGCAGGT GCCGACCGGC GAGATCTGTT CCGTCGAGGG CACGGAATTC
GATTTCCGCG AGATGGCGCC GATGAAGCGT TTCGTCGGCA GCGAACAAGC CTTTTACGAC
CATAATTTCT GCCTGTCGGG CGAGCGTACC GCCAAGCGGA GCGTCGCGCT TGCCCGCAGC
CTTTATTCCG GTGTGTCGCT GGAAGTGCGC AGCACCGAGC CAGGCGTGCA GTTCTATGCC
GGCTTCAAGC TCGATACCGC GGCCCCCGGC ATCGGCGGGC GCAAATACGG CCCATTCGCC
GGCTTCTGCC TGGAGACGCA GGTCTGGCCG GATGCCATCA ATCACCAAGG TTTTCCGAAT
GCGGTTCTGC GCCCCGGCGA AGTGCTGCGT CAGGAGACGG ATTATATCTT CACCAAGAAC
TGA
 
Protein sequence
MSDKLERDVF GQTQAGETVY RVVIKGGGLT AKIISWGAVI QDLRLEGHDA PLQLGFDDFD 
SYPLYSSYFG ATPGRCANRV GGGRFTLDDK DYQLELNENG VTHLHGGSDN IAKRNWTIVE
HDVDRVVLKI VDPDGRAGYP GNCTIQATFW VHGNGELSIT YESTCDQPTL ANVCQHAYFN
LDGREDALGH DIMIAADRYL PTDEKQVPTG EICSVEGTEF DFREMAPMKR FVGSEQAFYD
HNFCLSGERT AKRSVALARS LYSGVSLEVR STEPGVQFYA GFKLDTAAPG IGGRKYGPFA
GFCLETQVWP DAINHQGFPN AVLRPGEVLR QETDYIFTKN