Gene Rleg2_0408 details


Gene Information

Locus tag: Rleg2_0408
Symbol:
ID: 6979123
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 420213
End bp: 421193
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 57%
IMG OID: 643395121
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_002279933
Protein GI: 209548016
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
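
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record states neither convention, although both fit the listed numbers. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 420213
end_bp = 421193
gene_length_bp = 981
protein_length_aa = 326

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes one stop codon,
# which is not counted as an amino acid.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp)              # 981
print(gene_length_bp % 3)   # 0, the length is a whole number of codons
print(expected_protein_aa)  # 326
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.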


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.253997
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAG CGCTTGTCAC CGGGGCAGAT GGGTTCATCG GTTCTCATCT CGTCGAAACG 
CTGGTCAGGT CGGGTGTCGA GGTTCGCGCC CTCTGCCAGT ATAACTCGTT TTCCAGTTGG
GGCTGGCTCG ACCAATCGGA ATATCGCGGC AAGTTCGAGG TGATCCTCGG AGACGTCCGT
GATCCCGCCC AGATGCGCTC CGTCGCCAGA GATGTCGACA CGATCTTCCA TCTTGCCGCT
CTGATTGCGA TTCCCTATTC CTATCAGGCT CCCTCGAGCT ACATCGATAC CAATGTGCAT
GGGACGCTGA ACGTTCTTCA AGGCGCTCTC GACGCCGGCG TCGGAAGAGT GATCCAGACC
TCGACGAGCG AAGTCTATGG GACGGCACGT TTTGTGCCCA TCAGCGAAAG CCATCCTTTG
CAGGCGCAGT CGCCCTATTC GGCGTCCAAA ATCGGTGCCG ATGCGATCGC CTACAGCTAT
CACTCGAGCT TCGATCTGCC GGTGACGATC GCACGGCCGT TCAACACCTA CGGCCCGAGA
CAATCTGCAA GGGCGGTTAT TCCAACCGTG ATTTCGCAGC TTCTGAGTGG ACGAAGGACG
CTCAAGCTTG GTGCGCTCTC GCCCACCCGG GATTTCAATT TCGTGCAGGA TACATGCGAC
GGCTTTCTGG CGCTCGCGGC CTGCGACAAA GCCATCGGAC AGACGGTCAA TATCGGCTCG
GGCGGCGAGA TATCGATCGG CGATACCGTT CGGCTGATCG CCGATATCAT CGGCGTCAAC
ATCGAGATCG AATGCGACGA ACAGCGTTTG CGTCCGGCAA ACAGCGAAGT GGAACGCCTG
TGCTGTGATA ACAGCCTGAT CAAGTCTCTG ACGGGATTCT CGCCTCGCTA CAGCTTGAAA
GACGGGCTCC AGGCGACGAT TGAATGGCTG CGTCAGCCCG AGAATCTGGC GCGGTATAAG
GCGGATATTT TCAATGTCTA G
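
The GC content field in the gene information is presumably the share of G and C bases in this gene sequence, rounded to a whole percent; the record does not say how the figure was computed. A minimal sketch of that calculation follows, with `gc_content` as a hypothetical helper name. It strips the spaces and line breaks that group the sequence into blocks of ten before counting.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Drop the whitespace used to group the sequence into blocks of ten.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence above (9 of 20 bases are G or C).
print(gc_content("ATGAAGAAAG CGCTTGTCAC"))  # 45.0
```

Passing the full gene sequence to the same function would give the value to compare against the listed GC content.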
 
Protein sequence
MKKALVTGAD GFIGSHLVET LVRSGVEVRA LCQYNSFSSW GWLDQSEYRG KFEVILGDVR 
DPAQMRSVAR DVDTIFHLAA LIAIPYSYQA PSSYIDTNVH GTLNVLQGAL DAGVGRVIQT
STSEVYGTAR FVPISESHPL QAQSPYSASK IGADAIAYSY HSSFDLPVTI ARPFNTYGPR
QSARAVIPTV ISQLLSGRRT LKLGALSPTR DFNFVQDTCD GFLALAACDK AIGQTVNIGS
GGEISIGDTV RLIADIIGVN IEIECDEQRL RPANSEVERL CCDNSLIKSL TGFSPRYSLK
DGLQATIEWL RQPENLARYK ADIFNV
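
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11, as listed in the gene information. The sketch below is one way to reproduce that translation, not the database's own method; `CODON_TABLE` and `translate` are hypothetical names. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons, which does not matter here because the gene begins with ATG).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (third base varies fastest).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the whitespace that groups the sequence into blocks of ten.
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: not part of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGAAGAAAG CGCTTGTCAC CGGGGCAGAT"))  # MKKALVTGAD
```

The output matches the first block of the protein sequence; the final codons of the gene (AAT GTC TAG) translate to NV followed by a stop, matching the end of the protein.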