Gene Rleg_4655 details


Gene Information

Locus tag: Rleg_4655
Symbol:
ID: 8007133
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 17868
End bp: 19205
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 59%
IMG OID: 644821591
Product: Epoxide hydrolase domain protein
Protein accession: YP_002972851
Protein GI: 241113016
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID:
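
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a protein length that excludes the terminal stop codon; both conventions are assumptions that happen to match the listed values, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(17868, 19205)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
```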


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.246703
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAGT TTGACAATAT TCGCGCCTGG CGCCGGCTTG CCCGCGGAGC AATGGTGATG 
GCGGTTGGAC CGATGTTGCT GTTCCCAGCC GGTTACCTTG TGCCGGCGAG TGCCGCCGAC
GTCACATCGG AAGGGCCGGC GACGCCGATC GAGGCCGACG AAACGATCCG CCCGTTCCAG
ATCCACGTTC CGCAGTCACA GCTTGACGAT CTGCGCAAGC GCATTGCCGA AACGCGTTGG
CCAGACAAGG AGACCGTGAG CGACACCTCG CAAGGCATCC AGCTTTCGCG CGTCCAGGAT
CTGGTCCGTT ACTGGGGCAC TGATTACGAT TGGCGCAAAG CCGAGGCTGA GCTCAATGCA
CTTCCGGAAT TCATCACGAC GATCGACGGG GTCGATATCC AGTTCATCCA TGTGCGATCG
CGTCATCCCA ACGCCCTTCC GGTCATTTTG ACCCATGGTT GGCCGGGTTC GACCTTCGAG
TTCATCAAGG CGATCGGCCC TCTTACCGAT CCGACTGCCT ATGGCGGTAA AGCGGAGGAC
GCATTTGATG TCGTCATCCC TTCCATCCCC GGCTACGGCT TTTCGGGTAA GCCGACGGAG
CTTGGCTGGG GCCCCGACCG CGTTGCGCGA GCATGGGACA TCCTGATGAA GCGGCTCGGC
TACGCGCACT ACGTTTCCCA GGGTGGCGAC CATGGTTCCG TTATCTCCGA CGCGCTGGCG
CGCCAGGCAC CGAAGGGTTT GCTTGGTATC CATCTCAACA TGCCGGCGAC CGTTCCGGGC
AATCTCACCA AGGCGGTCAA CAGTGGAGAC CCGGCTCCCG CAGGGCTGTC GGCGCCCGAG
CGGGATGCCT ATGAATCCCT GAGCACCTTT TTTGGCCGGA ATGCCGCCTA TGGGGCCGTG
ATGGTGACGC GTCCGCAGAC GATCGGCTAC TCGCTTTCCG ACTCGCCGTC GGGCCTAGCT
GCCTGGATCT ACGAAAAATT TGCGCAATGG AGCGATAGCG AGGGCATTCC CGAGCGTGTT
TTTTCCAAGG ACGAAATGCT GAATGACATC ACATTGTACT GGCTGACCAA CACTGGGGCA
TCCTCGTCGC GGTTCTATTG GGAAAACAAC AACAACAACT TCAGCTCAGA CGCCCAGAAG
ACCAAAGAGA TCAAGATCCC GGTGGCAATC AGCGTATTCC CAAAGGAGAT CTACCAGGCG
CCGGAGAGTT GGAGCAAGCA GGCCTATCCC ACGCTGCATT ACTACCACCG TGTCGATATG
GGCGGTCACT TCGCCGCCTG GGAACAGCCC CAACTTTTCG CTGAGGAACT GCGAGAGGCA
TTCAGATCGG TGCGTTGA
 
Protein sequence
MKKFDNIRAW RRLARGAMVM AVGPMLLFPA GYLVPASAAD VTSEGPATPI EADETIRPFQ 
IHVPQSQLDD LRKRIAETRW PDKETVSDTS QGIQLSRVQD LVRYWGTDYD WRKAEAELNA
LPEFITTIDG VDIQFIHVRS RHPNALPVIL THGWPGSTFE FIKAIGPLTD PTAYGGKAED
AFDVVIPSIP GYGFSGKPTE LGWGPDRVAR AWDILMKRLG YAHYVSQGGD HGSVISDALA
RQAPKGLLGI HLNMPATVPG NLTKAVNSGD PAPAGLSAPE RDAYESLSTF FGRNAAYGAV
MVTRPQTIGY SLSDSPSGLA AWIYEKFAQW SDSEGIPERV FSKDEMLNDI TLYWLTNTGA
SSSRFYWENN NNNFSSDAQK TKEIKIPVAI SVFPKEIYQA PESWSKQAYP TLHYYHRVDM
GGHFAAWEQP QLFAEELREA FRSVR
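
The gene and protein sequences above are related by translation table 11, and the GC content figure is derived from the gene sequence. The following is a minimal sketch of both calculations in plain Python, not the method used to generate this record. It assumes the sequence is pasted as text with the space-separated 10-base blocks shown above, and it uses the codon-to-amino-acid assignments of the standard genetic code, which table 11 shares (the two differ only in which codons are permitted as start codons). The names `clean`, `gc_percent`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGAAGAAGT TTGACAATAT TCGCGCCTGG"))
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison against the protein sequence and GC content listed in this record.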