Gene Rleg_4037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4037 
Symbol 
ID8014842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4115404 
End bp4116711 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content59% 
IMG OID644826606 
ProductEpoxide hydrolase domain protein 
Protein accessionYP_002977817 
Protein GI241206721 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCTA TCAAGACTGA CGTTATCGAC GAGGATCGCC GTCGCCTTCT GGCTGCTGCA 
GCGTCCGGCA TTGCTGCCCT GGGCATCGCA AGTTTGCTTC CCGCGGGTTC GACTGCCGCA
ACGGAATCCG ATGCCATCCG CCCGTTCCGG GTGAACGTTC CAGAGGCTGA TCTCGCCGAC
CTTCGCTATC GCCTTGCTCA CACCCGCCTC CCCGAAAAGG AGACAGTCAG CGATTTCTCC
CAGGGTGTAC CGCTCAAAAC CACCAAGCAG TTGCTCGATC ACTGGCAGAA CAAATACGAC
TGGCGCAAGG TCGAAGCCCG GATCAATGCC GTGCCGAATT TCATCACCGA GATCGATGGG
CTGGACATCC ATTTCATCCA TGTTCGTTCC AAGCACGAGA ACGCGCTTCC TCTGATCGTG
ACCCACGGAT GGCCGGGCTC GATCATTGAG CAATTGAAGA TCATCGGGCC GCTCACCGAC
CCGACAGCCT ACGGTGGCAG TGCCTCGGAT GCATTTCACA TCGTCATCCC GTCGATGCCA
GGATATGGCT TTTCCGGAAA GCCTGATGCG ACCGGCTGGG GACCAGAACG GATAGCGACG
GCATGGATCA CTCTGATGCG GCGTCTGGGC TACAAGCAAT TCGTTGCGCA AGGCGGCGAT
TGGGGCGCAG TCGTGACCGA TATGATCGGT GTGCAGGCTC CTCCGGAATT GCTCGGCATC
CATACCAACA TGCCAGGAGC GATCCCCAAC GACATCAACA ACGCATCCTT TGTCGGAGCC
CCTGCTCCAG CAGGGCTGTC GGACGAAGAA AAAGCCTCCT ACCACCAGCT CGTCTCCTTC
TACAAGAATG TCTACTACGC ATTTCTGATG GGCACGCGTC CCCAGACCCT CACGGGCTTG
TCGGACTCAC CCATCGCACT CGCGACCTAT ATGCTCGATC ATGACAGGGC GAGCCTGGCG
ATGATCGCAC GATCATTCGA CGGCCAGGAT GAGGGTGTGA GCCCTGACGA TGTCCTCGAC
AACGTGACGC TGTTTTGGCT GACAAATACC GGCGTATCCG CCGCGCGGCT CTATTGGGAG
AACAAGCTCG TATTCTTCGC TTCGAAGGGC GTCAAGGTGC CGGTCGCAGT CAGCGTCTTC
CCAGACGAAC TCTACCAGAC GCCCCGCGCC TGGGCCGAGA AGGCTTATCC AAACCTGGTT
CACTACAACA AGCTTCCGAA GGGTGGACAC TTTGCAGCAT GGGAGCAGCC GAAGCTCTTC
ACAGACGAGG TCCGTGTCGG CTTTCGCAGC CTGCGGAAGT CCGGCTGA
 
Protein sequence
MTAIKTDVID EDRRRLLAAA ASGIAALGIA SLLPAGSTAA TESDAIRPFR VNVPEADLAD 
LRYRLAHTRL PEKETVSDFS QGVPLKTTKQ LLDHWQNKYD WRKVEARINA VPNFITEIDG
LDIHFIHVRS KHENALPLIV THGWPGSIIE QLKIIGPLTD PTAYGGSASD AFHIVIPSMP
GYGFSGKPDA TGWGPERIAT AWITLMRRLG YKQFVAQGGD WGAVVTDMIG VQAPPELLGI
HTNMPGAIPN DINNASFVGA PAPAGLSDEE KASYHQLVSF YKNVYYAFLM GTRPQTLTGL
SDSPIALATY MLDHDRASLA MIARSFDGQD EGVSPDDVLD NVTLFWLTNT GVSAARLYWE
NKLVFFASKG VKVPVAVSVF PDELYQTPRA WAEKAYPNLV HYNKLPKGGH FAAWEQPKLF
TDEVRVGFRS LRKSG