Gene Rleg2_1791 details


Gene Information

Locus tag: Rleg2_1791
Symbol:
ID: 6980528
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 1840114
End bp: 1841424
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 62%
IMG OID: 643396512
Product: Epoxide hydrolase domain protein
Protein accession: YP_002281302
Protein GI: 209549385
COG category:
COG ID:
TIGRFAM ID:
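
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields above.
length_bp = gene_length(1840114, 1841424)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values listed for Rleg2_1791 this gives 1311 bp and 436 aa, matching the Gene Length and Protein Length fields.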


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.133373
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.094605
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACTGC TGACCTCTTT GAACCACAGG CTTCTATCGC GACGCGCGCT GCTGTCAGCG 
GGTGCCGCGG CCGGCTTATC CGCCGTTCTG CTTCCGCTGG CGTCGGAAGC CGCCGACGTT
ATAGGCACCC CAGCCGGGGA AATCCGACCA TTCAGGGCCG ACATCCCCGA GGCAGCGCTT
GAGGATCTCA GGCGGCGACT TGCCGAAACC CGATGGCCCG ACGGCGAAAC CGTCACGGAT
CGTTCGCAGG GTGTTGAGCC TGACAGGCTG AAGGAGCTGG TGGGCTACTG GCAATCGTCC
TACGACTGGC GCAAGGCAGA GAGCCGGCTG AATGCCTTTC CGCAATTTCT CACCAATATC
GACGGTGTGG ACATCCATTT CATCCATGTC CGTTCCCGCC ATGAAAACGC CCTGCCGTTG
ATCATGACCC ATGGCTGGCC GGGTTCGGTG TTCGAACTGC TCGATGTCAT CGGGCCGCTC
ACGGACCCGA CGGCGCATGG CGGTACGGCC GAGGATGCTT TCCACCTGGT GATCCCTTCG
ATTCCGGGAT TCGGCTTCTC CGGGAAGCCG TCGACGACGG GTTGGAACCC GCAGCGGATA
GCGGCTGCCT GGGACGTGCT GATGAAACGG CTCGACTATA TCAGCTATGT CGCGCAAGGC
GGCGACTGGG GCGCCATCAT CAGCGACGCC CTGGGTCGCG AGGCACCCGA TGGGCTGCTC
GCCATCCATG TCAACAGGAT CGAGCGGGCG ACGACGTTCC CATCGGACGC AGCCCAGGCT
CTTAGAAATG GAGGGACGGC TCCCGACAAT CTGTCTGCGG ACGAGAAGCT CGTCTTCGAC
GAGGCGCGGA ACTTCCTCAA CAACGGCTTC GGCTATGCCG CGATCATGAG CACACGTCCG
GAGACAGTCG GTTACGGCAT TGCGGATTCG CCAGTTGGCC TTGCCGCCTG GCTTTACGAC
AAGATCGCCG ACTGGGTGTT CACCCGAGGC GATCCGGAAC AGGCGCTTGG CAAGGAGGCG
ATCCTCGACA ATATCACGCT GTACTGGCTG ACGAACACCG GCCCCTCGAG TGGCCGCATC
TATTTCGAAA ACGCCATGGC AGGCGCGAAG CTCTCGGAGG TCAAAGTGCC GGTCGCCGTC
ACCATATTCC CCGGAGAGGT CTACAAACCG CCGAAGCACT GGTTGTCGAA GGCCTATCCG
AAGCTGGTGT ACTATAACCG CGCGTCCAAG GGCGGCCACT TCGCGGCCTG GGAGGAGCCG
GAACTCTTCA GTCAGGAGAT CAGGGCAGGG TTCAAAACGG TGCGATCATG A
 
Protein sequence
MALLTSLNHR LLSRRALLSA GAAAGLSAVL LPLASEAADV IGTPAGEIRP FRADIPEAAL 
EDLRRRLAET RWPDGETVTD RSQGVEPDRL KELVGYWQSS YDWRKAESRL NAFPQFLTNI
DGVDIHFIHV RSRHENALPL IMTHGWPGSV FELLDVIGPL TDPTAHGGTA EDAFHLVIPS
IPGFGFSGKP STTGWNPQRI AAAWDVLMKR LDYISYVAQG GDWGAIISDA LGREAPDGLL
AIHVNRIERA TTFPSDAAQA LRNGGTAPDN LSADEKLVFD EARNFLNNGF GYAAIMSTRP
ETVGYGIADS PVGLAAWLYD KIADWVFTRG DPEQALGKEA ILDNITLYWL TNTGPSSGRI
YFENAMAGAK LSEVKVPVAV TIFPGEVYKP PKHWLSKAYP KLVYYNRASK GGHFAAWEEP
ELFSQEIRAG FKTVRS
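
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative only and makes two assumptions: that the sequence is given 5' to 3' on the coding strand, and that translation table 11 assigns the same amino acids to codons as the standard code (its differences concern alternative start codons, which do not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks and the final blocks of the gene sequence above.
print(translate("ATGGCACTGC TGACCTCTTT GAACCACAGG"))
print(translate("TTCAAAACGG TGCGATCATG A"))
```

The first call covers the first ten codons of the gene and the second covers the last seven, including the terminal TGA stop codon. Pasting the full gene sequence into `translate` should yield the 436-residue protein listed above, provided the assumptions hold.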