Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4037 |
Symbol | |
ID | 8014842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4115404 |
End bp | 4116711 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644826606 |
Product | Epoxide hydrolase domain protein |
Protein accession | YP_002977817 |
Protein GI | 241206721 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCTA TCAAGACTGA CGTTATCGAC GAGGATCGCC GTCGCCTTCT GGCTGCTGCA GCGTCCGGCA TTGCTGCCCT GGGCATCGCA AGTTTGCTTC CCGCGGGTTC GACTGCCGCA ACGGAATCCG ATGCCATCCG CCCGTTCCGG GTGAACGTTC CAGAGGCTGA TCTCGCCGAC CTTCGCTATC GCCTTGCTCA CACCCGCCTC CCCGAAAAGG AGACAGTCAG CGATTTCTCC CAGGGTGTAC CGCTCAAAAC CACCAAGCAG TTGCTCGATC ACTGGCAGAA CAAATACGAC TGGCGCAAGG TCGAAGCCCG GATCAATGCC GTGCCGAATT TCATCACCGA GATCGATGGG CTGGACATCC ATTTCATCCA TGTTCGTTCC AAGCACGAGA ACGCGCTTCC TCTGATCGTG ACCCACGGAT GGCCGGGCTC GATCATTGAG CAATTGAAGA TCATCGGGCC GCTCACCGAC CCGACAGCCT ACGGTGGCAG TGCCTCGGAT GCATTTCACA TCGTCATCCC GTCGATGCCA GGATATGGCT TTTCCGGAAA GCCTGATGCG ACCGGCTGGG GACCAGAACG GATAGCGACG GCATGGATCA CTCTGATGCG GCGTCTGGGC TACAAGCAAT TCGTTGCGCA AGGCGGCGAT TGGGGCGCAG TCGTGACCGA TATGATCGGT GTGCAGGCTC CTCCGGAATT GCTCGGCATC CATACCAACA TGCCAGGAGC GATCCCCAAC GACATCAACA ACGCATCCTT TGTCGGAGCC CCTGCTCCAG CAGGGCTGTC GGACGAAGAA AAAGCCTCCT ACCACCAGCT CGTCTCCTTC TACAAGAATG TCTACTACGC ATTTCTGATG GGCACGCGTC CCCAGACCCT CACGGGCTTG TCGGACTCAC CCATCGCACT CGCGACCTAT ATGCTCGATC ATGACAGGGC GAGCCTGGCG ATGATCGCAC GATCATTCGA CGGCCAGGAT GAGGGTGTGA GCCCTGACGA TGTCCTCGAC AACGTGACGC TGTTTTGGCT GACAAATACC GGCGTATCCG CCGCGCGGCT CTATTGGGAG AACAAGCTCG TATTCTTCGC TTCGAAGGGC GTCAAGGTGC CGGTCGCAGT CAGCGTCTTC CCAGACGAAC TCTACCAGAC GCCCCGCGCC TGGGCCGAGA AGGCTTATCC AAACCTGGTT CACTACAACA AGCTTCCGAA GGGTGGACAC TTTGCAGCAT GGGAGCAGCC GAAGCTCTTC ACAGACGAGG TCCGTGTCGG CTTTCGCAGC CTGCGGAAGT CCGGCTGA
|
Protein sequence | MTAIKTDVID EDRRRLLAAA ASGIAALGIA SLLPAGSTAA TESDAIRPFR VNVPEADLAD LRYRLAHTRL PEKETVSDFS QGVPLKTTKQ LLDHWQNKYD WRKVEARINA VPNFITEIDG LDIHFIHVRS KHENALPLIV THGWPGSIIE QLKIIGPLTD PTAYGGSASD AFHIVIPSMP GYGFSGKPDA TGWGPERIAT AWITLMRRLG YKQFVAQGGD WGAVVTDMIG VQAPPELLGI HTNMPGAIPN DINNASFVGA PAPAGLSDEE KASYHQLVSF YKNVYYAFLM GTRPQTLTGL SDSPIALATY MLDHDRASLA MIARSFDGQD EGVSPDDVLD NVTLFWLTNT GVSAARLYWE NKLVFFASKG VKVPVAVSVF PDELYQTPRA WAEKAYPNLV HYNKLPKGGH FAAWEQPKLF TDEVRVGFRS LRKSG
|
| |