Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1791 |
Symbol | |
ID | 6980528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 1840114 |
End bp | 1841424 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643396512 |
Product | Epoxide hydrolase domain protein |
Protein accession | YP_002281302 |
Protein GI | 209549385 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.133373 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.094605 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACTGC TGACCTCTTT GAACCACAGG CTTCTATCGC GACGCGCGCT GCTGTCAGCG GGTGCCGCGG CCGGCTTATC CGCCGTTCTG CTTCCGCTGG CGTCGGAAGC CGCCGACGTT ATAGGCACCC CAGCCGGGGA AATCCGACCA TTCAGGGCCG ACATCCCCGA GGCAGCGCTT GAGGATCTCA GGCGGCGACT TGCCGAAACC CGATGGCCCG ACGGCGAAAC CGTCACGGAT CGTTCGCAGG GTGTTGAGCC TGACAGGCTG AAGGAGCTGG TGGGCTACTG GCAATCGTCC TACGACTGGC GCAAGGCAGA GAGCCGGCTG AATGCCTTTC CGCAATTTCT CACCAATATC GACGGTGTGG ACATCCATTT CATCCATGTC CGTTCCCGCC ATGAAAACGC CCTGCCGTTG ATCATGACCC ATGGCTGGCC GGGTTCGGTG TTCGAACTGC TCGATGTCAT CGGGCCGCTC ACGGACCCGA CGGCGCATGG CGGTACGGCC GAGGATGCTT TCCACCTGGT GATCCCTTCG ATTCCGGGAT TCGGCTTCTC CGGGAAGCCG TCGACGACGG GTTGGAACCC GCAGCGGATA GCGGCTGCCT GGGACGTGCT GATGAAACGG CTCGACTATA TCAGCTATGT CGCGCAAGGC GGCGACTGGG GCGCCATCAT CAGCGACGCC CTGGGTCGCG AGGCACCCGA TGGGCTGCTC GCCATCCATG TCAACAGGAT CGAGCGGGCG ACGACGTTCC CATCGGACGC AGCCCAGGCT CTTAGAAATG GAGGGACGGC TCCCGACAAT CTGTCTGCGG ACGAGAAGCT CGTCTTCGAC GAGGCGCGGA ACTTCCTCAA CAACGGCTTC GGCTATGCCG CGATCATGAG CACACGTCCG GAGACAGTCG GTTACGGCAT TGCGGATTCG CCAGTTGGCC TTGCCGCCTG GCTTTACGAC AAGATCGCCG ACTGGGTGTT CACCCGAGGC GATCCGGAAC AGGCGCTTGG CAAGGAGGCG ATCCTCGACA ATATCACGCT GTACTGGCTG ACGAACACCG GCCCCTCGAG TGGCCGCATC TATTTCGAAA ACGCCATGGC AGGCGCGAAG CTCTCGGAGG TCAAAGTGCC GGTCGCCGTC ACCATATTCC CCGGAGAGGT CTACAAACCG CCGAAGCACT GGTTGTCGAA GGCCTATCCG AAGCTGGTGT ACTATAACCG CGCGTCCAAG GGCGGCCACT TCGCGGCCTG GGAGGAGCCG GAACTCTTCA GTCAGGAGAT CAGGGCAGGG TTCAAAACGG TGCGATCATG A
|
Protein sequence | MALLTSLNHR LLSRRALLSA GAAAGLSAVL LPLASEAADV IGTPAGEIRP FRADIPEAAL EDLRRRLAET RWPDGETVTD RSQGVEPDRL KELVGYWQSS YDWRKAESRL NAFPQFLTNI DGVDIHFIHV RSRHENALPL IMTHGWPGSV FELLDVIGPL TDPTAHGGTA EDAFHLVIPS IPGFGFSGKP STTGWNPQRI AAAWDVLMKR LDYISYVAQG GDWGAIISDA LGREAPDGLL AIHVNRIERA TTFPSDAAQA LRNGGTAPDN LSADEKLVFD EARNFLNNGF GYAAIMSTRP ETVGYGIADS PVGLAAWLYD KIADWVFTRG DPEQALGKEA ILDNITLYWL TNTGPSSGRI YFENAMAGAK LSEVKVPVAV TIFPGEVYKP PKHWLSKAYP KLVYYNRASK GGHFAAWEEP ELFSQEIRAG FKTVRS
|
| |