Gene Information |
Locus tag | Rleg_3747 |
Symbol | |
ID | 8014581 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3797950 |
End bp | 3798639 |
Gene Length | 690 bp |
Protein Length | 229 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644826310 |
Product | HAD-superfamily hydrolase, subfamily IA, variant 3 |
Protein accession | YP_002977529 |
Protein GI | 241206433 |
COG category | [R] General function prediction only |
COG ID | [COG0637] Predicted phosphatase/phosphohexomutase |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
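The length fields in this table can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it matches the reported 690 bp) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length, has_stop_codon=True):
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(3797950, 3798639)
print(length, protein_length_aa(length))  # prints "690 229"
```

The strand value does not affect this arithmetic; it only determines which strand the sequence is read from.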
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.422409 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGATG CTGAAACACG GCTGGTGATC TTCGATTGCG ATGGCGTACT CGTCGACAGC GAGCCGATCT CGATCAGCGT GCTCGTCGGG GCAATGAACG ATCTCGGCGT CTCGATCACC GAGGACCAGG CCTATGAGCG TTTTCTCGGC CGCAGCCTGT CGACCCTCAT CGATACGCTG GAAACCGAAT TCAACGTCCA TGCCGACGAG GAATTCCTCG AGCGTATCCG CATCGAACTC TACGCTCGTT TTCGCACGGA ACTGAAGCCG ATCGACGGTA TCGCCGCGGC GATCGACAGG CTGGGCGTTC GCTGCTGCGT TGCCTCCTCC AGCCAGATGG AGCGAATCCG GTTGTCGCTG TCGGTGACCG GGCTTCTCGA CAGGCTGCCC GACATCTTCA GCGCAACGAT GGTCAAGCGC GGCAAGCCGG CGCCCGATCT CTTCCTGCAT GCGGCGCGTG AAATGCAGGT CGAGCCGGCT CATTGCCTTG TCGTCGAAGA CAGCCCGGCC GGCATTGCCG CCGCCAAGGC AGCAGGCATG ACGGTCTTTG CCTTCACCGG CGGATCACAC GCCAATTTCA CCGGATATCG TGCCGAACTC GACCGCCTTT CGCCTGATGT GGTGTTTGAC GCCATGCCGG ATTTGATACA CCTTGTCCGC AACCATAAGC TGGACGGGAC CAAGACTTGA
Protein sequence | MADAETRLVI FDCDGVLVDS EPISISVLVG AMNDLGVSIT EDQAYERFLG RSLSTLIDTL ETEFNVHADE EFLERIRIEL YARFRTELKP IDGIAAAIDR LGVRCCVASS SQMERIRLSL SVTGLLDRLP DIFSATMVKR GKPAPDLFLH AAREMQVEPA HCLVVEDSPA GIAAAKAAGM TVFAFTGGSH ANFTGYRAEL DRLSPDVVFD AMPDLIHLVR NHKLDGTKT
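The relationship between the two sequences above, and the GC content field, can be checked with a short script. This is a minimal sketch using only the Python standard library. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace that separates 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding sequence, dropping a single terminal stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if residues and residues[-1] == "*":
        residues.pop()
    return "".join(residues)


# First seven codons of the gene sequence followed by its TGA stop codon.
print(translate("ATGGCTGATG CTGAAACACG GTGA"))  # prints "MADAETR"
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) gives a quick consistency check; `gc_percent` on the same input can be compared with the rounded GC content value in the gene table.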