Gene Information |
Locus tag | Rleg2_1449 |
Symbol | |
ID | 6980177 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1472102 |
End bp | 1472782 |
Gene Length | 681 bp |
Protein Length | 226 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643396170 |
Product | HAD-superfamily hydrolase, subfamily IA, variant 3 |
Protein accession | YP_002280969 |
Protein GI | 209549052 |
COG category | [R] General function prediction only |
COG ID | [COG0637] Predicted phosphatase/phosphohexomutase |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
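The coordinate and length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the record does not state either point explicitly. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1472102
end_bp = 1472782

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets. Assumption: the last codon is a stop codon,
# which contributes no residue to the protein.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 681 0 226
```

Under these assumptions the result agrees with the listed Gene Length (681 bp) and Protein Length (226 aa).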
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0077861 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCCCAAC CGATGCCCCA TGGCTTCGAG AAGCCTTATG CCGCCTTCCT GTTCGATATG GACGGCACAA TCCTCAATTC GATCCGTGCG GCCGAACGTG TCTGGAGCGA CTGGGCAAGA CGTCAGGGGC TCGATGTCGC CGACTTTCTG CCGAAGATGC ATGGATCGCG CGGCGTCGAC ACGATCGCCC GGCTGAACCT GCCGGGCGTC GATCCCGAAC ATGAAGCCAG GCTGGTGACC GAGGCCGAGA TCGCCGATGT CGGCGATGTC GTCGCCATTT CGGGTGCCGC GGCATTCCTG AGTTCGCTGC CGCCGGATCG CTGGGCGATC GTCACCTCCT CACCCTTGCG TCTTGCCCGC CGGCGGCTGG AGGCCGCAGG CCTGTCGCTG CCGAAATTCA TGGTGACGGC TGAAGATGTG AAGGTCGGAA AACCCGATCC GCAATGTTAT ATTCTGGGCG CCGAACGCCT CGGCGTCAGC ACGCAGGACT GCCTGGTGTT CGAGGATGTC GCTGCCGGCA TTCTCGCCGG CGAGGCGGCG GGCGCCGATG TCATGGTGGT GACGGCAACC CATCATGACA AGATGGAAAC GCCGCATCCG ACGCTGTCAT CCTACGACGA GATCTCGGTC CGCATCTCGG CCGACCATAA AATGTTCATC GTGCCGAAGG CGGCCGGATA A
|
Protein sequence | MSQPMPHGFE KPYAAFLFDM DGTILNSIRA AERVWSDWAR RQGLDVADFL PKMHGSRGVD TIARLNLPGV DPEHEARLVT EAEIADVGDV VAISGAAAFL SSLPPDRWAI VTSSPLRLAR RRLEAAGLSL PKFMVTAEDV KVGKPDPQCY ILGAERLGVS TQDCLVFEDV AAGILAGEAA GADVMVVTAT HHDKMETPHP TLSSYDEISV RISADHKMFI VPKAAG
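The gene and protein sequences above can be checked against each other by translating the gene sequence with translation table 11 (the bacterial, archaeal and plant plastid code). The following is a minimal standard-library sketch, not the method used to produce this record. The helper `translate_cds` and all variable names are hypothetical. It relies on two properties of table 11: its codon-to-residue assignments match the standard code, and it permits alternative start codons such as TTG, which are read as methionine at the first position.

```python
BASES = "TCAG"
# Residues for all 64 codons, enumerated in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a complete CDS; hypothetical helper for illustration."""
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    # An alternative start codon at position 1 is read as methionine.
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    # Drop the trailing stop symbol, if present.
    return protein[:-1] if protein.endswith("*") else protein


# Sequences copied from the record; whitespace between blocks is removed.
dna = "".join("""
TTGTCCCAAC CGATGCCCCA TGGCTTCGAG AAGCCTTATG CCGCCTTCCT GTTCGATATG
GACGGCACAA TCCTCAATTC GATCCGTGCG GCCGAACGTG TCTGGAGCGA CTGGGCAAGA
CGTCAGGGGC TCGATGTCGC CGACTTTCTG CCGAAGATGC ATGGATCGCG CGGCGTCGAC
ACGATCGCCC GGCTGAACCT GCCGGGCGTC GATCCCGAAC ATGAAGCCAG GCTGGTGACC
GAGGCCGAGA TCGCCGATGT CGGCGATGTC GTCGCCATTT CGGGTGCCGC GGCATTCCTG
AGTTCGCTGC CGCCGGATCG CTGGGCGATC GTCACCTCCT CACCCTTGCG TCTTGCCCGC
CGGCGGCTGG AGGCCGCAGG CCTGTCGCTG CCGAAATTCA TGGTGACGGC TGAAGATGTG
AAGGTCGGAA AACCCGATCC GCAATGTTAT ATTCTGGGCG CCGAACGCCT CGGCGTCAGC
ACGCAGGACT GCCTGGTGTT CGAGGATGTC GCTGCCGGCA TTCTCGCCGG CGAGGCGGCG
GGCGCCGATG TCATGGTGGT GACGGCAACC CATCATGACA AGATGGAAAC GCCGCATCCG
ACGCTGTCAT CCTACGACGA GATCTCGGTC CGCATCTCGG CCGACCATAA AATGTTCATC
GTGCCGAAGG CGGCCGGATA A
""".split())

listed_protein = "".join("""
MSQPMPHGFE KPYAAFLFDM DGTILNSIRA AERVWSDWAR RQGLDVADFL PKMHGSRGVD
TIARLNLPGV DPEHEARLVT EAEIADVGDV VAISGAAAFL SSLPPDRWAI VTSSPLRLAR
RRLEAAGLSL PKFMVTAEDV KVGKPDPQCY ILGAERLGVS TQDCLVFEDV AAGILAGEAA
GADVMVVTAT HHDKMETPHP TLSSYDEISV RISADHKMFI VPKAAG
""".split())

translated = translate_cds(dna)
gc_percent = 100 * (dna.count("G") + dna.count("C")) / len(dna)

print("lengths:", len(dna), "bp,", len(translated), "aa")
print("translation matches listed protein:", translated == listed_protein)
print("GC content (%):", round(gc_percent, 1))
```

The computed GC percentage can be compared with the GC content field in the Gene Information section. For routine work, an established library such as Biopython would normally be preferred over a hand-built codon table.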
|
| |