Gene Information |
Locus tag | Rleg2_5598 |
Symbol | |
ID | 6978692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 1243552 |
End bp | 1245501 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643394696 |
Product | histidinol-phosphate phosphatase family protein |
Protein accession | YP_002279514 |
Protein GI | 209547596 |
COG category | [E] Amino acid transport and metabolism; [J] Translation, ribosomal structure and biogenesis; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0241] Histidinol phosphatase and related phosphatases; [COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01656] histidinol-phosphate phosphatase family domain; [TIGR01662] HAD-superfamily hydrolase, subfamily IIIA |
|
|
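The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in Protein Length; neither convention is stated in the record, although both agree with its numbers. The function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(1243552, 1245501)
print(length, protein_length(length))  # prints "1950 649"
```

Both results match the Gene Length (1950 bp) and Protein Length (649 aa) fields. Strand does not affect this calculation because only the span of the feature is used.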
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTTCTG AGGACGCGAT CCGCCAGGCG ATCATTATTG CGGGTGGCCT TGGAACACGC GCGCGCAGCA TGACGGGCGA TGCTATCCCG AAGGCCCTTT TGCCGCTCGC CGGCGTGCCG ATCATTCTGC GGCAAATCCG CACCCTGGCT CGCGAAGGTA TCCAGCATGT GCGCGTGCTC GGCGGTCACC TCGGCAGCCA GCTCGAGCCT GCCCTTGGCC CGGAAGCTGA AAAACTCGGT ATCAAGATCG AGGTCTTCGT CGAGAAATCT CCGCTCGGCA CAGCGGGCTG CCTGACGACT TTGGTCATGA CAGCCGGCGA TGTCCTGATC GTCTACGGCG ACATGCTTTT CGATATCGAT CTGTCGGCAC TGACGCGTCA TCGCCAGCAA TTTCCCGCCG CGCTGACCAT CATCGCTCAC CCCAACGACC ATCCCCGCAC ATCCGATCTC GTCGTCCAGA AAAGCGGTTA TCTCCAGCGT CTTTTGCCCC GCAAAACCCC GCGCGATTCG GATTGGCGGA ATTTGGTGCC GGCCGGTTTG TATGTGGCCT CCGAGCAATT TTTCCAGGCG CTCGTGCCGG GTCAAACAGC CGATATGATC CACGACGTCA TTCCCGGTCT CCTCGGACGG TCCGTTCCGA TCGCGATTTA CGACACGCCG GAATATATGA AGGATACCGG GTCGCCGAAC CGGCATGCTG CCGCGGAAGC GGACCTCCGG CAGGAACGGG TTCATGCGGC GCATCTGTCC GTGCAAAGGC CCGCGGTTTT CTTCGATTGC GACGGCGTGC TCAACGAAGA TGTCGGCGGC CATGGCGTCA TACATCCCGA CCAGGTGAAG CTGATCGACC GGGCGGGCGA AGCCGTGCGG CTCGCCCGCG AGGCAGGCTT CCTGACGATT GCGGTCACGA ACAGGCCGCA GGTCGCCAAA GGCTTTCTGG ATGAAGCCGG GCTGGATCAT GCTCTCGGCC GCCTCGAGGC CAAGCTTGCC GAAGATGGCG GCGTCCTGGA TCGCATTTAT TTTTGTCCGC ATCATCCGGA CAAAGGATTT CCCAACGAAG TCGCCGCGCT CAAGATCGAT TGTGCCTGCC GGAAACCGGG CGATCTGATG ATCCGTCAGG CCATGTCGGA ACTGCCTGTC GAAAAATCGA AATCCATCAT CATCGGCGAC AGCCTGCGCG ACATCGGTGC TGGCCGCAAG GCGGGCATCT GGGCCTATGG CGTCCGGACC GGTTATGGCT TGCGGGACGA AAAGAGCTAT CCCACCGTCG AAACAGCGAT ACCGCATGCC GATCTCGTCT TCGACACGGT CTATGACGCC GTCCGCTTCC AATGCGGCTA CCAAGAGATC GGCAAGGCTC TGTCCGGCGC GATTGATGAA CGGCTTTCAG ATACGGCCGG TCCGCTGCTC ATCAGCATAT GCGGCCGTTC CCGCTCCGGG AAAAGTACGT TTGCACACGC CGTTCAACGC GTGCTCTCGG AAACAGGGCG CAGGGTGCTG AGGCTGGAAC TCGACCGCTG GATACTGCCG CTCGAACATC GCCGCCCCGA CATGAACGCG GAAGAGCGCA GCAGAGTAGA GCTCTACCCC GAGATCGTCG ACGTGCTGCG CCGCTCCGGA CAGATCGAAG CGCCCGGCTA TGACGCGGCA AGCCGCGGCC GGCTTAGGGG CACCACCGCC TATGACGCCC GCGATGCCGA CGTCATCCTC CTGGACGGCA TCTTTGCCGG GCATGCGTCG ATCCGCGAAC AGGTCGATAT GTCCGTCTTC GTCGAAGCGT CCGAGCAGAG CCTGCTGAAC 
CGCTTTCACA CGTTCTATGC CTGGAAAGGC CTCACGCCAG TTGCTGCCGA AGAGCTCTGG CAGTCGCGAA TTCAGGAAGA GTGGCCGAGG ATCGATCTGC AGCGCACATC GGCCGATATC GTCATCAATC TCGAGGAGGC AATCCTTTGA
|
Protein sequence | MGSEDAIRQA IIIAGGLGTR ARSMTGDAIP KALLPLAGVP IILRQIRTLA REGIQHVRVL GGHLGSQLEP ALGPEAEKLG IKIEVFVEKS PLGTAGCLTT LVMTAGDVLI VYGDMLFDID LSALTRHRQQ FPAALTIIAH PNDHPRTSDL VVQKSGYLQR LLPRKTPRDS DWRNLVPAGL YVASEQFFQA LVPGQTADMI HDVIPGLLGR SVPIAIYDTP EYMKDTGSPN RHAAAEADLR QERVHAAHLS VQRPAVFFDC DGVLNEDVGG HGVIHPDQVK LIDRAGEAVR LAREAGFLTI AVTNRPQVAK GFLDEAGLDH ALGRLEAKLA EDGGVLDRIY FCPHHPDKGF PNEVAALKID CACRKPGDLM IRQAMSELPV EKSKSIIIGD SLRDIGAGRK AGIWAYGVRT GYGLRDEKSY PTVETAIPHA DLVFDTVYDA VRFQCGYQEI GKALSGAIDE RLSDTAGPLL ISICGRSRSG KSTFAHAVQR VLSETGRRVL RLELDRWILP LEHRRPDMNA EERSRVELYP EIVDVLRRSG QIEAPGYDAA SRGRLRGTTA YDARDADVIL LDGIFAGHAS IREQVDMSVF VEASEQSLLN RFHTFYAWKG LTPVAAEELW QSRIQEEWPR IDLQRTSADI VINLEEAIL
|
| |
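The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any calculation. The following is a minimal sketch of how the Gene sequence field might be cleaned and sanity-checked against the Gene Length and GC content fields. The helper names are hypothetical, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11, as listed in the record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of translation table 11


def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a whole number of codons and the last one is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Usage: paste the full Gene sequence field in place of this short excerpt.
excerpt = clean_sequence("ATGGGTTCTG AGGACGCGAT")
print(len(excerpt), round(gc_fraction(excerpt), 2))
```

Applied to the complete field, `len` should agree with Gene Length, `gc_fraction` should round to the reported GC content, and `ends_with_stop` should be true given the terminal TGA.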