Gene Rleg2_5598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5598 
Symbol 
ID6978692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp1243552 
End bp1245501 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content61% 
IMG OID643394696 
Producthistidinol-phosphate phosphatase family protein 
Protein accessionYP_002279514 
Protein GI209547596 
COG category[E] Amino acid transport and metabolism
[J] Translation, ribosomal structure and biogenesis
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0241] Histidinol phosphatase and related phosphatases
[COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) 
TIGRFAM ID[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
[TIGR01656] histidinol-phosphate phosphatase family domain
[TIGR01662] HAD-superfamily hydrolase, subfamily IIIA 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTTCTG AGGACGCGAT CCGCCAGGCG ATCATTATTG CGGGTGGCCT TGGAACACGC 
GCGCGCAGCA TGACGGGCGA TGCTATCCCG AAGGCCCTTT TGCCGCTCGC CGGCGTGCCG
ATCATTCTGC GGCAAATCCG CACCCTGGCT CGCGAAGGTA TCCAGCATGT GCGCGTGCTC
GGCGGTCACC TCGGCAGCCA GCTCGAGCCT GCCCTTGGCC CGGAAGCTGA AAAACTCGGT
ATCAAGATCG AGGTCTTCGT CGAGAAATCT CCGCTCGGCA CAGCGGGCTG CCTGACGACT
TTGGTCATGA CAGCCGGCGA TGTCCTGATC GTCTACGGCG ACATGCTTTT CGATATCGAT
CTGTCGGCAC TGACGCGTCA TCGCCAGCAA TTTCCCGCCG CGCTGACCAT CATCGCTCAC
CCCAACGACC ATCCCCGCAC ATCCGATCTC GTCGTCCAGA AAAGCGGTTA TCTCCAGCGT
CTTTTGCCCC GCAAAACCCC GCGCGATTCG GATTGGCGGA ATTTGGTGCC GGCCGGTTTG
TATGTGGCCT CCGAGCAATT TTTCCAGGCG CTCGTGCCGG GTCAAACAGC CGATATGATC
CACGACGTCA TTCCCGGTCT CCTCGGACGG TCCGTTCCGA TCGCGATTTA CGACACGCCG
GAATATATGA AGGATACCGG GTCGCCGAAC CGGCATGCTG CCGCGGAAGC GGACCTCCGG
CAGGAACGGG TTCATGCGGC GCATCTGTCC GTGCAAAGGC CCGCGGTTTT CTTCGATTGC
GACGGCGTGC TCAACGAAGA TGTCGGCGGC CATGGCGTCA TACATCCCGA CCAGGTGAAG
CTGATCGACC GGGCGGGCGA AGCCGTGCGG CTCGCCCGCG AGGCAGGCTT CCTGACGATT
GCGGTCACGA ACAGGCCGCA GGTCGCCAAA GGCTTTCTGG ATGAAGCCGG GCTGGATCAT
GCTCTCGGCC GCCTCGAGGC CAAGCTTGCC GAAGATGGCG GCGTCCTGGA TCGCATTTAT
TTTTGTCCGC ATCATCCGGA CAAAGGATTT CCCAACGAAG TCGCCGCGCT CAAGATCGAT
TGTGCCTGCC GGAAACCGGG CGATCTGATG ATCCGTCAGG CCATGTCGGA ACTGCCTGTC
GAAAAATCGA AATCCATCAT CATCGGCGAC AGCCTGCGCG ACATCGGTGC TGGCCGCAAG
GCGGGCATCT GGGCCTATGG CGTCCGGACC GGTTATGGCT TGCGGGACGA AAAGAGCTAT
CCCACCGTCG AAACAGCGAT ACCGCATGCC GATCTCGTCT TCGACACGGT CTATGACGCC
GTCCGCTTCC AATGCGGCTA CCAAGAGATC GGCAAGGCTC TGTCCGGCGC GATTGATGAA
CGGCTTTCAG ATACGGCCGG TCCGCTGCTC ATCAGCATAT GCGGCCGTTC CCGCTCCGGG
AAAAGTACGT TTGCACACGC CGTTCAACGC GTGCTCTCGG AAACAGGGCG CAGGGTGCTG
AGGCTGGAAC TCGACCGCTG GATACTGCCG CTCGAACATC GCCGCCCCGA CATGAACGCG
GAAGAGCGCA GCAGAGTAGA GCTCTACCCC GAGATCGTCG ACGTGCTGCG CCGCTCCGGA
CAGATCGAAG CGCCCGGCTA TGACGCGGCA AGCCGCGGCC GGCTTAGGGG CACCACCGCC
TATGACGCCC GCGATGCCGA CGTCATCCTC CTGGACGGCA TCTTTGCCGG GCATGCGTCG
ATCCGCGAAC AGGTCGATAT GTCCGTCTTC GTCGAAGCGT CCGAGCAGAG CCTGCTGAAC
CGCTTTCACA CGTTCTATGC CTGGAAAGGC CTCACGCCAG TTGCTGCCGA AGAGCTCTGG
CAGTCGCGAA TTCAGGAAGA GTGGCCGAGG ATCGATCTGC AGCGCACATC GGCCGATATC
GTCATCAATC TCGAGGAGGC AATCCTTTGA
 
Protein sequence
MGSEDAIRQA IIIAGGLGTR ARSMTGDAIP KALLPLAGVP IILRQIRTLA REGIQHVRVL 
GGHLGSQLEP ALGPEAEKLG IKIEVFVEKS PLGTAGCLTT LVMTAGDVLI VYGDMLFDID
LSALTRHRQQ FPAALTIIAH PNDHPRTSDL VVQKSGYLQR LLPRKTPRDS DWRNLVPAGL
YVASEQFFQA LVPGQTADMI HDVIPGLLGR SVPIAIYDTP EYMKDTGSPN RHAAAEADLR
QERVHAAHLS VQRPAVFFDC DGVLNEDVGG HGVIHPDQVK LIDRAGEAVR LAREAGFLTI
AVTNRPQVAK GFLDEAGLDH ALGRLEAKLA EDGGVLDRIY FCPHHPDKGF PNEVAALKID
CACRKPGDLM IRQAMSELPV EKSKSIIIGD SLRDIGAGRK AGIWAYGVRT GYGLRDEKSY
PTVETAIPHA DLVFDTVYDA VRFQCGYQEI GKALSGAIDE RLSDTAGPLL ISICGRSRSG
KSTFAHAVQR VLSETGRRVL RLELDRWILP LEHRRPDMNA EERSRVELYP EIVDVLRRSG
QIEAPGYDAA SRGRLRGTTA YDARDADVIL LDGIFAGHAS IREQVDMSVF VEASEQSLLN
RFHTFYAWKG LTPVAAEELW QSRIQEEWPR IDLQRTSADI VINLEEAIL