Gene Rleg_0030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0030 
Symbol 
ID8011277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp27422 
End bp28471 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content64% 
IMG OID644822620 
ProductPhoH family protein 
Protein accessionYP_002973880 
Protein GI241202784 
COG category[T] Signal transduction mechanisms 
COG ID[COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.04719 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAACGGAC AAGAATTGGT TTCTTCTTCA CCGCGCCACC CCCGCACGCC GAGCGATACC 
AATCACTTCG TCCTGACGTT CGAGAACAAC CGCTTCGCCA GCGAGCTCTT CGGTCAATTC
GACCAGAACC TCAAGCTGCT CGAACAACGG CTGAACATCG ATGCGCGGGC ACGCGGCAAT
TCGGTCGTCA TCACAGGCGA TGTTGTGACC ACCAACCAGG CGCGGCGCAC GCTCGACTAT
CTCTATGAAA AACTTCAGAA AGGCGGCAGC GTGGAACAAT CCGACGTCGA GGGCGCAATC
CGCATGGCGG TCGCCGCCGA CGATCAGCTC AGCCTGCCGA CCATGGAGCG CAAAGCCAAG
CTGACGATGG CGCAGGTTTC CACGCGCAAG AAGACGATCA TCGCCCGCAC GCCGACGCAG
GACGCCTATA TCAGGGCGCT GGAACGCGCC GAGCTCGTCT TCGGCGTCGG CCCGGCCGGC
ACTGGCAAGA CCTATCTTGC CGTCGCCCAT GCCGCCCAGC TCCTGGAGCG CGGCGCGGTC
GAAAAGATCA TCCTGTCGCG CCCGGCCGTC GAGGCCGGCG AACGCCTCGG CTTCCTGCCC
GGGGACATGA AGGAAAAGGT CGACCCCTAT CTTCGCCCGC TCTATGACGC ACTCTACGAC
ATGATCCCGG CCGACAAGGT CGACCGGGCG ATCACTGCCG GCGTCATCGA AATCGCGCCG
CTGGCCTTCA TGCGCGGCCG CACGCTCGCC AACGCCGCCA TCATCCTCGA CGAAGCGCAG
AACACGACGT CGATGCAGAT GAAGATGTTC CTGACGCGTC TCGGCGAGAA TGCGCGCATG
ATCGTCACCG GCGACCCGAG CCAGATCGAC CTGCCGCGCG GCGTCAAATC CGGCCTCGTC
GAGGCCTTGC AGCTTCTGAA CGGCGTCGAG GGAATCTCGA TCGTGCGCTT CACGGATACC
GACGTCGTCC GCCACCCGCT GGTCGGGCGC ATCGTCAGGG CCTATGATTC CACGTATGCC
GTCGCCGAAG ACGTCAGCCG GCAGGGCTAA
 
Protein sequence
MNGQELVSSS PRHPRTPSDT NHFVLTFENN RFASELFGQF DQNLKLLEQR LNIDARARGN 
SVVITGDVVT TNQARRTLDY LYEKLQKGGS VEQSDVEGAI RMAVAADDQL SLPTMERKAK
LTMAQVSTRK KTIIARTPTQ DAYIRALERA ELVFGVGPAG TGKTYLAVAH AAQLLERGAV
EKIILSRPAV EAGERLGFLP GDMKEKVDPY LRPLYDALYD MIPADKVDRA ITAGVIEIAP
LAFMRGRTLA NAAIILDEAQ NTTSMQMKMF LTRLGENARM IVTGDPSQID LPRGVKSGLV
EALQLLNGVE GISIVRFTDT DVVRHPLVGR IVRAYDSTYA VAEDVSRQG