Gene Rleg_0472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0472 
Symbol 
ID8011669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp491527 
End bp492828 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content67% 
IMG OID644823064 
ProductPpx/GppA phosphatase 
Protein accessionYP_002974317 
Protein GI241203221 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.210353 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGAAC AGGCCGGAGC GGCAGATCAT GCCGCCCGCT CCGATGGCGA TTCGGCGCCG 
AGCCGTCGCA ACCGCCGCAA GCATCGCGGC AAGCGCGGTC TGCAGGGCCG GCCGCTAACA
CCGGCAAAAC CATCAGGTGC CCCGCAGCAG GAACGCCGCG CGGAAGAGAC GGCGCTCGGT
GGCGAACTGC GTGAAACACT GAATGGCGCG AGCGCCAACC GCAGGGGTCG CCAGGAAAGC
CATGCCCATC CCGGGCATCA GGGGGATCGT CAGGCCGGCG AACATGTGTG GCCGGAGGAG
CTCTATGCTG CCCTCGATCT CGGCACCAAC AATTGCCGTC TCCTCATTGC CCAGCCGACT
CGTCCGGGAC AATTTCGCGT CGTCGACGCC TTTTCCCGCA TCGTGCGCCT CGGCGAAGGT
CTTGCGGCCA GCGGCCGTCT GTCCGATGAG GCGATGGAGA GGGCGATCGA GGCGCTGAGG
ATCTGCGCCT CCAAGCTCAG GAACCGGGAG ATCCGCCGCA TGCGGCTGAT CGCCACCGAA
GCCTGCCGCC AGGCCGTCAA TGGCGAGGAA TTCCTCAGCC GGGTCGTTGC CGAAACCGGC
CTCGCGCTTG AGATCATCGA TCGCGAGACA GAGGCGCGGC TTGCCGTCTC CGGCTGCTCC
TCGCTGGTCG GTCGCGAGAC GCGCTCCGTC GTGCTCTTCG ATATCGGCGG CGGCTCGTCG
GAGATTGCCG TCATCCGCAT CGGCGACAAT CGCTTCAGCC GCCTTGCCAA TCACATCACC
CACTGGACCT CGTTGCCGGT CGGCGTCGTG ACCCTGTCGG AGCGCCATGG CGGGCACGAC
GTCACGCCGG AGGCCTTCGA AGGCATGGTG CGTGAAGTCG AAGGCATGCT TGGAAGCTTC
GATTGCCCGG AGATCGAGGT CGCCCAGACC GGCGACTTTC ACCTGATCGG CACATCGGGC
ACGGTGACAA CGCTTGCCGG CGTGCATCTC GACCTGCCGC GTTATGACCG GCGCAAGGTC
GATGGCATCT GGCTGTCCGA CGACGAGGTC TCCGCCATGC AGGCAAAGCT GCTCTCCTGG
GATTTCGCAA GCCGCGCCGC CAATCCCTGC ATTGGGCCCG ATCGGGCCGA TCTGGTGCTC
GCCGGCTGCG CCATTCTCGA GGCGATCCGC CGCCGTTGGC CGAGCCCGCG CATGCGGGTT
GCCGATCGCG GCTTGAGGGA AGGTCTGCTC ACCGACATGA TGGCCGATGA CGGCGTGTGG
CGGCGCAATC GTAACCGCCG CGGCCAGCGC GTGAGATCGT GA
 
Protein sequence
MMEQAGAADH AARSDGDSAP SRRNRRKHRG KRGLQGRPLT PAKPSGAPQQ ERRAEETALG 
GELRETLNGA SANRRGRQES HAHPGHQGDR QAGEHVWPEE LYAALDLGTN NCRLLIAQPT
RPGQFRVVDA FSRIVRLGEG LAASGRLSDE AMERAIEALR ICASKLRNRE IRRMRLIATE
ACRQAVNGEE FLSRVVAETG LALEIIDRET EARLAVSGCS SLVGRETRSV VLFDIGGGSS
EIAVIRIGDN RFSRLANHIT HWTSLPVGVV TLSERHGGHD VTPEAFEGMV REVEGMLGSF
DCPEIEVAQT GDFHLIGTSG TVTTLAGVHL DLPRYDRRKV DGIWLSDDEV SAMQAKLLSW
DFASRAANPC IGPDRADLVL AGCAILEAIR RRWPSPRMRV ADRGLREGLL TDMMADDGVW
RRNRNRRGQR VRS