Gene Rleg_1245 details


Gene Information

Locus tag: Rleg_1245
Symbol:
ID: 8012350
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1221703
End bp: 1223223
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 63%
IMG OID: 644823826
Product: Ppx/GppA phosphatase
Protein accession: YP_002975076
Protein GI: 241203980
COG category: [F] Nucleotide transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0248] Exopolyphosphatase
TIGRFAM ID:
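
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The record does not state either convention, but the numbers are consistent with both. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1221703
end_bp = 1223223
gene_length_bp = 1521
protein_length_aa = 506


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced, complete CDS."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


print(span_length(start_bp, end_bp))            # 1521
print(expected_protein_length(gene_length_bp))  # 506
```

Both results agree with the Gene Length and Protein Length fields.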


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.136301
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.00727222
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGTTGAAT CTGAAGCCCA GGGGCGCCTT CCGGGGATCG CCCCGGTCTC CGTCGTCGAT 
ATTGGATCGA ATTCTATTCG TCTTGTCGTC TACGAAGGCA TGTCCCGTTC GCCAACCGTC
CTCTTCAACG AAAAGGTCCT CTGCGGCCTC GGCAAGGGCG TCGCCCTTAC CGGCAAGATG
GATGAAGACA GCGTCGCGCG GGCTTTGGCG GCGCTGCACC GTTTCAAGGC TTTGTCCGAT
CAGGCGCGCG CTGCCACCAT GTATGTGCTG GCAACGGCGG CCGCGCGCGA GGCGAGCAAC
GGTCCTGATT TCATCCACCA GGCGGAAACC ATCCTTAACC GCAAGGTTCG CGTGCTCTCC
GGCGAGGAGG AGGCGAAATT CGCTTCGCTC GGCATCATCA GCGGTTTTTA CAATCCTGAC
GGCATTGCCG GCGATCTCGG CGGCGGCTCG CTGGAGCTGA TCGATATCAA GGGCAAGGAG
TTCGGCAAGG GCATCACGCT GCCGCTCGGC GGCCTGCGCC TATCGGAATA TGCCGGCGGT
TCGCTCTCCA AAGCCCAGAG CTTTGCCCGA AAGCAGCTGA AGACGGCAAA GCTGCTGTCG
AAAGGCGAGG GCCGAACCTT CTACGCTGTC GGCGGTACCT GGCGAAACAT CGCCAAGCTG
CACATGGAAA TCACTCATTA TCCGCTGCAC ATGATGCAGG GGTATGAGGT GTCGTTCGAA
GGAATGATGC AGTTCCTCGA CCAGGTGGTG ACTGCGCGCG ACTCCAGGGA GCCGGCGCTG
CAGGCCGTTT CCAAGCACCG CCGTTCGCTG CTGCCTTTCG GCGCCGTCGC CATGAAGGAA
GTGCTGAGCG CGATGAAGCC GTCGTTGATT TCCTTCTCGG CGCAGGGTGT GCGCGAGGGA
TATCTTTATT CGCTGCTGTC GGAGGCCGAG CGCCGCGCCG ATCCGCTGCT TGCCGCCGCC
GGAGAACTGG CGATCCTGCG TGCCCGTTCG CCGGAGCATG CCCGCGAGCT GGCGGAATGG
ACCGGCCGCA TGATGCCCCT CTTCGGCATC CAGGAAACCG AAGAGGAAAG CCGCTACCGC
CAGGCCGCCT GTCTGCTTGC CGATATCAGC TGGCGCGCCC ATCCTGACTA TCGCGGCCTG
CAGGCGCTGA ACGTCATCGC CCACTCTTCC TTCGTCGGCA TCAGTCATCC CGGCCGCGCC
TTCATCGCGC TTTCCAACTA TTACCGTTTC GAAGGCCTGC ATGACGACGG CGCCACCGGT
CAGCTGGCGC AGATCGCCAC GCCGCAGCTC ATCGAGCGCG CCAAGCTGCT CGGCGGCATG
CTGCGCGTCG TCTACCTCTT CTCGGCCTCG ATGCCCGGCA TCGTCAAGAA CCTGACCTTC
CGCAAATCCT CGAGCCCGGA CCTCGACCTC GAATTCGTCG TGCCTCCCGA ATATCGCGAC
TTTGCAGGCG AACGCCTGGA CGGCCGCCTG CAGCAGCTGT CGAAGCTAAC GAACAAGCGG
TTGGCGTTTC GGTTCGAGTA G
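
The gene sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence can be used. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and only the first displayed line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, copied from this record.
first_line = "ATGGTTGAAT CTGAAGCCCA GGGGCGCCTT CCGGGGATCG CCCCGGTCTC CGTCGTCGAT"
seq = clean_sequence(first_line)

print(len(seq))                    # 60
print(f"{gc_fraction(seq):.0%}")   # GC percentage of this line only
```

Passing all of the sequence lines to `clean_sequence` as one string gives the full gene, whose GC fraction can then be compared with the reported value.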
 
Protein sequence
MVESEAQGRL PGIAPVSVVD IGSNSIRLVV YEGMSRSPTV LFNEKVLCGL GKGVALTGKM 
DEDSVARALA ALHRFKALSD QARAATMYVL ATAAAREASN GPDFIHQAET ILNRKVRVLS
GEEEAKFASL GIISGFYNPD GIAGDLGGGS LELIDIKGKE FGKGITLPLG GLRLSEYAGG
SLSKAQSFAR KQLKTAKLLS KGEGRTFYAV GGTWRNIAKL HMEITHYPLH MMQGYEVSFE
GMMQFLDQVV TARDSREPAL QAVSKHRRSL LPFGAVAMKE VLSAMKPSLI SFSAQGVREG
YLYSLLSEAE RRADPLLAAA GELAILRARS PEHARELAEW TGRMMPLFGI QETEEESRYR
QAACLLADIS WRAHPDYRGL QALNVIAHSS FVGISHPGRA FIALSNYYRF EGLHDDGATG
QLAQIATPQL IERAKLLGGM LRVVYLFSAS MPGIVKNLTF RKSSSPDLDL EFVVPPEYRD
FAGERLDGRL QQLSKLTNKR LAFRFE
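
The protein sequence can be related to the gene sequence by codon-by-codon translation. The sketch below builds a codon table from the compact base and amino acid strings used for the standard genetic code, which translation table 11 shares for elongation codons. It is a simplified illustration: it does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene begins with ATG. The function name is hypothetical.

```python
from itertools import product

# Codon table in TCAG order: the first base varies slowest, the third fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; stop codons appear as '*'."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last displayed lines of the gene sequence in this record.
first_line = "ATGGTTGAAT CTGAAGCCCA GGGGCGCCTT CCGGGGATCG CCCCGGTCTC CGTCGTCGAT"
last_line = "TTGGCGTTTC GGTTCGAGTA G"

print(translate(first_line))             # MVESEAQGRLPGIAPVSVVD
print(translate(last_line).rstrip("*"))  # LAFRFE
```

The two outputs match the start and end of the protein sequence shown above. The last line can be translated on its own because the 1500 bases before it are a multiple of three, so it begins in frame.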