Gene Rleg2_1099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1099 
Symbol 
ID6979818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1120025 
End bp1121545 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content64% 
IMG OID643395811 
ProductPpx/GppA phosphatase 
Protein accessionYP_002280619 
Protein GI209548702 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.09229 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGAAT CTGAAGCCCA GGGGCGCCTT CCGGGGATCG CCCCGGTCTC CGTTGTCGAC 
ATTGGATCGA ATTCCATTCG TCTCGTCGTC TACGAAGGCA TGTCCCGCTC GCCGACCATT
CTTTTCAACG AAAAGGTTCT CTGCGGCCTC GGCAAAGGCA TAGCCGTCAC CGGCAAGATG
GATGAAGACA GCGTCATCAG GGCTCTGGCG GCGCTGCACC GTTTCAAGGC CCTGTCCGAT
CAGGCACGCG CCGCCACCAT GTATGTGCTG GCGACGGCCG CCGCCCGCGA GGCGAGCAAC
GGTCCCGATT TCATCCACCA GGCGGAAACC ATCCTCGGCC GCAAGGTCCG TGTGCTCTCC
GGCGAGGAGG AGGCGAAATT CGCCTCGCTC GGCATCATCA GCGGCTTCTT CAATCCCGAC
GGCATTGCCG GCGACCTCGG CGGCGGCTCG CTTGAGCTGA TCGATATCAG GGGCAAGGAG
TTCGGCAAGG GCATCACGCT GCCGCTCGGC GGCCTGCGCC TGTCGGAATA TGCCGGCGGC
TCGCTCTCCA AGGCCCGCAC CTTTGCCCGC AAGCAGGTGA AGACCGCAAA GCTGCTGTCG
AAAGGCGAGG GGCGCACCTT CTACGCCGTC GGCGGCACAT GGCGAAACAT CGCCAAGCTG
CATATGGAAA TCACCAATTA TCCGCTGCAC ATGATGCAGG GCTACGAGGT ATCGCTTGAA
GCGATGATGC TGTTCCTCGA ACAGGTGGTG ACCGCGCGCG ATTCGAAGGA GCCTGCGTTT
CAGGCCGTCT CCAAGCACCG CCGGTCGCTG CTGCCCTTCG GCGCCGTCGC CATGACGGAA
GTGCTGAGCG CGATGAAACC GTCGGTGATT TCCTTCTCGG CGCAGGGCGT GCGTGAGGGA
TATCTCTATT CGCTGCTGTC GGAGGCCGAG CGCCGCCTCG ATCCGCTGCT GGCCGCTGCC
GGCGAGCTGG CGATCCTGCG TGCCCGTTCG CCCGAGCATG CCCGCGAGCT GGCGGAATGG
ACCGGCCGCA TGATGCCCTT GTTCGGCGTC CAGGAAACCG ACGAGGAAAG CCGCTATCGT
CAGGCCGCCT GTCTGCTGGC CGATATCAGC TGGCGCGCCC ATCCCGACTA TCGCGGCCTG
CAGGCGCTGA ACGTCATCGC CCACTCCTCC TTCGTCGGCA TCAGCCATCC TGGCCGCGCC
TTCATCGCGC TGACCAACTA CTACCGTTTC GAAGGCCTGC ACGATGACGG CGCCACCGGC
CCGCTGGCGC AGATCGCCAC AGCCCAGTTC ATCGAGCGCG CCAAGCTGCT CGGCGGCATG
CTGCGCGTCG TCTACCTCTT CTCGGCCTCA ATGCCCGGCA TCGTCAAAAG CCTGAGCTTC
CGCAAATCGT CGAACCCGGA CCTCGACCTC GAATTCGTCG TGCCGCCCGA ATACCGCGAT
TTCGCCGGCG AACGCCTGGA CGGGCGCCTG CAGCAGCTGT CGAAGCTGAC GAACAAGAGG
CTGGCGTTCC GGTTCGAGTA G
 
Protein sequence
MVESEAQGRL PGIAPVSVVD IGSNSIRLVV YEGMSRSPTI LFNEKVLCGL GKGIAVTGKM 
DEDSVIRALA ALHRFKALSD QARAATMYVL ATAAAREASN GPDFIHQAET ILGRKVRVLS
GEEEAKFASL GIISGFFNPD GIAGDLGGGS LELIDIRGKE FGKGITLPLG GLRLSEYAGG
SLSKARTFAR KQVKTAKLLS KGEGRTFYAV GGTWRNIAKL HMEITNYPLH MMQGYEVSLE
AMMLFLEQVV TARDSKEPAF QAVSKHRRSL LPFGAVAMTE VLSAMKPSVI SFSAQGVREG
YLYSLLSEAE RRLDPLLAAA GELAILRARS PEHARELAEW TGRMMPLFGV QETDEESRYR
QAACLLADIS WRAHPDYRGL QALNVIAHSS FVGISHPGRA FIALTNYYRF EGLHDDGATG
PLAQIATAQF IERAKLLGGM LRVVYLFSAS MPGIVKSLSF RKSSNPDLDL EFVVPPEYRD
FAGERLDGRL QQLSKLTNKR LAFRFE