Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1245 |
Symbol | |
ID | 8012350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 1221703 |
End bp | 1223223 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644823826 |
Product | Ppx/GppA phosphatase |
Protein accession | YP_002975076 |
Protein GI | 241203980 |
COG category | [F] Nucleotide transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0248] Exopolyphosphatase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.136301 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00727222 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTTGAAT CTGAAGCCCA GGGGCGCCTT CCGGGGATCG CCCCGGTCTC CGTCGTCGAT ATTGGATCGA ATTCTATTCG TCTTGTCGTC TACGAAGGCA TGTCCCGTTC GCCAACCGTC CTCTTCAACG AAAAGGTCCT CTGCGGCCTC GGCAAGGGCG TCGCCCTTAC CGGCAAGATG GATGAAGACA GCGTCGCGCG GGCTTTGGCG GCGCTGCACC GTTTCAAGGC TTTGTCCGAT CAGGCGCGCG CTGCCACCAT GTATGTGCTG GCAACGGCGG CCGCGCGCGA GGCGAGCAAC GGTCCTGATT TCATCCACCA GGCGGAAACC ATCCTTAACC GCAAGGTTCG CGTGCTCTCC GGCGAGGAGG AGGCGAAATT CGCTTCGCTC GGCATCATCA GCGGTTTTTA CAATCCTGAC GGCATTGCCG GCGATCTCGG CGGCGGCTCG CTGGAGCTGA TCGATATCAA GGGCAAGGAG TTCGGCAAGG GCATCACGCT GCCGCTCGGC GGCCTGCGCC TATCGGAATA TGCCGGCGGT TCGCTCTCCA AAGCCCAGAG CTTTGCCCGA AAGCAGCTGA AGACGGCAAA GCTGCTGTCG AAAGGCGAGG GCCGAACCTT CTACGCTGTC GGCGGTACCT GGCGAAACAT CGCCAAGCTG CACATGGAAA TCACTCATTA TCCGCTGCAC ATGATGCAGG GGTATGAGGT GTCGTTCGAA GGAATGATGC AGTTCCTCGA CCAGGTGGTG ACTGCGCGCG ACTCCAGGGA GCCGGCGCTG CAGGCCGTTT CCAAGCACCG CCGTTCGCTG CTGCCTTTCG GCGCCGTCGC CATGAAGGAA GTGCTGAGCG CGATGAAGCC GTCGTTGATT TCCTTCTCGG CGCAGGGTGT GCGCGAGGGA TATCTTTATT CGCTGCTGTC GGAGGCCGAG CGCCGCGCCG ATCCGCTGCT TGCCGCCGCC GGAGAACTGG CGATCCTGCG TGCCCGTTCG CCGGAGCATG CCCGCGAGCT GGCGGAATGG ACCGGCCGCA TGATGCCCCT CTTCGGCATC CAGGAAACCG AAGAGGAAAG CCGCTACCGC CAGGCCGCCT GTCTGCTTGC CGATATCAGC TGGCGCGCCC ATCCTGACTA TCGCGGCCTG CAGGCGCTGA ACGTCATCGC CCACTCTTCC TTCGTCGGCA TCAGTCATCC CGGCCGCGCC TTCATCGCGC TTTCCAACTA TTACCGTTTC GAAGGCCTGC ATGACGACGG CGCCACCGGT CAGCTGGCGC AGATCGCCAC GCCGCAGCTC ATCGAGCGCG CCAAGCTGCT CGGCGGCATG CTGCGCGTCG TCTACCTCTT CTCGGCCTCG ATGCCCGGCA TCGTCAAGAA CCTGACCTTC CGCAAATCCT CGAGCCCGGA CCTCGACCTC GAATTCGTCG TGCCTCCCGA ATATCGCGAC TTTGCAGGCG AACGCCTGGA CGGCCGCCTG CAGCAGCTGT CGAAGCTAAC GAACAAGCGG TTGGCGTTTC GGTTCGAGTA G
|
Protein sequence | MVESEAQGRL PGIAPVSVVD IGSNSIRLVV YEGMSRSPTV LFNEKVLCGL GKGVALTGKM DEDSVARALA ALHRFKALSD QARAATMYVL ATAAAREASN GPDFIHQAET ILNRKVRVLS GEEEAKFASL GIISGFYNPD GIAGDLGGGS LELIDIKGKE FGKGITLPLG GLRLSEYAGG SLSKAQSFAR KQLKTAKLLS KGEGRTFYAV GGTWRNIAKL HMEITHYPLH MMQGYEVSFE GMMQFLDQVV TARDSREPAL QAVSKHRRSL LPFGAVAMKE VLSAMKPSLI SFSAQGVREG YLYSLLSEAE RRADPLLAAA GELAILRARS PEHARELAEW TGRMMPLFGI QETEEESRYR QAACLLADIS WRAHPDYRGL QALNVIAHSS FVGISHPGRA FIALSNYYRF EGLHDDGATG QLAQIATPQL IERAKLLGGM LRVVYLFSAS MPGIVKNLTF RKSSSPDLDL EFVVPPEYRD FAGERLDGRL QQLSKLTNKR LAFRFE
|
| |