Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_0196 |
Symbol | |
ID | 8011426 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 197189 |
End bp | 199153 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644822789 |
Product | hypothetical protein |
Protein accession | YP_002974046 |
Protein GI | 241202950 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3593] Predicted ATP-dependent endonuclease of the OLD family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.515468 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCTGA AGAGTTTTAG CGTGTCGGAG TTTCGCAGCA TCATTGCTTC TGGCGAGATC ACGCTTGGAG ACGTAACATG CCTTGTCGGT AAGAACGAAG CGGGAAAGAC GGCTCTCCTG AAGGCCCTTT ACAAGCTATC GCCGATGTCC ACTACCGATG CCAGATTCGA CGTGACGGAT GACTACCCGC GCAAGGATCT GGGCGATTAC CAGCACGAAG TCGACGAGGG TGTCAGGGGT CAAGCCGTCC CTATACGTGC GACGTTCGAA TTGACCGACG CGGAGGTTTC ATCCGTAGCA AACCTGTTCG GGCCTAAGGC TCTGACTTCA AACGTTCTTA CGTTGGAGAA ATCATACGAC AATAGACGGG TCTACTCACT GGACTTTAAC GAGACGGAGG CCCTGAAGTA CGTCGTTTCG TCCGCCGAGC TGGAAAGTGC CGATCTTGCG GCTGTCGGCG ACGTCCAGAC ATGGAAGGGC TTGGGGAACC GACTGGCGGA GCTGGCCGCA TCGCCCACCG GAGCGAAGCT AAAAATGCTC GTTGATAAGA TCAACGACAA GGGTTCCGGA GGTGGCTACT ATGCGTACAA TTCTATCTTG AGCAGGAGCG TTCCGGAGTT CCTCTACTTC GACGAATATT ACCAAATGGT TGGTCACGAT AACGTTGAGG CCCTCATCCA ACGGCGCGAC AGCGAAGACT TACGGCCGTC GGACCACCCG CTTCTAGGCT TGATCAATCT TGCCCGGCTC AGCCTCGACG ACTTGATCTC GTCCAAAAGG ACGATGGAGA TGGTCAACAA GCTGGAGGCA GCAGGAAACC ATTTGACCCG CCAGATCCTC AAATACTGGT CACAAAACAA GCATCTTCAG ATGAAGTTCG ATGTCCGCGA GGCGATGGCC GACGACCCGG TCGAAATGAG GAGCGGGCAT AACATCTGGG GCCGCGTCTA CGACCAGGTG CATTGGGCGA CCACGGAGCT GAGCTCCCGA TCACGAGGCT TCGTCTGGTT CTTCTCGTTC CTCGCTTGGT ACGAAGATGT GAAACGAGCT CGCAAGAACA TTATCCTCCT TCTCGACGAG CCGGGCACCT CGCTACACGG TCGCGCACAG GGCGATCTTC TACGGTATAT CGAACAGGAG CTGCGTCCGC ACCATCAGGT AATCTACACC ACGCACTCGC CGTTCATGGT GGATCCTCAG CACTTCGACC GAGTCAGGAT CGTGCAGGAC AGGGGCATCG ATTCTGACGA TCCATTACCG CGGGAAGAGG ACGGAACGAA AGTCCTGGAA AACGTGTTCG ACGCCTCCGA CGACAGTCTC TTCCCGTTGC AGGGTGCACT CGGCTACGAC ATCAGCCAGA CACTTTTCAT CGGACCCAAC TCTCTCGTCG TGGAAGGACC ATCGGACCTC TTTTACCTTC GGGGTATGAG CAGTCTCCTC GAAAGAGAAA AGCGAACCGG GTTGAGTCCG GAATGGACGC TTACGCCCGT CGGCGGAAGC AGCAAAATCC CTACTTTCGT GGCCATGCTG GCTCCACAAA GGGGCATGAA CGTCGCGGTT CTCGTCGATA TCCAGGCTAG CGACCGGCAG ACTGTGGAAG GGCTTTATAA GAAGAAGCTG CTCGACCAGA AAAACGTCCA TACGTTTGCC GATTTCACTG GCCTTGTAGA ATCCGACGTA GAGGACATGT TCGAGCGCGA CTTCTACGTC AATCTAGTCA ACGAGGAGTT CCGCGGGCAG CTATCGTCGA AGATAACGGC TTCTAAACTC AACCGGAACC TACCTCGCGT ACTGCGCGCC CTTGAGGAGC ATTTTCAAAC GTCGCCTCTG AAATCGGGGC AGTTCGGACA TTACAGACCT GCACGCTACT TCGCTGAAAA CCTTGCTGTT CTCACGCCCA AAATCTCCGA GGAAACGAAG AACCGCTTCG AGGCATTGTT CAAGAAACTG AATGCTCAAT TATGA
|
Protein sequence | MRLKSFSVSE FRSIIASGEI TLGDVTCLVG KNEAGKTALL KALYKLSPMS TTDARFDVTD DYPRKDLGDY QHEVDEGVRG QAVPIRATFE LTDAEVSSVA NLFGPKALTS NVLTLEKSYD NRRVYSLDFN ETEALKYVVS SAELESADLA AVGDVQTWKG LGNRLAELAA SPTGAKLKML VDKINDKGSG GGYYAYNSIL SRSVPEFLYF DEYYQMVGHD NVEALIQRRD SEDLRPSDHP LLGLINLARL SLDDLISSKR TMEMVNKLEA AGNHLTRQIL KYWSQNKHLQ MKFDVREAMA DDPVEMRSGH NIWGRVYDQV HWATTELSSR SRGFVWFFSF LAWYEDVKRA RKNIILLLDE PGTSLHGRAQ GDLLRYIEQE LRPHHQVIYT THSPFMVDPQ HFDRVRIVQD RGIDSDDPLP REEDGTKVLE NVFDASDDSL FPLQGALGYD ISQTLFIGPN SLVVEGPSDL FYLRGMSSLL EREKRTGLSP EWTLTPVGGS SKIPTFVAML APQRGMNVAV LVDIQASDRQ TVEGLYKKKL LDQKNVHTFA DFTGLVESDV EDMFERDFYV NLVNEEFRGQ LSSKITASKL NRNLPRVLRA LEEHFQTSPL KSGQFGHYRP ARYFAENLAV LTPKISEETK NRFEALFKKL NAQL
|
| |