Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3907 |
Symbol | |
ID | 8014726 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3973342 |
End bp | 3975288 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644826477 |
Product | protein of unknown function DUF1680 |
Protein accession | YP_002977688 |
Protein GI | 241206592 |
COG category | [S] Function unknown |
COG ID | [COG3533] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.373018 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCTTAG AGAAAGTGGA GCCTATGACC AAACCAAGCA ATGACCGCCA GTTTCGTCCC GTCGCCGTTC CCGATGTGGA GCTTGGCGGC TTCTGGGGCA AATGGCAGGA CGCCGTCTGC AATTCCACTG CCGAGACCCT GCTCGACCGC TGCGTCGAGG CCGGCATGCT CAAGGCGATC GATGTCAGCC AGCCGAGCCC GGGGGTCGTC ATTCCCATTC AGCCATGGGG CGGGACGACG CAGATGTTCT GGGATTCCGA CCTCGGCAAG TCGATCGAAA CGATCGCCTA TTCGCTCTAT CGCCGGCCGA ACCCGAAGCT CGAAGCGCGT GCCGACGAGA TCATCGACAT GTATGAGAAG CTGCAGGATG AGGACGGCTA TCTGAACGCC TGGTTCCAGC GTGTGGAGCC GAGCCGCCGC TGGACCAATC TGCGCGACCA TCACGAACTC TATTGCGCCG GCCACCTGAT GGAAGCCGCG GTCGCCTATT ATCAGGCGAC CGGCAAACGC AAGCTGCTCG ACATCATGTG CCGTTATGCC GATTACATGA TCAAGATTTT CGGCCATCGC GAAGGCCAGA TATCAGGCTA TTGCGGCCAC GAGGAAGTCG AGCTGGCGCT GGTCAAGCTC GCTCGCGTGA CGGATGAGAA GAAATATCTC GAGCTGTCGA AATACTTCAT CGACGAACGC GGCACCGAAC CGCATTTCTT CACCGCCGAG GCAAGCCGCG ATGGGCGCGA TGTCTCCGAG TACCACCAGA AGACCTATGA ATATGCGCAG GCGCACCAGC CGGTGCGCGC GCAAACAAAG GTCGTCGGCC ACGCCGTGCG CGCCATGTAC CTCTATTCGG GCATGGCCGA CATCGCCACC GAATACAAGG ACGACAGCCT GACGGCAGCG CTGGAAACGC TCTGGGACGA TCTGACGACC AAGCAGATGT ACATCACCGG CGGCATCGGG CCGGCGGCCT CCAACGAGGG CTTTACCGAT TACTTCGATC TGCCGAACGA TACCGCCTAT GCGGAGACCT GCGCCTCGGT CGGGCTGGTG TTCTGGGCGA GCCGCATGCT CGGCCGCGGT CCGGATCGGC GCTATGCCGA CATCATGGAG CAGGCGCTTT ACAACGGCGC TCTGCCGGGG CTTTCGATCG ACGGCAAGAC CTTCTTCTAT GACAATCCGC TCGAAAGTGC CGGCAAGCAC CACCGGTGGA AATGGCACCA TTGCCCCTGC TGCCCGCCCA ACATCGCCCG GCTGGTGACG TCGATCGGCT CCTACATGTA CGCCGTTTCG GATAACGAGA TCGCCGTGCA CCTCTATGGT GAAAGCACCG CGCGGCTGAA GCTTGCCAAT GGCGCCGAGG TCGAACTCGA GCAGACCACC AATTATCCGT GGGAAGGTGC GGTCGCCTTT ACCACCAGGC TGGAGAAGCC GGCGAAGTTT GCACTGTCGC TGCGCGTTCC GGACTGGGCT GATGGCGCAA CCCTCAGCGT CAACGGAGAG ATGCTCGATC TCAATGCCAA TATGCGGGAC GGATATGCCA GGATCGATCG TGAGTGGGCC GCGGGCGATC GTGTCGCCCT CTACCTGCCG CTGGCGCTTC GGCCGCAATA TGCCAACCCG AAGGTGCGCC AGGATGCCGG GCGCGTCGCA TTGATGCGCG GCCCGCTGGT CTATTGCGTC GAAACGACCG ACAACGGCGA AGACCTCAAC GCCATCGTCC TGCCGCGTGA GCTCTCCACA GCCGAAACCG TCGTGCTGAA GGATCTCAAT GATGCCGTCG CCCTCGATCT CAAGGTCGAG CGCGAGGAAA CATCGAACTG GGGAACAGCG CTCTACCGCA AGGCGCCGGC GGAAAGGCAG GTCGCCACCG CGCGTTTCGT GCCCTATCAT CTCTGGGACA ACCGCGCGCC CGGAGAGATG CTCGTCTGGG TCCAGTCGGA CAGATAG
|
Protein sequence | MILEKVEPMT KPSNDRQFRP VAVPDVELGG FWGKWQDAVC NSTAETLLDR CVEAGMLKAI DVSQPSPGVV IPIQPWGGTT QMFWDSDLGK SIETIAYSLY RRPNPKLEAR ADEIIDMYEK LQDEDGYLNA WFQRVEPSRR WTNLRDHHEL YCAGHLMEAA VAYYQATGKR KLLDIMCRYA DYMIKIFGHR EGQISGYCGH EEVELALVKL ARVTDEKKYL ELSKYFIDER GTEPHFFTAE ASRDGRDVSE YHQKTYEYAQ AHQPVRAQTK VVGHAVRAMY LYSGMADIAT EYKDDSLTAA LETLWDDLTT KQMYITGGIG PAASNEGFTD YFDLPNDTAY AETCASVGLV FWASRMLGRG PDRRYADIME QALYNGALPG LSIDGKTFFY DNPLESAGKH HRWKWHHCPC CPPNIARLVT SIGSYMYAVS DNEIAVHLYG ESTARLKLAN GAEVELEQTT NYPWEGAVAF TTRLEKPAKF ALSLRVPDWA DGATLSVNGE MLDLNANMRD GYARIDREWA AGDRVALYLP LALRPQYANP KVRQDAGRVA LMRGPLVYCV ETTDNGEDLN AIVLPRELST AETVVLKDLN DAVALDLKVE REETSNWGTA LYRKAPAERQ VATARFVPYH LWDNRAPGEM LVWVQSDR
|
| |