Gene Information |
Locus tag | Rleg_6116 |
Symbol | |
ID | 8016073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012852 |
Strand | + |
Start bp | 155782 |
End bp | 157785 |
Gene Length | 2004 bp |
Protein Length | 667 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644827422 |
Product | oxidoreductase domain protein |
Protein accession | YP_002978622 |
Protein GI | 241258738 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
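The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive positions and that the reported gene length includes the terminal stop codon; both assumptions are consistent with the values in this record, but the record itself does not state them. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Rleg_6116).
length = gene_length_bp(155782, 157785)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the reported Gene Length (2004 bp) and Protein Length (667 aa).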
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.409557 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTCAG ATCAACCGAT CCGCTGGGGC ATCATCGGCC CCGGCACCAT CGCCCGCACC TTTGCCGATG GCGTCGCTCA TTCGCGCACC GGCAGACTGG TGGCGATCGC CACCCGCAAT CCTGCAAAGC CGGGTCTTGC CGAAGGTTTC CCCGGCGCCC GCATCGTCGA TGGTTACGAG GCGTTGCTCT CCGATAAGGA GATCGATGCG ATCTATATCG CCGTCCCCCA CACCGGCCAT GCCGAATGGG CGATCAAGGC GGCACGTGCC GGCAAGCACA TCCTGGTGGA AAAGCCGATC GCGCTCTCGG CCTACGATGC CGAAGCGGTT TACTACGAGG CGAAAAAAGC CGGCGTCTTC GCCGGGGAAG CCTTCATGTA CCGCGTGCAT CCGCAAACGG AGAAGCTGGT CGAACTCGTC AAAAGCGGCG TCGTCGGCAC CGTTCGCATC ATCCGCTCGA GCTTCGGCTT CAACATGGGC AGCTATAAGC CGGAACACCG GCTTTTCGCC AACGACACCG CCGGTGGCGG CATTCTCGAT GTCGGCGGTT ATCCGGTCTC GATGGCCAGA CTGATTGCCG GCGCCGCGGA AGGCAAGGCC TTCCTCGAAC CGGAGAAGGT CTCGGGCGTC GCCCATCTCG GAGAGAGCGG TGTCGATGAA TGGGCATCGG CCGTCCTCAA GTTCCCCAAC GAGATCATCG CCGAAGTCTC CTGCTCGATC ATGGCGCAGC AGGACAATGT GCTGCGCATC ATCGGCTCGG AAGGCCGGAT CGAGGTCCAG GACTTCTGGT TTGCCTCCGG CCATAAGGGC GGCGTCGGCA AGATCGAGAT CTTCAAGGGC AGCAGCCGAG AAACCGTCGA ACTCAGGGAA GACCGCTGGC TCTATTCCTT CGAGACGGAC GCGGCCGGCG ATGCCATCCG CGCCGGCAAG ACCGAATTCA GCTCTCCCGG CATGAGCTGG GCGGATTCGA TCGGAAACCT GCGTGTACTC GACCAGTGGC GTGCCTCGGT CGGCCTCGAA TACGGCGTGG AAAAAGCCAG CAAACGCACG GCAAACATTG CCGGCGGCGC GATCGCGCGC GGCAATACCG TTCCGCAGCG TCAGATTCCT GGCATTTCCA AGCCCGCCTC GGTCGTTACA CTCGGCTTCG AATTCTTCCC GAACTTCGCC TCTGCCTCGC TGACGCTCGA CGCCTTTTAC GAAGCCGGCG GCAATGCCTT CGACACGGCC TATGTCTATG GCGGCGGCAA GACGGAAGCG ATCTTCGGTG ACTGGCACAC GAGCCGCAAG GTGGCTCGCG AGGAGATCGT GCTGATCGGC AAGGGCGCCC ATTCGCCGCT CTGCTATCCT GATATGATCG CAAAGCAGCT CGACCAGTCG CTTGCCCGGC TGAAGACCGA CTATGTCGAC ATCTATTTCA TGCATCGCGA CAATACCGAC GTGCCCGTCG GCGAGTTCGT CGATGCCATG GATGCCGAGG TCAAGCGTGG ACGCATCCGT GGCATATTCG GTGGCTCGAA CTGGACAAGG GCGCGCTTCG ACGAAGCGAT CGCCTATGCC GAAAAGACCG GCAAGACGGC GCCGGCAGCG CTTTCCAACA ACTTCTCGCT TGCCGAAATG CTCGATCCGA TCTGGGCCGG CTGCGTTGCT GCTTCCGACG ACGACTGGAA GAAATGGCTG AACGAGAAGC AGATCCCGAA CTTTGCCTGG TCGAGCCAGG GCCGCGGCTT CTTTACCGAC CGCGCCGGCC GCGACAAACG GGACGATGAG GAGATCGTCC GGGTCTGGTA TTCCGAGCGT 
AATTTCGGAC GCAGGGACCG CGCCATCGAG CTTGCCAACA AGCTCGGCCG CAATCCGATC CACATCGCAC TTGCCTATGT GATCGCCCAG CCTTTCCCGG TCATTCCGCT GATCGGGCCG CGCACCGTCG CCGAATTGGA AGACAGCCTC TCGGCGCTCG ACATCAAGCT GACGCCCGAG CAGGTGAAGT GGCTGGAAGG CTGA
|
Protein sequence | MTSDQPIRWG IIGPGTIART FADGVAHSRT GRLVAIATRN PAKPGLAEGF PGARIVDGYE ALLSDKEIDA IYIAVPHTGH AEWAIKAARA GKHILVEKPI ALSAYDAEAV YYEAKKAGVF AGEAFMYRVH PQTEKLVELV KSGVVGTVRI IRSSFGFNMG SYKPEHRLFA NDTAGGGILD VGGYPVSMAR LIAGAAEGKA FLEPEKVSGV AHLGESGVDE WASAVLKFPN EIIAEVSCSI MAQQDNVLRI IGSEGRIEVQ DFWFASGHKG GVGKIEIFKG SSRETVELRE DRWLYSFETD AAGDAIRAGK TEFSSPGMSW ADSIGNLRVL DQWRASVGLE YGVEKASKRT ANIAGGAIAR GNTVPQRQIP GISKPASVVT LGFEFFPNFA SASLTLDAFY EAGGNAFDTA YVYGGGKTEA IFGDWHTSRK VAREEIVLIG KGAHSPLCYP DMIAKQLDQS LARLKTDYVD IYFMHRDNTD VPVGEFVDAM DAEVKRGRIR GIFGGSNWTR ARFDEAIAYA EKTGKTAPAA LSNNFSLAEM LDPIWAGCVA ASDDDWKKWL NEKQIPNFAW SSQGRGFFTD RAGRDKRDDE EIVRVWYSER NFGRRDRAIE LANKLGRNPI HIALAYVIAQ PFPVIPLIGP RTVAELEDSL SALDIKLTPE QVKWLEG