Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1838 |
Symbol | |
ID | 6980576 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1888011 |
End bp | 1889207 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643396560 |
Product | protein of unknown function UPF0118 |
Protein accession | YP_002281349 |
Protein GI | 209549432 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00200619 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0419691 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGTGTGT TCGACCGTCA GAAAAGCAAT CATGAACCAC GATGGCTCGG ATCATCGGCG CCGACACGGA CGCCGCTGAT ACCCTCCATC TCGGCGGCGC GCTGGCTGCT GGTTCTGGTC GTTGCGGCGG GCGTTTACTT CTTCTACGGT TTCCTCGTGC CGGTGCTGGC AGCCCTGGTC ATCGGCTTCG CCAGCTGGCC GCTCTACCGC AAGCTGCTTG CCCGCGTCGG CGGCAATACG ACGATCGCCG CGACGATCGC CATCATCATG ATCATCACTT TCCTGGTCAT CCCGATCGGG CTTGCGATCA CCTATACGAC GGGTGAAGTG CGCACCTGGG TTGCCTGGGC AATCCATGCC AACCGCGCCG GCGCCCCGAC ACCGGCCTGG ATCGTCGCCC TGCCATGGGC CGGCGCCTAT CTCGATGAAG TCTGGACCAA ATATATCGGC AGCCCCGGCG CCCTGGGCGA AGTGATACAG GCGGTCAGCG GCGCCAATAT CGGCAATATC TACCGTGCCG TGCTTGCGGC CGGCGGTGGC GCCTTCCACC TCCTGCTGAC GCTGCTCTTC ATGCTGATCG CGCTGTTCTT CGTCTATCGC GACGGTTTTT CCTTCTCCAA GCAGATCGAC ATGCTCGGCG AGCGCATCCT GCCAAACCGC TGGGAGCGCA TTTCCCGCGT CGTGCCGGCA ACGATCAGCT CCACCGTCAT GGGCATGACG CTGATTGCGA TCGGCGAAGG CATCGTGCTC GGCCTGGCTT ACTGGATCGC CGGCGTGCCC TCGCCCGTCA CGCTCGGCGT GTTGACGGGC GTGATGGCGC TGATACCGGG CGGCGCACCG CTTTCCTTCA CGCTGGTCTC AATCTATCTC TTGGCAAGCG GCTCGCATGT CGCCGGCATC GGTCTTTTCG TCTGGGGAAC GGTCGAGCTC TTCATAGTCG ACAAGACGTT GCGGCCGAAG CTCGTCGGCG GTCCGATCAA GCTGCCCTTC CTGCCGACCT TCTTCGGTCT CGTCGGCGGC GTCAAGACGA TGGGCTTCCT CGGCCTCTTC ATTGGCCCGG TGCTGATGGC GCTGATCGTC GCCATCTGGC GCGAATGGAT CCACGAGGCC CGCAACGCCG AAAAGGGCGA GACGGGACCG CAGGTCCTCA TCGACGAGCA AGCCCCGCCG GCAATCCCCC GTATCGCCGA AGGCTGA
|
Protein sequence | MGVFDRQKSN HEPRWLGSSA PTRTPLIPSI SAARWLLVLV VAAGVYFFYG FLVPVLAALV IGFASWPLYR KLLARVGGNT TIAATIAIIM IITFLVIPIG LAITYTTGEV RTWVAWAIHA NRAGAPTPAW IVALPWAGAY LDEVWTKYIG SPGALGEVIQ AVSGANIGNI YRAVLAAGGG AFHLLLTLLF MLIALFFVYR DGFSFSKQID MLGERILPNR WERISRVVPA TISSTVMGMT LIAIGEGIVL GLAYWIAGVP SPVTLGVLTG VMALIPGGAP LSFTLVSIYL LASGSHVAGI GLFVWGTVEL FIVDKTLRPK LVGGPIKLPF LPTFFGLVGG VKTMGFLGLF IGPVLMALIV AIWREWIHEA RNAEKGETGP QVLIDEQAPP AIPRIAEG
|
| |