Gene Information |
Locus tag | Rleg2_5665 |
Symbol | |
ID | 6977056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | - |
Start bp | 53724 |
End bp | 57026 |
Gene Length | 3303 bp |
Protein Length | 1100 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643393122 |
Product | hypothetical protein |
Protein accession | YP_002277940 |
Protein GI | 209546050 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3459] Cellobiose phosphorylase |
TIGRFAM ID | |
| |
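The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption that is consistent with the listed numbers, not something the record states) and that the gene length includes the terminal stop codon, as the TAA at the end of the gene sequence suggests. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 53724
END_BP = 57026
GENE_LENGTH_BP = 3303
PROTEIN_LENGTH_AA = 1100


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of amino acids in an unspliced CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the stop codon, which is not translated.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print("span matches gene length:",
          span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("codon count matches protein length:",
          expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

The same two checks apply to any unspliced, non-pseudo CDS record; spliced genes would need per-exon spans instead of a single start and end.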
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.24035 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCCGA ACCTTGACCG CAGCTTTCAA ATGCCCCGAC GCGACGATCT CGGCCTCTTC ACGATCAGCA ACGGCTCCGG CCTGACGATA TCGGCGCTGC CGAACGGCAC GCTGTTTGCC ATCGAGTATG CCGACGACAA GGGGGCGGTG CAGATCAATC AGATCCAGGG TTCGCCGCTC ATCGGCGGCA TCGGCCGCCT GTATCTGCGC ATCGGCGGCG CTCGGCCTGA TGTCGTCGAG ATTGTCGGGC CGCGTGCCAA CGGCAGCTTC GGATACGATG CGACGAGCTT CTGCTGGAGC GGCAAGACAG GCGATATCGC CTATGACGTT CGGCTCGAAC TCCATCCCTC GGAAACGGCA TGGTTCTGGC GCGTCTCGCT CCGGCATCCG AAGGAGAAAA CCCTGCCGGC GGATCTGGTG CTGATCCAGG ATGTCGGCCT TGGCGATCGC GGCTTCCTGA TGAACAGCGA AGCCTATGCC TCGCAATATG TCGATCACCA TATCGCCGAT CATCCGGCAT TCGGCCCCGT GGTGATGAAC CGGCAAAATC TCAAACAGGG TGGCGCCCGC AGTCCCTGGC TCGCCCAGGG CTGCCTCGAC GGGACTGCCG CCTATGCCAC GGATGCGATG CAGCTGGTGC AGGCAAAAGA CCGTCTCGGC GATCGGCTGG TCGGCCCCTT CGGCGCCAGC CTGCCGAGTG AACGGCGGCA GCAGGAAACG GCCTGCCCGG CCATCCGGTC GAAACCGCTC GCCATTTCCG CAAGCGGTGC TGGTGCGACT TTCTTTGCAG TATTTGCCGC CGATCATCCC GAGGCGTCGA GCGATGCCGA TCTTGCGCGG CTCGACGGGC TTGCGGCCAC GGGAAGCGTT GCCGCCGGCC TCGAAGGCGT GACGCCGGTC CGCAGCCTGC TGCAGGACGC GTCGCTGTTG GAGGTCGAGC CGCTCGACAA AAAGGCGATC GGCCGGCTCT ATCCCGAGCG GAGCCTCGAA GAACGCGCCG GCGGGAAGCT GCTGTCGTTC TTCACGCCGG ACGGCGCCCT GAACCGCCAC GTCGTCCTGC GCGAGAAAGA GCTTTTGGTG GCGCGCCGCC ACGGCGCGAT CGTTAGAAGC GGTGCGAATA TGCTGCTCGA CGATTCTACT CTTGCCGCCA CCTGCTGGAT GCAGGGCATT TTCGCCGCGC AGCTGACGAT CGGCAATACC TCGTTCCACA AGCTCTTTTC CGTCTCCCGC GACCCCTACA ACCTGACGCG CGCCAGCGGG CTGCGCATCC TGGCGGATCT GGGTGCCGGC TGGCAGCTGC TGGCGGTGCC GTCGGCGTTC GAAATGGGGC TTAACGACTG CCGCTGGATC TACCAATGTT CCGAACGCAC GATCACCGTT GCGGCGGTCG CCTCCGGCGA GGACGCGGCC ATGCAATGGA CCGTCTCCGC GGAGGGAAAG CCGTGCCGCT TCCTGGTGTT CGGGCATGTC GTGCTTGGCG AGCGCGAATA TGACGCGGGC GGGCAGATCG CGGTCGATGC CGCGCGCAAA CGCATCGCCT TCCGGCCGGA TCCGGCCTGG CTCTGGGGCG AGCGTTATCC CGATGCCGGC TATTGGCTGG TGAGTTCGAC ACCCGACGCC ATCGAGGAAA TCGGCGGCGA CGAACTGCTC TATACCGATG GCGTTGCGCG CAACGGCGCC TTCGTCGCCC TGCGCTCCCG GCCGACGCAG GCCCTCTCAT TCGCCGTGGT CGGCTCGATG ACCAATGCTG AAGAAGCCGA GCGGCTGGCG CAACGCTACG AGGCTGGCGT CACCGAGGCA 
GCCATGCTGG CGCCGGCATC GAAATTCTGG CGGAACGCCG TTCGTGGTTT GACGATCGAT AACCCTTCGC CGGACCTTGC CGCGCAGACG ACCCTGCTGC CCTGGCTCGC GCACGACGCC ATCGTGCATC TGAGCGTTCC GCACGGCCTC GAGCAATATA CCGGTGCGGC CTGGGGCACA CGCGACGCCT GCCAGGGGCC GATCGAATTC CTGCTCGCCT ACGAGCATGA CCGAGAAGCC AAAGAGGTGG TAAAAACGGT CTTCAGCGAG CAGTACCTTG AGAAAGGCGA CTGGCCGCAA TGGTTCATGC TGGAGCCCTA TGCCAACATA AGAGCAGGCG ACAGCCATGG CGACGTTATC GTCTGGCCAT TGAAGGCGCT CTGCGACTAT ATCGAAGCGA CCGGCGATCT TGCCATTCTC GACGAGAAGG TCTCCTGGCG CGATGAAAAG ACCATGCAAA AGGCGCCGAA GGCCGACAGC ATTGCGGTCC ATGTCGACAA GCTGCTCGAT ACCGTTCGCG GCCAGTTCAT CCCGGGAACA CATCTGATCC GTTATGGCGA GGGGGACTGG AACGATTCCC TGCAGCCGGC CGATCCGCAT CTGCGCGACT GGATGGTCAG CAGCTGGACC GTCGCCCTGC TTTACGAGCA GATCGTCCGC TATTCCGCGA TCCTGCGCCG TCTCGGCCAC GGGAAAAAGG CCAGAAATAA CAAGGCAAAA TTGCTGAGGA AAATCGCAAC GGCGATGCGG CGGGATTTCA ACCGCCATCT CGTGCGGGAC GGCATCGTGG CCGGCTACGG TATCTTCGAT CCCGCCCATG ACGGCGTCGA ATTGCTGCTG CACCCGAGCG ACAGTCGCAC CGGCCTCTCC TACTCGCTGA TCGCGATGAC GCAGGCGATG CTCGGCGGGC TGTTCACGCC GGATCAGCGA CGCGATCATA TGAAGCTGAT CGAAGAGCAT CTGCTCTTCC CAGACGGCGT GCGGCTGATG GAGAAGCCGG CGACCTATGC CGGAGGACCG GAGACGCTGT TTCGCCGGGC CGAATCCTCC TCCTTCGTCG GCCGCGAGAT CGGGCTGATG TATGTGCATG CGCATCTGCG TTATTGCGAG ACGCTCGCTT TGGACGGCGA GGCGAACGAA CTCTGGAAGG CGATTTCGCT CGTCAATCCC ATCGCCGTCA CCACGGCCCT GCCGCAGGCG TCGTTGCGCC AGCGCAATAC CTATTTCAGC AGCAGCGACG CGGCCTTCCA TGATCGCTAT CAGGCGGCGG CGCAATGGGC GCGCGTCAAG GCCGGAAAGG TCGCGGTCGA CGGCGGCTGG CGCATCTATT CGAGCGGGCC CGGACTCTAT GCCAGGAGCT TCGTCGAAAA TATCCTCGGC CTCAAACGAC GCTTCGGCCG GCGCAGACGC AAGCCGCTTC TTCCCGCGGT TCACGCTTCC GTCGAGCTGC AGACGGATCA CGCCGCCTGG CGGCGGCTGA TGAAACCGAA GCCCGACGCG TAA
|
Protein sequence | MAPNLDRSFQ MPRRDDLGLF TISNGSGLTI SALPNGTLFA IEYADDKGAV QINQIQGSPL IGGIGRLYLR IGGARPDVVE IVGPRANGSF GYDATSFCWS GKTGDIAYDV RLELHPSETA WFWRVSLRHP KEKTLPADLV LIQDVGLGDR GFLMNSEAYA SQYVDHHIAD HPAFGPVVMN RQNLKQGGAR SPWLAQGCLD GTAAYATDAM QLVQAKDRLG DRLVGPFGAS LPSERRQQET ACPAIRSKPL AISASGAGAT FFAVFAADHP EASSDADLAR LDGLAATGSV AAGLEGVTPV RSLLQDASLL EVEPLDKKAI GRLYPERSLE ERAGGKLLSF FTPDGALNRH VVLREKELLV ARRHGAIVRS GANMLLDDST LAATCWMQGI FAAQLTIGNT SFHKLFSVSR DPYNLTRASG LRILADLGAG WQLLAVPSAF EMGLNDCRWI YQCSERTITV AAVASGEDAA MQWTVSAEGK PCRFLVFGHV VLGEREYDAG GQIAVDAARK RIAFRPDPAW LWGERYPDAG YWLVSSTPDA IEEIGGDELL YTDGVARNGA FVALRSRPTQ ALSFAVVGSM TNAEEAERLA QRYEAGVTEA AMLAPASKFW RNAVRGLTID NPSPDLAAQT TLLPWLAHDA IVHLSVPHGL EQYTGAAWGT RDACQGPIEF LLAYEHDREA KEVVKTVFSE QYLEKGDWPQ WFMLEPYANI RAGDSHGDVI VWPLKALCDY IEATGDLAIL DEKVSWRDEK TMQKAPKADS IAVHVDKLLD TVRGQFIPGT HLIRYGEGDW NDSLQPADPH LRDWMVSSWT VALLYEQIVR YSAILRRLGH GKKARNNKAK LLRKIATAMR RDFNRHLVRD GIVAGYGIFD PAHDGVELLL HPSDSRTGLS YSLIAMTQAM LGGLFTPDQR RDHMKLIEEH LLFPDGVRLM EKPATYAGGP ETLFRRAESS SFVGREIGLM YVHAHLRYCE TLALDGEANE LWKAISLVNP IAVTTALPQA SLRQRNTYFS SSDAAFHDRY QAAAQWARVK AGKVAVDGGW RIYSSGPGLY ARSFVENILG LKRRFGRRRR KPLLPAVHAS VELQTDHAAW RRLMKPKPDA
|
| |
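The sequences above are displayed in space-separated blocks of 10 residues, so they need to be joined before any computation. The following is a minimal sketch of that cleanup plus a GC-content calculation and a stop-codon check. The helper names are hypothetical, and the stop codon set (TAA, TAG, TGA) is the standard set for translation table 11; the short strings in the demo are the first two blocks and the final blocks of the gene sequence shown above.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used to display residues in blocks of 10."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already-cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def ends_with_stop(dna: str) -> bool:
    """True if the last three bases of a cleaned CDS form a stop codon."""
    return len(dna) >= 3 and dna[-3:] in STOP_CODONS


if __name__ == "__main__":
    head = clean_sequence("ATGGCCCCGA ACCTTGACCG")  # first two blocks of the gene
    tail = clean_sequence("GCCCGACGCG TAA")         # last blocks of the gene
    print("GC fraction of first 20 bases:", gc_fraction(head))
    print("starts with ATG:", head.startswith("ATG"))
    print("ends with stop codon:", ends_with_stop(tail))
```

Running `gc_fraction` on the full cleaned gene sequence, rather than the 20-base excerpt used here, is how the GC content field in the record could be compared against the sequence itself.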