Gene Rleg2_5665 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5665 
Symbol 
ID6977056 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011366 
Strand
Start bp53724 
End bp57026 
Gene Length3303 bp 
Protein Length1100 aa 
Translation table11 
GC content64% 
IMG OID643393122 
Producthypothetical protein 
Protein accessionYP_002277940 
Protein GI209546050 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3459] Cellobiose phosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.24035 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCCGA ACCTTGACCG CAGCTTTCAA ATGCCCCGAC GCGACGATCT CGGCCTCTTC 
ACGATCAGCA ACGGCTCCGG CCTGACGATA TCGGCGCTGC CGAACGGCAC GCTGTTTGCC
ATCGAGTATG CCGACGACAA GGGGGCGGTG CAGATCAATC AGATCCAGGG TTCGCCGCTC
ATCGGCGGCA TCGGCCGCCT GTATCTGCGC ATCGGCGGCG CTCGGCCTGA TGTCGTCGAG
ATTGTCGGGC CGCGTGCCAA CGGCAGCTTC GGATACGATG CGACGAGCTT CTGCTGGAGC
GGCAAGACAG GCGATATCGC CTATGACGTT CGGCTCGAAC TCCATCCCTC GGAAACGGCA
TGGTTCTGGC GCGTCTCGCT CCGGCATCCG AAGGAGAAAA CCCTGCCGGC GGATCTGGTG
CTGATCCAGG ATGTCGGCCT TGGCGATCGC GGCTTCCTGA TGAACAGCGA AGCCTATGCC
TCGCAATATG TCGATCACCA TATCGCCGAT CATCCGGCAT TCGGCCCCGT GGTGATGAAC
CGGCAAAATC TCAAACAGGG TGGCGCCCGC AGTCCCTGGC TCGCCCAGGG CTGCCTCGAC
GGGACTGCCG CCTATGCCAC GGATGCGATG CAGCTGGTGC AGGCAAAAGA CCGTCTCGGC
GATCGGCTGG TCGGCCCCTT CGGCGCCAGC CTGCCGAGTG AACGGCGGCA GCAGGAAACG
GCCTGCCCGG CCATCCGGTC GAAACCGCTC GCCATTTCCG CAAGCGGTGC TGGTGCGACT
TTCTTTGCAG TATTTGCCGC CGATCATCCC GAGGCGTCGA GCGATGCCGA TCTTGCGCGG
CTCGACGGGC TTGCGGCCAC GGGAAGCGTT GCCGCCGGCC TCGAAGGCGT GACGCCGGTC
CGCAGCCTGC TGCAGGACGC GTCGCTGTTG GAGGTCGAGC CGCTCGACAA AAAGGCGATC
GGCCGGCTCT ATCCCGAGCG GAGCCTCGAA GAACGCGCCG GCGGGAAGCT GCTGTCGTTC
TTCACGCCGG ACGGCGCCCT GAACCGCCAC GTCGTCCTGC GCGAGAAAGA GCTTTTGGTG
GCGCGCCGCC ACGGCGCGAT CGTTAGAAGC GGTGCGAATA TGCTGCTCGA CGATTCTACT
CTTGCCGCCA CCTGCTGGAT GCAGGGCATT TTCGCCGCGC AGCTGACGAT CGGCAATACC
TCGTTCCACA AGCTCTTTTC CGTCTCCCGC GACCCCTACA ACCTGACGCG CGCCAGCGGG
CTGCGCATCC TGGCGGATCT GGGTGCCGGC TGGCAGCTGC TGGCGGTGCC GTCGGCGTTC
GAAATGGGGC TTAACGACTG CCGCTGGATC TACCAATGTT CCGAACGCAC GATCACCGTT
GCGGCGGTCG CCTCCGGCGA GGACGCGGCC ATGCAATGGA CCGTCTCCGC GGAGGGAAAG
CCGTGCCGCT TCCTGGTGTT CGGGCATGTC GTGCTTGGCG AGCGCGAATA TGACGCGGGC
GGGCAGATCG CGGTCGATGC CGCGCGCAAA CGCATCGCCT TCCGGCCGGA TCCGGCCTGG
CTCTGGGGCG AGCGTTATCC CGATGCCGGC TATTGGCTGG TGAGTTCGAC ACCCGACGCC
ATCGAGGAAA TCGGCGGCGA CGAACTGCTC TATACCGATG GCGTTGCGCG CAACGGCGCC
TTCGTCGCCC TGCGCTCCCG GCCGACGCAG GCCCTCTCAT TCGCCGTGGT CGGCTCGATG
ACCAATGCTG AAGAAGCCGA GCGGCTGGCG CAACGCTACG AGGCTGGCGT CACCGAGGCA
GCCATGCTGG CGCCGGCATC GAAATTCTGG CGGAACGCCG TTCGTGGTTT GACGATCGAT
AACCCTTCGC CGGACCTTGC CGCGCAGACG ACCCTGCTGC CCTGGCTCGC GCACGACGCC
ATCGTGCATC TGAGCGTTCC GCACGGCCTC GAGCAATATA CCGGTGCGGC CTGGGGCACA
CGCGACGCCT GCCAGGGGCC GATCGAATTC CTGCTCGCCT ACGAGCATGA CCGAGAAGCC
AAAGAGGTGG TAAAAACGGT CTTCAGCGAG CAGTACCTTG AGAAAGGCGA CTGGCCGCAA
TGGTTCATGC TGGAGCCCTA TGCCAACATA AGAGCAGGCG ACAGCCATGG CGACGTTATC
GTCTGGCCAT TGAAGGCGCT CTGCGACTAT ATCGAAGCGA CCGGCGATCT TGCCATTCTC
GACGAGAAGG TCTCCTGGCG CGATGAAAAG ACCATGCAAA AGGCGCCGAA GGCCGACAGC
ATTGCGGTCC ATGTCGACAA GCTGCTCGAT ACCGTTCGCG GCCAGTTCAT CCCGGGAACA
CATCTGATCC GTTATGGCGA GGGGGACTGG AACGATTCCC TGCAGCCGGC CGATCCGCAT
CTGCGCGACT GGATGGTCAG CAGCTGGACC GTCGCCCTGC TTTACGAGCA GATCGTCCGC
TATTCCGCGA TCCTGCGCCG TCTCGGCCAC GGGAAAAAGG CCAGAAATAA CAAGGCAAAA
TTGCTGAGGA AAATCGCAAC GGCGATGCGG CGGGATTTCA ACCGCCATCT CGTGCGGGAC
GGCATCGTGG CCGGCTACGG TATCTTCGAT CCCGCCCATG ACGGCGTCGA ATTGCTGCTG
CACCCGAGCG ACAGTCGCAC CGGCCTCTCC TACTCGCTGA TCGCGATGAC GCAGGCGATG
CTCGGCGGGC TGTTCACGCC GGATCAGCGA CGCGATCATA TGAAGCTGAT CGAAGAGCAT
CTGCTCTTCC CAGACGGCGT GCGGCTGATG GAGAAGCCGG CGACCTATGC CGGAGGACCG
GAGACGCTGT TTCGCCGGGC CGAATCCTCC TCCTTCGTCG GCCGCGAGAT CGGGCTGATG
TATGTGCATG CGCATCTGCG TTATTGCGAG ACGCTCGCTT TGGACGGCGA GGCGAACGAA
CTCTGGAAGG CGATTTCGCT CGTCAATCCC ATCGCCGTCA CCACGGCCCT GCCGCAGGCG
TCGTTGCGCC AGCGCAATAC CTATTTCAGC AGCAGCGACG CGGCCTTCCA TGATCGCTAT
CAGGCGGCGG CGCAATGGGC GCGCGTCAAG GCCGGAAAGG TCGCGGTCGA CGGCGGCTGG
CGCATCTATT CGAGCGGGCC CGGACTCTAT GCCAGGAGCT TCGTCGAAAA TATCCTCGGC
CTCAAACGAC GCTTCGGCCG GCGCAGACGC AAGCCGCTTC TTCCCGCGGT TCACGCTTCC
GTCGAGCTGC AGACGGATCA CGCCGCCTGG CGGCGGCTGA TGAAACCGAA GCCCGACGCG
TAA
 
Protein sequence
MAPNLDRSFQ MPRRDDLGLF TISNGSGLTI SALPNGTLFA IEYADDKGAV QINQIQGSPL 
IGGIGRLYLR IGGARPDVVE IVGPRANGSF GYDATSFCWS GKTGDIAYDV RLELHPSETA
WFWRVSLRHP KEKTLPADLV LIQDVGLGDR GFLMNSEAYA SQYVDHHIAD HPAFGPVVMN
RQNLKQGGAR SPWLAQGCLD GTAAYATDAM QLVQAKDRLG DRLVGPFGAS LPSERRQQET
ACPAIRSKPL AISASGAGAT FFAVFAADHP EASSDADLAR LDGLAATGSV AAGLEGVTPV
RSLLQDASLL EVEPLDKKAI GRLYPERSLE ERAGGKLLSF FTPDGALNRH VVLREKELLV
ARRHGAIVRS GANMLLDDST LAATCWMQGI FAAQLTIGNT SFHKLFSVSR DPYNLTRASG
LRILADLGAG WQLLAVPSAF EMGLNDCRWI YQCSERTITV AAVASGEDAA MQWTVSAEGK
PCRFLVFGHV VLGEREYDAG GQIAVDAARK RIAFRPDPAW LWGERYPDAG YWLVSSTPDA
IEEIGGDELL YTDGVARNGA FVALRSRPTQ ALSFAVVGSM TNAEEAERLA QRYEAGVTEA
AMLAPASKFW RNAVRGLTID NPSPDLAAQT TLLPWLAHDA IVHLSVPHGL EQYTGAAWGT
RDACQGPIEF LLAYEHDREA KEVVKTVFSE QYLEKGDWPQ WFMLEPYANI RAGDSHGDVI
VWPLKALCDY IEATGDLAIL DEKVSWRDEK TMQKAPKADS IAVHVDKLLD TVRGQFIPGT
HLIRYGEGDW NDSLQPADPH LRDWMVSSWT VALLYEQIVR YSAILRRLGH GKKARNNKAK
LLRKIATAMR RDFNRHLVRD GIVAGYGIFD PAHDGVELLL HPSDSRTGLS YSLIAMTQAM
LGGLFTPDQR RDHMKLIEEH LLFPDGVRLM EKPATYAGGP ETLFRRAESS SFVGREIGLM
YVHAHLRYCE TLALDGEANE LWKAISLVNP IAVTTALPQA SLRQRNTYFS SSDAAFHDRY
QAAAQWARVK AGKVAVDGGW RIYSSGPGLY ARSFVENILG LKRRFGRRRR KPLLPAVHAS
VELQTDHAAW RRLMKPKPDA