Gene Rleg2_2834 details


Gene Information

Locus tag: Rleg2_2834
Symbol:
ID: 6981578
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 2884190
End bp: 2885770
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 57%
IMG OID: 643397546
Product: extracellular solute-binding protein family 5
Protein accession: YP_002282330
Protein GI: 209550413
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
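
The start, end, gene length, and protein length listed above can be checked against each other with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information entries.
start_bp = 2884190
end_bp = 2885770

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A coding sequence is read in codons of three bases.
codon_count, leftover_bases = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1581
print(leftover_bases)     # 0
print(protein_length_aa)  # 526
```

Both results agree with the listed gene length (1581 bp) and protein length (526 aa).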


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid Coverage information values:
Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.360597
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCAGT TCACGAAAAA ATTTCTCGCC TCCGCAATGT TTGGCACATT GCTGGCGTTT 
TCGGCGCATG CGGCCACGCT GAATATTCAC AATGGCGGCG ACCCGCAATC GCTCGATCCG
CAGAAGCTGT CCGGCGATTG GGAGAATCGT ATCGCCGGCG ACATTTTCGA AGGCTTGGTC
ACGGAAGACG CCAAGGACAA TCCGGTCCCC GGCCAGGCCG AAAGCTGGAC GATTTCACCT
GATGGCAAGG TTTACACCTT CAAGCTTCGC GACGGCATCA AATGGTCCGA TGGCCAGCCG
GTAACGGCAG GAGACTTCGT CTTCGCCTTC CAGCGCCTCG TCGACCCGAA GAACGCCGCC
GACTATGCCT ATCTGCAGTT CACCATCAAA AATGCCGAAA AGATCAACAA GGGTGAGATT
ACCGATCTCA ATCAGCTCGG CGTCAAGGCA ATCGACGACA AGACGCTTGA AATCACCCTC
GAAAACCCGA CCCCCTATTT CCTCAACGCT CTGATGCACT ACACCGCCTA TCCGCTGCCC
AAGCACGTGG TCGAGGCGAA GGGCCAGGAT TGGGTCAAGA TCGGCAATAT CGTCACCAAC
GGACCTTACA AGCCGGTCGA GTGGGTTCCG GGCTCGCATG TCACCACGGT CAAAAACGAC
CAGTGGTACG GCACCAAGGA CCTGAAGATC GACGGTGCCA AGTTCTTCGT GCTCGAGGAT
CAGGAAGCGG CACTGAAACG TTACCGCGCC GGCGAATTCG ATATCCTCAC CGATTTCCCC
ACCGACCAGT ATGAGTGGAT GAAGAAGAAC CTGCCGGGCC AGGCACATGT CGCTCCCTTC
TCCGGCCTCT ATTACTACGT CATCAATTCG ACCAAGCCGC CCTTCGCCGA CAAGCGCGTG
CGCCAGGCTC TCTCCATGGC GATCAACCGC GAAGTCATCG GCCCGCAGAT TCTCGGCACC
GGCGAACTGC CGGCCTATTC CTGGGTCCCG CCAGGCACGG CAAACTACGG CGAACCGGCC
TACGTCTCCT GGAAGGATCT TCCCTACAAG GACAAGGTCG AAGAAGCCAA GAAGCTGCTG
AAGGAAGCCG GTTTCGGTCC GGATCATCCG CTGACAGCCG AGCTCAAATA CAACACCAAC
GACAATCACA AGCGCATCGC CGTGGCGATC GCCTCGATGT GGAAGCCGCT TGGCGTCAAT
GTCGAACTCG TCAATGCCGA GACCAAGGTC CATTATGACC AGTTGCAGCG CGGCGAAGTG
CAGATCGGCC GCGCCGGCTG GCTGGCCGAC TATAACGACC CAGACAACTT CCTGAACCTC
CTGGTCACAG GCGTCCAGAT GAATTACGGC CGCTGGTCCA ATCCTGACTA CGACAAGATG
ATCAAGGACG GCAACGCCGA GACCGATCTC GCCAAGCGCG CCGCAATCTT CAAGAAAGCC
GAACAGCTGG CACTGGATGA TTCCGCCGCC CTGCCGATCT ACTATTATGT TTCGAAGAAC
GTCGTCTCAC CGAAGATCGA AGGCTTCGTC GATAACATCC AGGACATCCA CCGCACCCGC
TGGCTGTCGA TGAAAGAGTA A
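
The gene sequence is printed in blocks of ten bases, so it needs to be joined before any calculation such as GC content. The following is a minimal sketch of that step; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first row of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, used only as sample input.
first_row = "ATGAACCAGT TCACGAAAAA ATTTCTCGCC TCCGCAATGT TTGGCACATT GCTGGCGTTT"
sample = clean_sequence(first_row)

print(len(sample))  # 60
print(round(gc_percent(sample), 1))
```

Passing the full 1581-base sequence through the same two functions gives a value that can be compared with the GC content listed under Gene Information.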
 
Protein sequence
MNQFTKKFLA SAMFGTLLAF SAHAATLNIH NGGDPQSLDP QKLSGDWENR IAGDIFEGLV 
TEDAKDNPVP GQAESWTISP DGKVYTFKLR DGIKWSDGQP VTAGDFVFAF QRLVDPKNAA
DYAYLQFTIK NAEKINKGEI TDLNQLGVKA IDDKTLEITL ENPTPYFLNA LMHYTAYPLP
KHVVEAKGQD WVKIGNIVTN GPYKPVEWVP GSHVTTVKND QWYGTKDLKI DGAKFFVLED
QEAALKRYRA GEFDILTDFP TDQYEWMKKN LPGQAHVAPF SGLYYYVINS TKPPFADKRV
RQALSMAINR EVIGPQILGT GELPAYSWVP PGTANYGEPA YVSWKDLPYK DKVEEAKKLL
KEAGFGPDHP LTAELKYNTN DNHKRIAVAI ASMWKPLGVN VELVNAETKV HYDQLQRGEV
QIGRAGWLAD YNDPDNFLNL LVTGVQMNYG RWSNPDYDKM IKDGNAETDL AKRAAIFKKA
EQLALDDSAA LPIYYYVSKN VVSPKIEGFV DNIQDIHRTR WLSMKE
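
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is one way to do that under stated assumptions: it uses the amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in which codons may act as start codons), and it stops at the first stop codon. The function name `translate` and the table-building code are illustrative and not part of the database record.

```python
# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna_text: str) -> str:
    """Translate block-formatted DNA, stopping at the first stop codon."""
    seq = "".join(dna_text.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks and the last row of the gene sequence.
print(translate("ATGAACCAGT TCACGAAAAA ATTTCTCGCC"))  # MNQFTKKFLA
print(translate("TGGCTGTCGA TGAAAGAGTA A"))           # WLSMKE
```

The two outputs match the first block and the final residues of the protein sequence shown above. The last row can be translated on its own only because it begins on a codon boundary (the preceding 1560 bases are a multiple of three).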