Gene Rleg2_4132 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4132 
Symbol 
ID6982904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp4312042 
End bp4313874 
Gene Length1833 bp 
Protein Length610 aa 
Translation table11 
GC content57% 
IMG OID643398862 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002283620 
Protein GI209551703 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCTT TGTGGTCGAA GATCGGTTTG TTTGTATCGC TTGCGGGTGT CTTGGCGCCC 
ATGACCGGAA CGGGTCAGGA CCAGCCGTTT CAGATCGGCA GTTCCGTCAT CAGTGAGATG
AAGTACAAGC AGGGCTTTGC GCATTTCGAC TACGTCAATC CCAATGCCCC AAAAGGCGGA
GACCTGCGCC TTTCTGCAAG CGGCGCTTTC GACACCTTCA ATCCTATCCT TGCCAAAGGC
CAGATAGCGG CAGGGCTCTC GCTCGTCTAC GACACGTTGA TGAAGCCGAC CGATGACGAG
CTCCTTGTCT CCTATGGTCT GCTTGCCGAG GGGCTGTCCT ATCCGGATGA CGTCTCAAGC
GCGACCTTTC GCCTACGCAA GGAAGCGAAA TGGGCGGATG GCCAGCCGAT AACGCCCGAC
GACGTCATCT TCAGTCTGGA TAAGACGAAG GAATTAAACG CTGCCACCGC GAACTATTAC
CGGCACGTGG TGAAGGCCGA AAAGACGGGC GATCGCGACG TCACTTTCAC CTTCGACGAA
AAGAACAATC GCGAGCTCCC GAATATTCTC GGCCAGTTGG TGATCGTGCC GAAACATTGG
TGGGAGGGGC AGGGGCCGGA CGGCAAGCCG CGCGACATCT CAAAGACGAC GCTCGAGCCT
GTGATGGGAT CGGGGCCTTA TAAGATCGCA TCCTTTTCGT CCGGCGCGAC GATCCGTTAT
GAACTGCGCG ACGATTATTG GGGCAAGGAT CTCAATGTGA ATGTCGGCCA GAACAATTTC
CGCAACGTCA TTTACACCTA TTTCGGCGAT CGCGATGTCG AGTTCGAAGC CTTTCGCGCC
GGCAATAGTG ACTTCTGGCA GGAGACAACG GCGTCCCGCT GGGCGACGGG TTATGATTTT
CCCGCAGTGA AGGAAGGACG CGTCAAGAAA GAAGAGGTTG CAAATCCGCT GCGCTCCACC
GGCATTCTGC AAGCGCTCGT GCCCAATATG CGGCGTGACC TTTTCAAGGA TGAACGGGTC
CGTGAGGCGC TGAATTACGG CCTCGATTTC GAGGAGCTGA ACCGGACCGT TGCCTTCAAC
AGCTACAAGC GCATCGACAG CTACTTCTGG AACACCGAAC TCGCCTCCTC CGGCCTGCCG
CAGGGGCGTG AACTGGAAAT ACTGCAGGCC ATGAAGGACA AGGTGCCGCC TGAGGTCTTC
ACGACGCCCT ACACCAATCC GGTCGGGGGC GATCCGCAGA AAAGCCGCGA CAACCTCCGC
AAGGCGATTG CATTGCTCAA AGAATCCGGA TGGGAGATCA AGAACAATCG CATGGTCAAT
GGCAAGACCG GCCAGCCGAT GAGTTTCGAG ATCCTGTTGT CGAGCCCGGT ATTGGAGCGC
TGGGCGGTGC CCTATGCCAA CAATCTCAAG AAAATCGGCA TCGATGCGCG GGTGCGCACA
GTCGACGCCT CGCAAGCCGT CAACCGCGAA CGCAGCTTCG ATTACGACAT GATCTGGAAT
GTCTGGGCGG AGACGATGAA CCCGGGCAAC GAGCAAGCAG ATTATTGGGG GTCTGGTTCG
GTCGACCAGC AGGGCTCCCA CAATTATGCG GGCATCGCCA ATCCGGCGGT CGATGAACTC
ATCCACATGA TCATCTTCGC ACCCAATCGT GCGGAACAGG TCGCAGCGAT CAAGGCAATG
GATCGCGTAT TGCTGGCAAA CCACTACGTC ATCCCGCTGT TCTATCGCGA TAGCTATAAC
CTCGCCTATT GGAACACGAT TACGCACCCG ACGGACTTCC CGACCTATGG ACTGGGTTTC
CCGGAAGCCT GGTGGTCCGC CTCGGCAAAA TGA
 
Protein sequence
MTALWSKIGL FVSLAGVLAP MTGTGQDQPF QIGSSVISEM KYKQGFAHFD YVNPNAPKGG 
DLRLSASGAF DTFNPILAKG QIAAGLSLVY DTLMKPTDDE LLVSYGLLAE GLSYPDDVSS
ATFRLRKEAK WADGQPITPD DVIFSLDKTK ELNAATANYY RHVVKAEKTG DRDVTFTFDE
KNNRELPNIL GQLVIVPKHW WEGQGPDGKP RDISKTTLEP VMGSGPYKIA SFSSGATIRY
ELRDDYWGKD LNVNVGQNNF RNVIYTYFGD RDVEFEAFRA GNSDFWQETT ASRWATGYDF
PAVKEGRVKK EEVANPLRST GILQALVPNM RRDLFKDERV REALNYGLDF EELNRTVAFN
SYKRIDSYFW NTELASSGLP QGRELEILQA MKDKVPPEVF TTPYTNPVGG DPQKSRDNLR
KAIALLKESG WEIKNNRMVN GKTGQPMSFE ILLSSPVLER WAVPYANNLK KIGIDARVRT
VDASQAVNRE RSFDYDMIWN VWAETMNPGN EQADYWGSGS VDQQGSHNYA GIANPAVDEL
IHMIIFAPNR AEQVAAIKAM DRVLLANHYV IPLFYRDSYN LAYWNTITHP TDFPTYGLGF
PEAWWSASAK