Gene Rleg2_1733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1733 
Symbol 
ID6980470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1772990 
End bp1774792 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content60% 
IMG OID643396456 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002281246 
Protein GI209549329 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.197004 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCCGGA TCGTCGTCCC CCTCGTCCTC TCGCTGCTGT GCGGCGCTGT TGCCGCCGAG 
CCGCTGCATG GCATCGCGAT GCATGGCGAG CCTGGCTTGC CGGCCGATTA CAAACACTTC
CCTTACGTCA ATCCCGACGT GAAGAAGGGC GGCAAGATCA CCTATGGAGT CGTCGGCACC
TTCGACAGCC TCAACCCGTT CATTCTGAAA AGCATGCGCA CGACGGCGCG CGGCATGTGG
GATCCGGAAT ATGGCAATCT CGTCTACGAA TCGCTGATGC AGCGCTCCAG GGACGAGCCC
TTCACGCTCT ACGGCCTTCT TGCCGAGACG GTGGAATGGG ACGACGCCCG GAGCTTCATC
CAGTTCAACC TCAATCCGAA GGCGAAATGG GCAGATGGCC AGCCGGTGAC GCCCGAGGAT
GTGATGTTCA CCTTCGAGCT GATGCGCGAC AAGGGGCGCG TGCCCTTCGC CAACCGCCTC
AACGTCGTCG CCAAGATGGA AAAGGTCGGC GAAAACAGCG TGCGCTTCAC CTTCAACGAC
AAGGCCGACC GCGAGACTCC TTTGATTTTC GGTCTTTTCC CGGTCCTGCC GAAACACGCG
ATCGATCCGG AAACCTTCGA CCGCTCGTCG CTGACACCGC CTGTGGGATC CGGTCCCTAC
AAGGTGAAGA CGGTGAAGCC CGGCGAGAGC ATCACCTATG AGCGCGATCC CAATTACTGG
GGCAAGGACA TTCCCTCCAA GGTCGGCACC GACAATTACG ATCAGATCAC CGTCCAGTAT
TTCCTGCAGG ACACGACGCT GTTCGAGGCC TTCAAGAAGG GCGATGTCGA CGTCTATCCC
GACGGCAATC CCGGCCACTG GGCCAATGCC TATAATTTCC CCGCGGTCAC CTCAGGCGCC
GTCGTCAAGG ACGTATTCAC ACCAAAACTG CCGAGCGGCA TGCTCGGCTT CGTGTTCAAC
ACGCGCCGGC CGATCTTTGC CGACACCAAG GTGCGCGAAG GCCTGTCGTT GGTGTTCGAT
TTCGAATGGG CAAACAAGAA CCTTTATTCC GGCGCCTATA AGCGCACCCA GAGCTTCTGG
CAGAATTCGG AGTTGTCCAG TTTCGGCGTT CCCGCCAATG CGGCCGAACT TGCGTTGCTC
GGACCGATCA AGGACAAAAT CGCACCCGCG ATTCTCGACG GCACCTACAA GCTTCCGGTC
ACTGACGGCT CCGGCCGCGA CCGCGATGTG CTGAAGCAGG CCGTTGGACT GTTGAAACAG
GGCGGCTATA CGATCCAGGG CGGCAAGATG CTGGATGCCT CCGGCCGCCA GCTCGCCTTC
GAGATCATGA CGCAGAACGC CGATCAGGAG AAACTCGCCA TTGCCTATCA GCGTTCGCTG
CAGACAATCG GCATCGCCGC TTCGATCCGC ACCGTCGACG ATTCGCAGTA TCAGAGCCGG
ACGAATAGCT TCGACTACGA CATGATCATG AAGTCTTACA CCTCGTCGCT GTCGCCCGGA
AACGAACAGC TCGGCCGCTG GTCGTCGGCT GCGCGCACGC GCGAGGGTAC GGACAGTTTT
GCCGGCGCCA ATGATCCCGA TATCGACACG CTGATCGATC ATCTGCTGAG GGCACGCTCG
GCTGAGGATT TCACCGCGGC GGTGCGCTCC TACGATCGGC TGCTGCTTTC CGGCCATTAC
GTGCTGCCGC TCTATCATAT CGACCAGCAA TGGGTGGCTC ACAGCAAACG CATCGGCGGT
CCCGACAGCG TACCGCTCAA TGGCTATCAA CTACCGGTCT GGTGGGATAC GAGCGTGCAG
TAG
 
Protein sequence
MLRIVVPLVL SLLCGAVAAE PLHGIAMHGE PGLPADYKHF PYVNPDVKKG GKITYGVVGT 
FDSLNPFILK SMRTTARGMW DPEYGNLVYE SLMQRSRDEP FTLYGLLAET VEWDDARSFI
QFNLNPKAKW ADGQPVTPED VMFTFELMRD KGRVPFANRL NVVAKMEKVG ENSVRFTFND
KADRETPLIF GLFPVLPKHA IDPETFDRSS LTPPVGSGPY KVKTVKPGES ITYERDPNYW
GKDIPSKVGT DNYDQITVQY FLQDTTLFEA FKKGDVDVYP DGNPGHWANA YNFPAVTSGA
VVKDVFTPKL PSGMLGFVFN TRRPIFADTK VREGLSLVFD FEWANKNLYS GAYKRTQSFW
QNSELSSFGV PANAAELALL GPIKDKIAPA ILDGTYKLPV TDGSGRDRDV LKQAVGLLKQ
GGYTIQGGKM LDASGRQLAF EIMTQNADQE KLAIAYQRSL QTIGIAASIR TVDDSQYQSR
TNSFDYDMIM KSYTSSLSPG NEQLGRWSSA ARTREGTDSF AGANDPDIDT LIDHLLRARS
AEDFTAAVRS YDRLLLSGHY VLPLYHIDQQ WVAHSKRIGG PDSVPLNGYQ LPVWWDTSVQ