Gene Rleg_1917 details

Gene Information

Locus tag: Rleg_1917
Symbol:
ID: 8012962
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1907207
End bp: 1909012
Gene Length: 1806 bp
Protein Length: 601 aa
Translation table: 11
GC content: 61%
IMG OID: 644824506
Product: extracellular solute-binding protein family 5
Protein accession: YP_002975738
Protein GI: 241204642
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
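
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; neither convention is stated in the record itself.

```python
# Values copied from the record above.
start_bp = 1907207
end_bp = 1909012

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon, not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1806 601
```

Under those assumptions the result agrees with the listed Gene Length (1806 bp) and Protein Length (601 aa).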


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.530573
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCCGGA TTGTCGTCCC TCTCGTCCTC TCGCTGCTGT GCGGCACTGT CGCTGCCGAG 
CCGCTGCACG GCATCGCGAT GCACGGCGAG CCGGCCTTGC CGGCCGACTA CAAACACTTC
CCTTACGTCA ACCCCGACGT GAAAAAGGGC GGCAAGATCA CTTACGGCGT CGTCGGCACC
TTCGACAGTC TCAACCCGTT CATCCTGAAG AGCATGCGCA CGACGGCGCG CGGCATGTGG
GATCCGGAAT ATGGCAATCT CGTCTACGAA TCGCTGATGC AGCGCTCCAG GGACGAACCC
TTCACGCTTT ACGGCCTGCT TGCCGAGACG GTCGAATGGG ACGACAACAG GAGCTTCATC
CAGTTCAATC TCAATCCGAA GGCGAAATGG GACGACGGCC AGCCGGTGAC GCCCGAAGAC
GTGATGTTCA CCTTCGAGCT GATGCGCGAC AAAGGCCTCC CGCGCTACGC AACCCCCCTG
AAGAAATTCG TCGCCAAGGT GGAAAGGGTC GGCGAGCGTA GCGTGCGCTT GACCTTCACC
GACAAAGCTA ACCGTGAGAC GCCTTTGATC TTCGGCCTTT TCCCGGTGCT GCCCAAACAT
GCGGTCGATC CCGAAACCTT CGACCGCACA GCGCTGACGC CGCCGGTCGG CTCTGGCCCC
TATAAGGTAA AGACGGTGAA GCCCGGCGAG AGCATCACCT ATGAGCGCGA CCCGAATTAC
TGGGGCAAGG ACATTCCCGC CAAGGTCGGC ACGGACAATT ACGACCAGAT CACCGTCCAG
TATTTCCTGC AGGACACGAC GCTGTTCGAG GCCTTCAAGA AAGGCGACGT CGACACCTAC
CCTGACGGCA ATCCCGGCCA TTGGGCCAAT GCCTATGATT TCCCCGCCGT CACCTCGGGG
GCTGCAATCA AGGATGCGTT CACGCCTAAA CTGCCGAGCG GCATGCTCGG CTTCGTGTTC
AACACGCGCC GGCCGATCTT TGCCGATCTC AAGGTGCGCG AGGGCCTGTC GCTGGTGTTC
GATTTTGAAT GGGCGAACAA GAACCTCTAT TCCGGAGCCT ACAAGCGTAC GCAGAGCTTC
TGGCAGAATT CCGAGCTGTC GAGCTTTGGC GTTCCGGCCG ATGCGAGGGA ACTTGCCCAG
CTCGGGCCGA TCAAGGACAA GATCGCGCCA GAGATTCTCG ACGGCACCTA CAAGCTGCCC
GTCACCGACG GCTCCGGCCG CGACCGCAAT GTACTGAAAC AGGCCGTCGC GCTGTTGAAG
CAGGGCGGCT ATACGATCCA AGGCGGCAAG ATGCTTGACG CCTCCGGCCG CCAGCTCGCC
TTCGAGATCA TGACGCAGAA TGCCGACCAG GAGAAACTCG CCGTCGCCTA CCAGCGTTCG
CTGCAGACGA TCGGCATCGC CGCCGCGATC CGTACGGTCG ACGATTCGCA GTATCAGAGC
CGGACGAACA GCTTCGATTA CGACATGATC CTCAAGTCCT ACACGTCTTC GCTCTCGCCT
GGAACCGAAC AGCTCGGCCG CTGGTCTTCG GCTGTGCGCA CGCAGGAGGG GACGGACAGC
TTTGCAGGCG CCAACGATCC CGATCTCGAC ACGCTGATCG ACCATCTGCT CGGGGCGCGC
TCCGCCGAGG ATTTCACCGC GGCGGTGCGC TCCTACGACC GGCTGCTGCT TTCCGGCCAT
TACGTGCTGC CGCTCTATCA TATGGACCAA CAATGGGTGG CGCGCAGCAA GCGCATCGGC
CATCCCGACA CGGTGCCGCT CTACGGCTAC CAGCTGCCGG TATGGTGGGA CGTGAGCGCG
CAATAG
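
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any calculation. The following is a minimal sketch of that cleanup and of a GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCTCCGGA TTGTCGTCCC TCTCGTCCTC TCGCTGCTGT GCGGCACTGT CGCTGCCGAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the complete gene sequence to the same functions gives a value that can be compared with the GC content field of the record.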
 
Protein sequence
MLRIVVPLVL SLLCGTVAAE PLHGIAMHGE PALPADYKHF PYVNPDVKKG GKITYGVVGT 
FDSLNPFILK SMRTTARGMW DPEYGNLVYE SLMQRSRDEP FTLYGLLAET VEWDDNRSFI
QFNLNPKAKW DDGQPVTPED VMFTFELMRD KGLPRYATPL KKFVAKVERV GERSVRLTFT
DKANRETPLI FGLFPVLPKH AVDPETFDRT ALTPPVGSGP YKVKTVKPGE SITYERDPNY
WGKDIPAKVG TDNYDQITVQ YFLQDTTLFE AFKKGDVDTY PDGNPGHWAN AYDFPAVTSG
AAIKDAFTPK LPSGMLGFVF NTRRPIFADL KVREGLSLVF DFEWANKNLY SGAYKRTQSF
WQNSELSSFG VPADARELAQ LGPIKDKIAP EILDGTYKLP VTDGSGRDRN VLKQAVALLK
QGGYTIQGGK MLDASGRQLA FEIMTQNADQ EKLAVAYQRS LQTIGIAAAI RTVDDSQYQS
RTNSFDYDMI LKSYTSSLSP GTEQLGRWSS AVRTQEGTDS FAGANDPDLD TLIDHLLGAR
SAEDFTAAVR SYDRLLLSGH YVLPLYHMDQ QWVARSKRIG HPDTVPLYGY QLPVWWDVSA
Q
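
The relationship between the gene and protein sequences can be illustrated by translating a few codons. This is a sketch, not the database's own procedure: it builds the codon table from the amino-acid string of the standard genetic code, which translation table 11 shares for every codon (the two tables differ only in the set of permitted start codons). `translate` is a hypothetical helper that stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code, same as table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 6 bases of the gene sequence above.
print(translate("ATGCTCCGGA TTGTCGTCCC TCTCGTCCTC"))  # MLRIVVPLVL
print(translate("CAATAG"))  # Q
```

Both outputs match the corresponding ends of the protein sequence: it begins with MLRIVVPLVL and ends with Q, followed by the TAG stop codon.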