Gene Rleg_3951 details

Gene Information

Locus tag: Rleg_3951
Symbol:
ID: 8014766
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 4025366
End bp: 4027267
Gene Length: 1902 bp
Protein Length: 633 aa
Translation table: 11
GC content: 61%
IMG OID: 644826520
Product: extracellular solute-binding protein family 5
Protein accession: YP_002977731
Protein GI: 241206635
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
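
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the protein length excludes the terminal stop codon. Neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4025366
end_bp = 4027267

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
```

Under those assumptions the computed values agree with the listed Gene Length (1902 bp) and Protein Length (633 aa).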


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTTCC GTAAATATGC AGCCTTTGCC TTATTGTCGC TCATGGCCCT TCCGGCATTT 
GCGCAGGATT TCATTGCCGG CATACCGCGC AACGAAACGC TGATCATCCA GGGCACGCCG
CAGCAGAATG CCGACTGGTT CAACGTCTGG GCTCCGGGCG GCGGGGCGGC GGCAAACCTC
AACGGCCTGC AGCAGCTGAC CACCGATACG CTGTGGTTCA TCAATCCCGA GGGCGGCAAG
GATGCCTGGC AGAATGCGCT TGCCAGCGAA CCGCCGAGCT ACAACGCCGA TTTCACCGAG
ATGAAGGTAA AGCTGCGCAA GGGGATCTTC TGGAGCGACG GTGTCGAATT TACCAGCGAC
GACGTGGTCT ATACCGTGCA GACGCAGATC GACCATCCCG GCATGGCCTG GAGCGCGCCC
TTCACCGTCA ATGTCGCGAG TATCGAAAAT CCTGATCCGC AGACCGTCGT CTTTCATCTG
AAGAAGCCGA ACTCGCGCTT CCACACGCTG TTCACGGTGC GTTGGAATGC CGCCTGGATC
ATGCCGAAGC ACGTCTTCGA GAAGGCCGCC GATCCGCTGT CCTTCAATAA TAATCCGCCG
GTTTCGCTCG GGCCCTACCA GCTGCAGAGC TATGACAAGG GCGGCAACTG GACGATCTGG
AAGCTGCGTG ATGACTGGCA GCGCACGTCG ATCGGCCTGG CTGCGGCACA ACCGCCTGAG
GTCAAATACG TCGTCTATCG GGCCGCCGGC AATCCGGAAG CGCGCGTCAT CGAACAGCGC
AACCACAATC TCGACGTCAT CAACGACATG GCGCCCGAGG GCATGTTCTC GATCATGCGC
GACAGCAAGA GCACGGCCTC CTGGCTGAAG GGCTTTCCTT TCGCACATCC GGACCCGACG
CTGCCTTCCG TTCTCTTCAA TACGAAGAAG GCGCCCTTCG ACAATAAAGA CGTGCGCTGG
GCTCTCGCTC TACTGATCGA TATCCGCGAA GTGGCGCTCG GCTCCTACCG CGGCGCGGCC
AATATCGCCG CCCTTGCCAC GCCGCCGACG GGTTCTGCCC CGGACGACTA TTACGCACCG
ATGCAGGACT GGCTGACGAA TTTCGAGCTC GACACCGGAT CCCGCAAGAT CAAGCCCTAC
GATCCCAACA TCTCGGCGCA GATCGCCAAT ATGGTGCGCA GCCAATGGGC CGATCAGATC
CCGACCGATC CGGCCAAGCT GCAGCGTACA TTCGGCTTCG GCTGGTGGAA GAAGGACGTT
CAGGCTGCAA CCGAGCTGCT GCAGAAGGCT GGTTTCAAGA AAAGCGGCCG CCAGTGGGTG
AAGCCTGACG GCACGCCCTT TACGATCCGC CTGCAGGTGG AAGGCGATGC CATCCCGACG
CTTGCCCGCG CCGGCACGGT GATCGCCCAG CAATGGTCGC AGGCCGGCAT CGCGACCAAG
GTCGATGTCG CCGGCCCGAC CAATGGCCAG CGCCTCAGCA CCGGCGATTT CGAGACGGCG
ATCTACTGGA GCATCGAGAC CTGGGGCGGT CATCCCGACC TCTCCTTCTT CCTCGACAGC
TATCATTCGG AGTTCATCAA GCCGGTCGGG CAGATCCAGC CGCCGCGCAA TCTGCAGCGC
TGGCAGGATC CGCGTCTCGA CCAGCTGATC GAGCGCAATC GGTCGATCGC CTTCGATTCG
CCCGATGTCG CCAAGCTCGG CCAGGACTTC CTGAAGCTTG CCGTCGAGGA AATGCCGATG
ATCCCGCTGA TGGCCTACAA CAAGTTCGCA CCGCTCGATA CGACCTACTG GACCAACTAT
CCGAGCGCTG ACAATCCCTA TTCGGCCTCG GGTCCGAACT GGTCGAACAT TCGCTACATG
GTGGTCGGGC TGAAGGCCAA TCCGGATGCG CCGAAGCCTT GA
 
Protein sequence
MKFRKYAAFA LLSLMALPAF AQDFIAGIPR NETLIIQGTP QQNADWFNVW APGGGAAANL 
NGLQQLTTDT LWFINPEGGK DAWQNALASE PPSYNADFTE MKVKLRKGIF WSDGVEFTSD
DVVYTVQTQI DHPGMAWSAP FTVNVASIEN PDPQTVVFHL KKPNSRFHTL FTVRWNAAWI
MPKHVFEKAA DPLSFNNNPP VSLGPYQLQS YDKGGNWTIW KLRDDWQRTS IGLAAAQPPE
VKYVVYRAAG NPEARVIEQR NHNLDVINDM APEGMFSIMR DSKSTASWLK GFPFAHPDPT
LPSVLFNTKK APFDNKDVRW ALALLIDIRE VALGSYRGAA NIAALATPPT GSAPDDYYAP
MQDWLTNFEL DTGSRKIKPY DPNISAQIAN MVRSQWADQI PTDPAKLQRT FGFGWWKKDV
QAATELLQKA GFKKSGRQWV KPDGTPFTIR LQVEGDAIPT LARAGTVIAQ QWSQAGIATK
VDVAGPTNGQ RLSTGDFETA IYWSIETWGG HPDLSFFLDS YHSEFIKPVG QIQPPRNLQR
WQDPRLDQLI ERNRSIAFDS PDVAKLGQDF LKLAVEEMPM IPLMAYNKFA PLDTTYWTNY
PSADNPYSAS GPNWSNIRYM VVGLKANPDA PKP
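
The protein sequence can be related to the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method. It assumes the standard codon assignments, which NCBI translation table 11 shares with table 1 (the two tables differ in their permitted start codons). The function names `clean`, `translate` and `gc_fraction` are hypothetical, and only the first 30 and last 12 bases of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Drop the spaces and line breaks that group the bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# Excerpts copied from the Gene sequence block of this record.
first_30 = "ATGAAGTTCC GTAAATATGC AGCCTTTGCC"
last_12 = "CCGAAGCCTT GA"

print(translate(first_30))  # MKFRKYAAFA
print(translate(last_12))   # PKP
```

The two excerpts translate to the first ten and last three residues of the protein sequence listed in this record. Passing the full gene sequence to `gc_fraction` would give a value to compare with the listed GC content.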