Gene Rleg_5178 details

Gene Information

Locus tag: Rleg_5178
Symbol:
ID: 8007074
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 584225
End bp: 586162
Gene Length: 1938 bp
Protein Length: 645 aa
Translation table: 11
GC content: 60%
IMG OID: 644822088
Product: extracellular solute-binding protein family 5
Protein accession: YP_002973348
Protein GI: 241113513
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
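
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database record: the helper name `cds_lengths` is hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that contributes no residue.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the last codon is the stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(584225, 586162)
print(gene_len, protein_len)  # 1938 645
```

Under these assumptions the result matches the reported Gene Length (1938 bp) and Protein Length (645 aa).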


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.758105
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.361664
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAAAG AAATACTGTC GCGTGGCGTC ACCCGCCGCG CCGTGCTTGG TGGCATGGCT 
GGTGTCGCGG CGCTGTCTAT CGCCGGTCGC GTGTCTGCCG CAGGCGGTGA AGCGCCGGCA
CTTGCCCAAC TCGCCAAGGA TGGAAAGCTG CCGCCGCTCG CCGAGCGCCT GCCGAAAAAG
CCGATGGTCG TGACGCCGTT CGAGAAAGTC GGCACCTATG GCGGTTCGCT GCGCCGTGGC
CTTCGCGGTT CATCCGACCA TAACGGCATC CTGCGTATGG TCGGCAACCA GAGCCTTGTG
CGCTGGAATC TCGATTTTAC CGCCGTGCAG CCGAACCTTG CCGAGCGCTG GGAAGTCAGC
GACGACGCAA CACAATTCAC GTTCCATCTC ATCGAAGGCG TGCGCTGGTC GGACGGTCAT
CCCTTTACGG CCGATGACGT CGTTTTCGCG ATCGAAGACT GCGTCAAGAA CACCGAGCTC
TACAGCTCGA CACCGGCGCA GCTTGCCGTC GCCGGCAAGC CAGTCACCGT CGAGAAGATC
GACGATTACA CGGTGAAGTT CACCTTTGCC GCGGCCAATG CGCTTTATCT GGAAAACCTC
GCCACGCCGC TTGGTCAGCA TCCGACGCTG TTCCCGAAGC ATTACTGCAG CCAGTTCCTG
CCGAAATATA ATCCCAATAT CGAGGCCGAT GCGAAGAAGG CCGGCGTCAC CAGCTGGACG
GAGCTGTTCC GCAGCCGTTG CGGCGACATC GAGATCCCGT CGCGATGGGG CAATGTCGAC
AAGCCGACGC TCGATCCATG GGTGGTCAAG GAGCCCTATG CCGGCGGTGC GACGCGTGTC
GTCATGACCC GCAATCCCTA TTTCTGGCAG GTCGATACCG AGGGCAACCA GCTTCCCTAC
ATCGATGAAA TCAACTTCGG CATCTCACAG GACGTCGAAT CGCTGATGCT GAACGTCATC
TCTGGAAAGA TCGACATCCA GGAACGCCAC ATCAGCGTTC TCGCCAACAA GCCGACGCTG
TCCAAGAACA TGGAAAAGGG CGATTATCGG CTGCTGACGC TCGTGCCTTC GGCCTCGCAA
CAGTGCCAGA TCTATTTCAA CATCACCCAC AAAGATCCTG CCATGCGCAA GATGTTTGCC
GACAAGGCGT TCCGGCAAGC GCTTTCGATC GGCATCAATC GCCAGGAGCT CATCGACATC
GTCTATTTCG GACAGAGCGA GCCCTACCAG GCAGGGCCGC GTCCGACCCA TCCGTGGTAT
AACGAAAAAT ACGCGCGCCA ATTCACCGAA TTCGACGCCG ACAAGGCAGG CGCGATGCTC
GATGAGGCCG GCTATAAGAA AGGCGGCGAC GGTTTCCGCC TCCGGCCCGA CGGCCAGAAG
GTGTTCTTCT CGATCGACGT CATTCCGACG CTTTATCCCG ACCTCGTCGA TGCCCTGGAA
CTGGTCAAGG CGCATTGGGC TCAGATCGGT GTCGACATGA AGGTCAACAC GATCGAGCGG
GCGCTCTACT ACACCCGCGG CGACGACAAC GCCCATGACG CGGCGGTGTG GCCGGGTCCT
GGCGGTCTCG ATCCAATGCT CGATCCGCGC GATTTCTTCG CCTTCCATCC GCAGGGTTCG
CGTTACGCCA TTCCGTGGAC GCTTTGGTAC ACCTCCAACG GCGCACGCGG CGAAGAACCG
CCAGAAAGCC AGAAGAAGCG CATGAAGCTC TTCGACGAAG CGCGTTCGAC GGCCGATCTC
GACAAGCGCG GCGCAATCAT GAAGCAGATC TTCGACATCG CGGCCGAGGA GTTCGAGACC
GTCGGCCTTT GCCTTGCCGT CGGCGGTTTC GGCATCATCC GCAACAATCT GCGCAATGTT
CCCGAGAAGG AGCCGGATAG CTGGTCCTGG CCCAATCCCG GTCCGGCAAT GCCGCAGCAA
TTCACCTTCA CGAGCTGA
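
A GC content figure such as the one reported under Gene Information can be computed from a sequence laid out in 10-base blocks like the one above. The sketch below is illustrative only: the function name `gc_content` is hypothetical, it assumes the sequence contains only A, C, G and T, and it does not claim to reproduce how the database derived or rounded its 60% value.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks between 10-base blocks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence: 5 of 10 bases are G or C.
print(gc_content("ATGGCCAAAG"))  # 50.0
```

Passing the full gene sequence, with its spaces and line breaks left in, gives the value for the whole gene.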
 
Protein sequence
MAKEILSRGV TRRAVLGGMA GVAALSIAGR VSAAGGEAPA LAQLAKDGKL PPLAERLPKK 
PMVVTPFEKV GTYGGSLRRG LRGSSDHNGI LRMVGNQSLV RWNLDFTAVQ PNLAERWEVS
DDATQFTFHL IEGVRWSDGH PFTADDVVFA IEDCVKNTEL YSSTPAQLAV AGKPVTVEKI
DDYTVKFTFA AANALYLENL ATPLGQHPTL FPKHYCSQFL PKYNPNIEAD AKKAGVTSWT
ELFRSRCGDI EIPSRWGNVD KPTLDPWVVK EPYAGGATRV VMTRNPYFWQ VDTEGNQLPY
IDEINFGISQ DVESLMLNVI SGKIDIQERH ISVLANKPTL SKNMEKGDYR LLTLVPSASQ
QCQIYFNITH KDPAMRKMFA DKAFRQALSI GINRQELIDI VYFGQSEPYQ AGPRPTHPWY
NEKYARQFTE FDADKAGAML DEAGYKKGGD GFRLRPDGQK VFFSIDVIPT LYPDLVDALE
LVKAHWAQIG VDMKVNTIER ALYYTRGDDN AHDAAVWPGP GGLDPMLDPR DFFAFHPQGS
RYAIPWTLWY TSNGARGEEP PESQKKRMKL FDEARSTADL DKRGAIMKQI FDIAAEEFET
VGLCLAVGGF GIIRNNLRNV PEKEPDSWSW PNPGPAMPQQ FTFTS
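
The relationship between the gene sequence and the protein sequence can be checked with a short translation sketch. This is an assumption-labeled illustration, not the database's own procedure: the names `CODON_TABLE` and `translate` are hypothetical. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in the set of permitted start codons; because this gene begins with ATG, no special start-codon handling is included here.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Whitespace is ignored; codons not in the table are rendered as 'X'.
    """
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE.get(bases[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGGCCAAAG AAATACTGTC GCGTGGCGTC"))  # MAKEILSRGV
# Final 18 bases (in frame) give the last 5 residues, then the TGA stop.
print(translate("TTCACCTTCA CGAGCTGA"))  # FTFTS
```

Both outputs agree with the start (MAKEILSRGV) and end (FTFTS) of the protein sequence listed above.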