Gene Rleg2_5447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5447 
Symbol 
ID6978541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp1090073 
End bp1092010 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content61% 
IMG OID643394548 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002279366 
Protein GI209547448 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00390622 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTAATG AAATTCTGTC GCGCGGTGTT ACCCGCCGCG CCGTGCTTGG CGGCATGGCC 
GGTGCAGCGG CACTGTCCAT CGCGGGCCGC GCATTCGCCG CAGGCGGCGA AGCGCCCGCA
CTTGCGCAGC TTGTGAAAGA TGGAAAACTG CCGCCGCTCG CCGAGCGCCT GCCGAAGAAG
CCGATGGTCG TCAAGCCGTT CGAAAAAGTC GGCACCTATG GCGGCTCTCT GCGCCGCGGC
CTTCGCGGTT CATCCGACCA TAACGGCATT CTGCGCATGG TCGGTAACCA GAGCCTCGTG
CGCTGGAACC TCGAATTTAC CGCTGTGGAA CCGAACCTTG CCGAGCGCTG GGAAGTCAGC
GACGACGCGA CGCAATTCAC CTTCCATCTG ATCGAAGGCG TGCGCTGGTC GGACGGTCAT
CCCTTTACAT CAGATGACGT CGTTTTCGCG ATCGAAGACT GCGTCAAAAA CACCGAGCTC
TACAGCGCGA CACCGGCGCA GCTTGCTGTC GCCGGCAAGC CGGTTGTCGT CGAGAAGATC
GACGATCATA CGGTGAAGTT CACCTTTGCG GCGGCCAACG CGCTTTACCT GGAAAACCTC
GCCACGCCGC TCGGCCAGCA TCCGACGCTC TTTCCGAAGC ACTACTGCAG CCAGTTCCTG
CCGAAATATA ATCCCAACAT CGAAGCCGAT GCGAAGAAAG CCGGCGTCAC CAGCTGGACG
GAGCTGTTCC GCAGCCGCTG CGGCGACATC GAGATCCCGA CACGATGGGG CAATGTCGAC
AAGCCGACGC TCGATCCATG GGTGGTCAAG GAACCCTATG TCGGTGGCGC CACGCGTGTC
GTCATGACCC GCAATCCTTA TTTCTGGCAG GTCGATACCG AGGGCAATCA GCTTCCCTAT
ATCGATGAGA TCAACTTCGG CATCTCGCAG GACGTCGAAT CACTGATGCT GAACGTCATC
TCGGGCAAGA TCGACATCCA GGAGCGCCAT ATCAGCGTGC TTGCCAACAA GCCGACGCTG
TCCCAGAACA TGCAGAAGGG CGACTATCGT CTGCTGACGC TGGTGCCTTC GGCCTCGCAG
CAGTGCCAGA TCTACTTCAA CATCACCCAC AAAGATCCTG CCATGCGCAA GATGTTTGCG
GATAAGTCGT TCCGGCAGGC GTTGTCGATC GGCATTAATC GTCCGGAGCT CATCGATATC
GTCTATTTCG GCCAGAGCGA GCCTTACCAG GCCGGACCGC GTCCGACGCA TCCCTGGTAT
AACGAGAAAT ACGCGCGCCA ATTCACCGAA TTTGACGCCG ACAAGGCAGG CGCGATGCTC
GACCAGGCCG GCTACAAGAA AGGCGGCGAC GGTTTCCGCC TCCGGCCGGA CGGCCAGAAG
GTGTTTTTCT CGATCGACGT CATTCCGACG CTCTATCCCG ACCTCGTCGA CGCGCTCGAG
CTGGTCAAGA CGCATTGGGC CCAGATCGGC ATCGACATGA AGGTCAACAC CATCGAACGG
GCGCTCTATT ACACCCGCGG CGACGACAAT GCCCATGATG CGCAGGTATG GCCGGGCCCT
GGCGGTCTCG ATCCGATGCT CGATCCGCGC GATTTCTTCG CCTTCCATCC GCAGGGCTCG
CGTTACGCCA TTCCGTGGAC GCTGTGGTAC ACCTCCAACG GTGCACGCGG CGAAGAACCC
CCGGAAAGCC AGAAGAAGCG CATGAAGCTC TTCGACGAGG CGCGCTCGAC GGCCGATCTC
GACAAGCGCG GTGCGGTCAT GAAGCAGATC TTCGATATCG CCGCGGAGGA ATTCGAGACC
GTCGGGCTCT GCCTTGCCGT CGGCGGCTTC GGCATCATCC GCAACAATCT GCGCAATGTT
CCCGAAAAAG AGCCGGACAG CTGGTCCTGG CCCAATCCGG GGCCTGCGCT GCCGCAGCAG
TTCACCTTCA CGAGCTGA
 
Protein sequence
MPNEILSRGV TRRAVLGGMA GAAALSIAGR AFAAGGEAPA LAQLVKDGKL PPLAERLPKK 
PMVVKPFEKV GTYGGSLRRG LRGSSDHNGI LRMVGNQSLV RWNLEFTAVE PNLAERWEVS
DDATQFTFHL IEGVRWSDGH PFTSDDVVFA IEDCVKNTEL YSATPAQLAV AGKPVVVEKI
DDHTVKFTFA AANALYLENL ATPLGQHPTL FPKHYCSQFL PKYNPNIEAD AKKAGVTSWT
ELFRSRCGDI EIPTRWGNVD KPTLDPWVVK EPYVGGATRV VMTRNPYFWQ VDTEGNQLPY
IDEINFGISQ DVESLMLNVI SGKIDIQERH ISVLANKPTL SQNMQKGDYR LLTLVPSASQ
QCQIYFNITH KDPAMRKMFA DKSFRQALSI GINRPELIDI VYFGQSEPYQ AGPRPTHPWY
NEKYARQFTE FDADKAGAML DQAGYKKGGD GFRLRPDGQK VFFSIDVIPT LYPDLVDALE
LVKTHWAQIG IDMKVNTIER ALYYTRGDDN AHDAQVWPGP GGLDPMLDPR DFFAFHPQGS
RYAIPWTLWY TSNGARGEEP PESQKKRMKL FDEARSTADL DKRGAVMKQI FDIAAEEFET
VGLCLAVGGF GIIRNNLRNV PEKEPDSWSW PNPGPALPQQ FTFTS