Gene Rleg2_4497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4497 
Symbol 
ID6977591 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp131924 
End bp133513 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content62% 
IMG OID643393675 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002278493 
Protein GI209546575 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.122706 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGACT ATAGCAAATA TCTCGCAGGC CGCGTCACCG CGGGCGGTCT CAGCCGCCGC 
GAATTCATGG GACGCGCCGT GGCGGCGGGC ATCACACTTG CCGTCGCCGA CAAGCTCTTC
ACCGAAAGTG CGCAAGCCGC CGAACCGAAG CGTGGCGGTC ACTTGAAACT CGGCCTCGAA
GGCGGTGCAG CCACCGATTC CAAGGACCCG GCAAAATTCC TGTCGCAGTT CATGTTCTGC
GTCGGCCGCT GCTGGGGCGA CATGCTTGTT GAATCCGACC CGCTGACCGG TGCCGCCGTG
CCGGCGCTCG CCGAGTCCTG GGAAGCCTCG AAAGACGCGG TCACCTGGAC CTTCAAGATC
CGCAAGGGCG TCAAGTTCCA CGACGGCAAG GAAATGACGA TCGACGACGT TGTCGCGACG
CTAAAGCGCC ACACCGATAA AAAGTCGGAG TCCGGCGCGC TCGGCGTTCT CGGCTCGATC
AAGGAGATCA AGGCCGACGG CGGCAATCTC GTCCTGGCGC TCAGCGAAGG CAATGCCGAT
ATGCCGCTGC TGCTGTCGGA CTACCATCTG GTCATCCAGC CGAATGGCGG CGTCGACGAC
CCGCTCGCCT CGATCGGCAC CGGTCCCTAT AAGCTGACGA GCTTCGAGGC CGGCGTCCGC
GCCACCTTCG AAAAGAACAA GGAGGACTGG CGCAGCGACC GGGGTTATGT CGATTCGATC
GAGGTGATCG GCATGAACGA CGCCACCGCC CGTATCGCAG CACTTTCGTC CGGCCAGGTG
CATTACATCA ACCGCGTCGA CCCGAAGACT GTCAACCTGT TGAAACGCGC GCCTAATGTC
GAGATCCTCT CGACCGCCGG CCGCGGCCAT TACGTCTTCA TCATGCACTG CGACAAGGCG
CCTTTCGACA ATAACGACCT GCGCCTGGCG CTCAAATACG CCATGGACCG CGAGACCATG
GTGCAGAAGA TCCTCGGCGG TTACGGCAAG GTCGGCAACG ACTTCCCGAT CAACAGCACC
TACGCGCTGT TTCCCGAGGG CATCGAGCAG CGTGTTTACG ATCCTGACAA GGCCGCCTTC
CACTACAAGA AATCGGGCCA TAGCGGCTCG GTGCTGCTGC GCACCTCCGA AGTCGCCTTC
CCCGGCGGCG TCGATGCGGC CGTGCTCTAT CAGGAAAGCT GCAAGAAGGC CGGGATCGAG
ATCGAGGTCA AGCGCGAACC GGGCGACGGC TACTGGACCA ACGTCTGGAA CGTCCAGCCC
TTCTCGACCT CCTATTGGGG CGGCCGGCCG ACGCAGGACC AGATGTATTC CACAGCCTAT
CTCTCGACGG CGGACTGGAA CGACACCCGG TTCAAGCGTC CCGATTTCGA CAAGCTGCTG
CTACAGGCCC GCTCGGAACT TGATGAAGCC AAGCGCAAGG ACATGTACCG CACCATGGCG
ATGATGGTGC GCGACGAAGG CGGCGTGATC CTGCCGATGT TCAACGACTT CGTGAACGCC
TCCACCAAGC AGGTGAAGGG TTACGTCCAC GACATCGGCA ACGACATGTC GAACGGCTAT
GTCGCCACCC GCGTCTGGTT GGACGCCTGA
 
Protein sequence
MNDYSKYLAG RVTAGGLSRR EFMGRAVAAG ITLAVADKLF TESAQAAEPK RGGHLKLGLE 
GGAATDSKDP AKFLSQFMFC VGRCWGDMLV ESDPLTGAAV PALAESWEAS KDAVTWTFKI
RKGVKFHDGK EMTIDDVVAT LKRHTDKKSE SGALGVLGSI KEIKADGGNL VLALSEGNAD
MPLLLSDYHL VIQPNGGVDD PLASIGTGPY KLTSFEAGVR ATFEKNKEDW RSDRGYVDSI
EVIGMNDATA RIAALSSGQV HYINRVDPKT VNLLKRAPNV EILSTAGRGH YVFIMHCDKA
PFDNNDLRLA LKYAMDRETM VQKILGGYGK VGNDFPINST YALFPEGIEQ RVYDPDKAAF
HYKKSGHSGS VLLRTSEVAF PGGVDAAVLY QESCKKAGIE IEVKREPGDG YWTNVWNVQP
FSTSYWGGRP TQDQMYSTAY LSTADWNDTR FKRPDFDKLL LQARSELDEA KRKDMYRTMA
MMVRDEGGVI LPMFNDFVNA STKQVKGYVH DIGNDMSNGY VATRVWLDA