Gene Rleg_6426 details

Gene Information

Locus tag: Rleg_6426
Symbol:
ID: 8016925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012854
Strand:
Start bp: 141495
End bp: 143084
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 62%
IMG OID: 644828221
Product: extracellular solute-binding protein family 5
Protein accession: YP_002979421
Protein GI: 241554208
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
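
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for a complete CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(141495, 143084)
print(length, protein_length_aa(length))
```

With the coordinates from this record the sketch gives 1590 bp and 529 aa, matching the Gene Length and Protein Length fields.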


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.0312111
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGACT ACACGAAGTA TCTCGCAAGC CGTGTCACCG CCGGCGGGCT CAGCCGCCGC 
GAATTCATGG GACGCGCCAT GGCCGCGGGC ATCACACTTG CCGTCGCCGA CAAGCTCTTC
ACCGAAAGTG CGCAAGCTGC CGAACCGAAG CGCGGCGGTC ACTTGAAGCT CGGCCTCGAA
GGCGGTGCCG CTACCGATTC CAACGACCCG GCGAAGTTCC TGTCGCAGGT CATGTTCTGC
ATCGGCCGCT GCTGGGGCGA CATGCTGGTC GAGTCTGATC CGCTGACCGG TGCCGCCGTG
CCGGCGCTCG CCGAATCCTG GGAACCGTCG AAGGACGCGG CGACCTGGAC CTTCAAGATC
CGCAAGGGCG TCAAGTTTCA CGATGGCAAG GAACTGACGA TCGATGACGT TGTTGCGACG
CTGAAGCGCC ACACCGACGC CAAGTCGGAA TCCGGCGCGC TCGGTGTTCT CGGCTCGATC
AAGGAGATCA AGGCCGACGG CGGCAATCTC GTGCTGACGC TCAGCGAAGG CAATGCCGAC
ATGCCGCTGC TGCTGTCGGA CTACCATCTG GTCATCCAGC CGAACGGCGG CGTCGACGAT
CCTCTCGCCT CGATCGGCAC CGGCCCCTAC AAGATGACAA GCTTCGAGCC CGGCGTCCGC
GCCACCTTCG AAAGGAACAA AGACGACTGG CGCACCGACC GCGGTTACGT CGATTCGATC
GAAATCATCG GCATGAACGA TGCGACCGCC CGCATCGCGG CCCTGTCGTC CGGCCAGGTG
CACTACATCA ACCGGGTTGA CCCGAAGACC GTCAACCTCT TGAAACGCGC ACCCAACGTC
GAGATTCTCT CGACCGCCGG CCGTGGCCAT TACGTCTTCA TCATGCATTG TGACAAGGCG
CCGTTCGACA ACAACGACCT GCGCCTGGCC CTCAAATATG CCATGGACCG TGAGGCCATG
GTGCAGAAGA TCCTCGGCGG TTACGGCAAG GTCGGCAACG ACTTCCCGAT CAACAGCACC
TATGCGCTGT TTCCCGAGGG CATCGAGCAG CGCGTTTACG ATCCTGACAA GGCTGCCTTC
CACTATAAGA AGTCAGGTCA TAGCGGCTCG GTCCTCCTGC GCACCTCCGA AGTCGCCTTC
CCCGGCGGTG TCGACGCAGC CGTCCTCTAT CAGGAAAGCT GCAAGAAGGC CGGCATCGAG
ATCGAGGTCA AGCGCGAACC GGGCGACGGC TACTGGACCA ACGTCTGGAA CGTCCAGCCC
TTCTCGACCT CCTATTGGGG TGGCCGCCCG ACGCAGGACC AGATGTATTC AACCGCCTAT
CTCTCGACGG CGGATTGGAA CGACACCCGT TTCAAGCGTC CTGACTTCGA TAAGCTGCTG
CTGCAGGCCC GTTCCGAACT TGATGAAGTC AAGCGCAAGG ACATGTATCG CACCATGGCG
ATGACGGTGC GCGACGAGGG CGGGGTGATC TTGCCGATGT TCAACGATTT CGTGAATGCC
TCCACCAAGC AGGTGAAGGG TTATGTCCAC GACATCGGCA ACGACATGTC GAACGGCTAC
GTTGCGACCC GCGTCTGGCT GGACGCTTGA
 
Protein sequence
MNDYTKYLAS RVTAGGLSRR EFMGRAMAAG ITLAVADKLF TESAQAAEPK RGGHLKLGLE 
GGAATDSNDP AKFLSQVMFC IGRCWGDMLV ESDPLTGAAV PALAESWEPS KDAATWTFKI
RKGVKFHDGK ELTIDDVVAT LKRHTDAKSE SGALGVLGSI KEIKADGGNL VLTLSEGNAD
MPLLLSDYHL VIQPNGGVDD PLASIGTGPY KMTSFEPGVR ATFERNKDDW RTDRGYVDSI
EIIGMNDATA RIAALSSGQV HYINRVDPKT VNLLKRAPNV EILSTAGRGH YVFIMHCDKA
PFDNNDLRLA LKYAMDREAM VQKILGGYGK VGNDFPINST YALFPEGIEQ RVYDPDKAAF
HYKKSGHSGS VLLRTSEVAF PGGVDAAVLY QESCKKAGIE IEVKREPGDG YWTNVWNVQP
FSTSYWGGRP TQDQMYSTAY LSTADWNDTR FKRPDFDKLL LQARSELDEV KRKDMYRTMA
MTVRDEGGVI LPMFNDFVNA STKQVKGYVH DIGNDMSNGY VATRVWLDA
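
The gene and protein sequences above are linked by the genetic code named in the Translation table field. The following is a rough sketch, in plain Python with no external libraries, of how a gene sequence in this 10-base block layout could be translated and how its GC fraction could be computed. The codon table used here is the standard codon-to-amino-acid mapping, which translation table 11 shares for elongation (table 11 differs in the start codons it permits). The function names are hypothetical, and this is not necessarily how the database derived its values.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the block layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGAACGACT ACACGAAGTA TCTCGCAAGC CGTGTCACCG CCGGCGGGCT CAGCCGCCGC"
print(translate(first_line))
```

Applied to the first line of the gene sequence, the sketch yields MNDYTKYLASRVTAGGLSRR, the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.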