Gene Rleg_5006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5006 
Symbol 
ID8007597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp390696 
End bp392306 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content60% 
IMG OID644821921 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002973181 
Protein GI241113346 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.264167 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGACA AGATCACCAA TTGGACCAGA TCCGACGACT CCATGGTCGA AAGCGCCATC 
CGTCGTGGCG CCACCCGTCG CGAGTTGCTG CATATGATGC TCGCGGGCGG CGTGGCCCTG
TCTGCCGGCG GGCTCGTGCT TGGCCGTGCC GGCAAGGCGC TCGCCGCCAC GCCCGTTTCC
GGCGGCTCGC TCAAGGCGGC CGGCTGGTCG TCCTCGACGG CCGATACGCT CGACCCCGCC
AAGGCGTCGC TCTCCACCGA CTATGTCCGG TGCTGCTCCT TCTATAACCG CCTTACCTTC
CTCGACAAAT CAGGCACGCC GCAGATGGAG CTTGCCGACG CGATCGAGTC CAAGGATGCG
AAGACCTGGA CGGTCAAGCT GAAGAACGGC GTTACTTTCC ATGACGGCAA GCCGCTGACC
GCCGATGACG TAGTTTTCTC GCTGAAGCGC CATCTCGACC CATCCGTCGG CTCGAAGGTC
GCCAAGATCG CCGCCCAGAT GACCGGCTTC AAGGCGGTCG ACAAACAAAC CGTCGAGATC
ACGCTCGCCA GCCCGAATGC CGACCTGCCG ACCATTCTGT CGATGCATCA CTTCATGATC
GTCGCCGACG GCACGACCGA TTTCACCAAG GCCAACGGCA CCGGCGCCTT CGTCAAGGAA
GTCTTCGAGC CGGGCGTTCG CTCGGTCGGG ATCAAGAACA AGAATTACTG GAAATCCGGC
CCGAACGTCG ATTCCTTCGA ATATTTCGCC ATCAGCGACG ACAATGCCCG CGTTAACGCG
CTGCTTTCGG GCGACATCCA CCTCGCAGCC TCGATCAATC CGCGCTCGAT GCGCCTCGTC
GAGACCCAGG GCGACGGCTT CACCTTGTCG AAGACCACCT CCGGCAACTA CACCAATCTC
AACATGCGAC TGGATATGGA GCCCGGCAAC AAGCGGGATT TCGTCGAGGG CATGAAGTAT
CTCGTCAACC GCGAACAGAT CGTCAAAGCG GCGCTGCGCG GTCTCGGCGA AGTTGGCAAC
GACCAACCCG TTTCGCCTGC GAACTTCTAT CATGACGCAG AGCTGAAAGC GCGGGCCTTC
GATCCCGACA AGGCGAAGTT CCACTTCGAC AAGGCCGGGG TTCTCGGCCA ATCCATTCCG
ATCATCGCTT CCGATGCGGC GGCTTCCTCG ATCGACATGG CCATGATCAT ACAGGCGGCC
GGCGCCGAAA TCGGCATGAA GCTCGACGTC CAGCGAGTGC CATCTGATGG CTATTGGGAC
AATTACTGGC TCAAGGCGCC GATCCACTTC GGCAATATCA ACCCGCGTCC GACCCCGGAT
ATCCTCTTCT CCCTGCTCTA CACCTCGGAC GCTCCGTGGA ACGAAAGCCA GTACAAGTCG
GAGAAGTTCG ACAAGATGCT GATCGAGGCG CGCGGCTCTC TCGATCAAGA CAAGCGCAAG
ACGATCTACA ACGAGATGCA GGGCATGGTC GCCCAGGAAG CTGGTACTAT CATTCCCGCC
TATATCTCGA ACGTCGACGC CACGACCGCC AAGCTCAAGG GCCTGGAAGC CAACCCGCTC
GGCGGCCAGA TGGGATATGC TTTCGCGGAA TATGTCTGGC TTGAAGCCTG A
 
Protein sequence
MNDKITNWTR SDDSMVESAI RRGATRRELL HMMLAGGVAL SAGGLVLGRA GKALAATPVS 
GGSLKAAGWS SSTADTLDPA KASLSTDYVR CCSFYNRLTF LDKSGTPQME LADAIESKDA
KTWTVKLKNG VTFHDGKPLT ADDVVFSLKR HLDPSVGSKV AKIAAQMTGF KAVDKQTVEI
TLASPNADLP TILSMHHFMI VADGTTDFTK ANGTGAFVKE VFEPGVRSVG IKNKNYWKSG
PNVDSFEYFA ISDDNARVNA LLSGDIHLAA SINPRSMRLV ETQGDGFTLS KTTSGNYTNL
NMRLDMEPGN KRDFVEGMKY LVNREQIVKA ALRGLGEVGN DQPVSPANFY HDAELKARAF
DPDKAKFHFD KAGVLGQSIP IIASDAAASS IDMAMIIQAA GAEIGMKLDV QRVPSDGYWD
NYWLKAPIHF GNINPRPTPD ILFSLLYTSD APWNESQYKS EKFDKMLIEA RGSLDQDKRK
TIYNEMQGMV AQEAGTIIPA YISNVDATTA KLKGLEANPL GGQMGYAFAE YVWLEA