Gene Rleg_5539 details

Gene Information

Locus tag: Rleg_5539
Symbol:
ID: 8016430
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012853
Strand:
Start bp: 125745
End bp: 127025
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 63%
IMG OID: 644827706
Product: ABC transporter substrate-binding protein
Protein accession: YP_002978906
Protein GI: 241518278
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
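
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon. The function names `gene_length` and `protein_length` are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(125745, 127025)
print(length, protein_length(length))  # prints: 1281 426
```

Both values match the Gene Length and Protein Length fields of the record.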


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0803518
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.00000350864
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTTACCC GACGCGATTT TCTGAAGACG ACGGCCGCCA CCGGTGCATT GGCGGCGACA 
TCCGGGCTCG CCGCCCCCGC GATCGCGCAG GACGCCGCGA TCAAGCTCGG CTATGTCAGC
CCGCAGACGG GACCGCTTGC CGCCTTCGGT GAGGCCGACA AGTTCGTCAT CGACAGTTTT
CTGGCAGTCA CCAAGTCGAA GGGTCTCAAC TACGAGGTTG TCGTCAAGGA CAGCCAATCC
AATCCGAACC GGGCGGCGGA GGTCGCCAAG GAACTGATCG TCACCGACGA GGTGAACCTG
ATCCTCGTCG CCTCGACGCC GGAGACCACC AATCCGGTGG CGACCACCTG CGAGGCTGAG
GAAATGCCCT GTATTTCGAC GGTGGCTCCC TGGCAGCCGT GGTTCATCGG CCAGCAGGGC
AATCCCGGCG ACCCGACCTC CTGGAAACCA TTGAACTACG CCTATCACTT CTTCTGGGGT
CTCGAGGACG TCATCTCGGT CTTCACCAAC ATGTGGGCGC AGATCGAGAC CAACAAGAAG
GTTGGCGGCC TCTTCCCAAA TGACGGCGAC GGCAATGCCT GGGGCGACAA GGTCGTCGGC
TTCCCGCCGG TGCTGGAAAA GATGGGCTAC GGGCTGATCG ACCCCGGCCG CTATCAGAAC
ATGACGGATG ATTTCTCGGC GCAGATCAAC GCCTTCAAAT CGGGCCAGTG CGAAATCATC
ACCGGCGTGG TGATCCCGCC TGACTTCACC ACCTTCTGGA ACCAGGCCAA GCAGCAGGGT
TTCGCCCCGA AGATCGCCTC GATCGGCAAG GCACTGCTGT TCCCGCAGAC GGTGGAGGCG
CTCGGCAATG CCGGGCATAA TCTGTCGTCG GAAGTCTGGT GGACGCCGAG CCATCCGTTC
AAATCGTCCT TGACGGGCGA AAGTACAGCA GAGGTGGCGG CCGCCTTTAC CAAGGCGACT
AGCAGGCCGT GGACGCAGCC GATCGGTTTT GCCCATGCGC TGTTCGAGCT GGCGGTGGAT
GCGATGAAGC GGGCCGGAGA TCCGACAGAC GGGGATGCCG TCGCGCAGGC GATTGCCGCC
ACCAAGCTCG ATACGCTGGT CGGGCCGATT GCTTGGGACG GCAAGGGCCT GCCGCCTTTC
GCGGCCAAGA ACATTGCCAA GACGCCGCTC GTCGGCGGCC AGTGGCGGTT GAAGGACGGC
GGCGGCTACG ATCTCGTCAT CACCGACAAC AAGACGGCGC CGAACATTCC GGTCGGCGGC
AAGATGGAAG CAATCGCCTG A
 
Protein sequence
MFTRRDFLKT TAATGALAAT SGLAAPAIAQ DAAIKLGYVS PQTGPLAAFG EADKFVIDSF 
LAVTKSKGLN YEVVVKDSQS NPNRAAEVAK ELIVTDEVNL ILVASTPETT NPVATTCEAE
EMPCISTVAP WQPWFIGQQG NPGDPTSWKP LNYAYHFFWG LEDVISVFTN MWAQIETNKK
VGGLFPNDGD GNAWGDKVVG FPPVLEKMGY GLIDPGRYQN MTDDFSAQIN AFKSGQCEII
TGVVIPPDFT TFWNQAKQQG FAPKIASIGK ALLFPQTVEA LGNAGHNLSS EVWWTPSHPF
KSSLTGESTA EVAAAFTKAT SRPWTQPIGF AHALFELAVD AMKRAGDPTD GDAVAQAIAA
TKLDTLVGPI AWDGKGLPPF AAKNIAKTPL VGGQWRLKDG GGYDLVITDN KTAPNIPVGG
KMEAIA
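
The protein sequence follows from the gene sequence through translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is a minimal illustration, not the database's own pipeline. The helper names `clean`, `gc_content` and `translate` are hypothetical, and the codon table is built from the standard amino-acid string in TCAG order. One simplification: table 11 allows alternative start codons that are read as M in the first position, which this sketch does not handle. That does not matter here because the gene starts with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 assigns the same
# amino acids as the standard code and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
first_30 = "ATGTTTACCC GACGCGATTT TCTGAAGACG"
print(translate(first_30))  # prints: MFTRRDFLKT
```

Passing the full gene sequence to `translate` and `gc_content` in the same way would allow a comparison with the Protein sequence and GC content fields of the record.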