Gene Rleg_5173 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5173 
Symbol 
ID8007069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp578724 
End bp580310 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content61% 
IMG OID644822083 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002973343 
Protein GI241113508 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.12226 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.127243 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAGAC AAGTACTAAC AACGATCGCA ATTACCGGCG CCATGATGAC GGCGCAGCCC 
AGCTTCGCGG CGTCGCCGCC CAATATGCTG GTGATCGGCA CCAATCTCAC CGGCATAAGG
ACGCTCGATC CGGCGCAGAA CAATGCCCGC ACGGTTTCCG AACTGATCTC GAATATTTAC
GACAACCTCG TGCAGCTGTC GCCGGACGAC CTTAAAACGC TGAAGCCGAT GCTCGCGAAG
CAATGGAGTG TCTCGGCGGA TGGCAAGATC ATCACACTGA CATTACGCGA CGACGCGGTC
TTCCAGAGCG GCAACAAGGT CACCGCCGAG GATGCGGCCT GGTCGATCCA GCGCGTCATC
AAGATGGGCC AGGTCGGCTC CACCGACATC GCGCTCTGGG GCTTCACGCC TGAAAACGTC
GAAAAGCTCG TTCGCGCAAA AGACGAACAC ACGCTCGAGA TCGAGCTGCC GCAGGCGGTC
AATACCGATC TGGTGCTCTA TTCGCTGGCG GGCTCGTCGA TCGGCATCGT CGACAAGAAG
ACGGTACTGT CGCACGAGGC AAACAGCGAT TTTGGCGGCG CCTGGCTTTC CGCCAATTCC
GCCGGCAGCG GCCCATTCAG CCTGGCGCAG TGGCGGCCGA ACGATGTCGC GATCTTCAAT
GCCCAGCCGA AATATTGGGG CGGCAAGCCG GCCATGGCCC GTGTCGTTGC GCGTCACATC
CCGGAATCCG GCAATCTCCG GCTTCAGCTC GAAGCCGGCG ACGTCGATGT CGGCCAATAT
GTTTCAAGCG GCGACCTCGA TGCGCTCGCC ACCAAGAAGG ACATGGTCAT CGAGAATGTC
CCGGGTCTCG GCTTCTACTA TATCGCCCTC AATCAGAAAG ACCCGGATCT GCAGAAGCCA
AAGGTTCGCG AGGCCTTCCA GCATGCCTTC GACTGGAAAG CGATCTCCGG CAACATCATG
CGCTATACGG GCTTTCCCTG GCAGTCGATG ATCCCGCGCG GCATGATCGG CGCTCCCGGT
GAGGCGGCGG TCCGCTACGA TTACGATCCC GCCAAGGCCA AGCAGTTGCT GGCGGAAGCC
GGATATCCCA ATGGCCTGAA GAAGGTGCTC AATCCGTCGG GAGCCGCGAC CCTGCCCTTC
GCCGAAGCGC TGCAGGCGAG TGCGCGCGCC GCCGGCCTTG ATCTCGATCT CGTGCCCGGC
GAGTTCACGC CCGCCTTCCG CGAGCGCAAA TTCGAAGTGC TGCTCGGCAA TTCCGGCGCC
CGCCTGCCCG ATCCCTTCGC GGTCGCCACG CAATATGCCT TCAACCCCGA CAATAGCGAC
GAGGCACGCC TCGGCAGCTA TTACCTCTGG CGCACGGGCA TGAAGGTGGA CGAGCTCAAC
ACGCTCATCG ACCAATCGAT GAAGGAGCGC GACACGGAAA AGCGCACAGA CATCTTCAAG
AAGATGGATG GCATCTATGC CGGCATGGCT TCGCCGCTGG TCATCTTCTT CCAGCGAACC
GACCCCTATG TCATGCGCGC CAACGTCAAG GGCTATCACG GGCATACGAC ATGGTCGACG
CGCTGGCATG ACGTGACCAA GGAGTAG
 
Protein sequence
MLRQVLTTIA ITGAMMTAQP SFAASPPNML VIGTNLTGIR TLDPAQNNAR TVSELISNIY 
DNLVQLSPDD LKTLKPMLAK QWSVSADGKI ITLTLRDDAV FQSGNKVTAE DAAWSIQRVI
KMGQVGSTDI ALWGFTPENV EKLVRAKDEH TLEIELPQAV NTDLVLYSLA GSSIGIVDKK
TVLSHEANSD FGGAWLSANS AGSGPFSLAQ WRPNDVAIFN AQPKYWGGKP AMARVVARHI
PESGNLRLQL EAGDVDVGQY VSSGDLDALA TKKDMVIENV PGLGFYYIAL NQKDPDLQKP
KVREAFQHAF DWKAISGNIM RYTGFPWQSM IPRGMIGAPG EAAVRYDYDP AKAKQLLAEA
GYPNGLKKVL NPSGAATLPF AEALQASARA AGLDLDLVPG EFTPAFRERK FEVLLGNSGA
RLPDPFAVAT QYAFNPDNSD EARLGSYYLW RTGMKVDELN TLIDQSMKER DTEKRTDIFK
KMDGIYAGMA SPLVIFFQRT DPYVMRANVK GYHGHTTWST RWHDVTKE