Gene Rleg2_5442 details


Gene Information

Locus tag: Rleg2_5442
Symbol:
ID: 6978536
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 1084725
End bp: 1086311
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 62%
IMG OID: 643394543
Product: extracellular solute-binding protein family 5
Protein accession: YP_002279361
Protein GI: 209547443
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
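
The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, but both are consistent with the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1084725, 1086311)
print(length, protein_length_aa(length))  # prints: 1587 528
```

Both results agree with the Gene Length and Protein Length fields of this record.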


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0169825
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAAAC AGCTACTGAC AACGATCGCA ATTACCGGCG CCCTGATGAC GACGCAGCCT 
ACCTGGGCTG CCTCGCCGCC CAATATGCTG GTCATCGGCA CCAATCTCAC CGGCATCCGC
ACGCTCGATC CGGCCCAGAA CAATGCCCGC ACCGTTTCCG AGCTGATCTC GAACCTCTAC
GACAACCTGG TGCAGCTGTC GCCGGACGAT CTCAAGACAC TGAAGCCGAT GCTTGCGACG
CAATGGAGCG TCTCGGACGA CGGCAAGATC ATCACGCTCA CCCTGCGCGA CGACGCCGTC
TTCCAGAGCG GCAACAAGGT CACCGCCGAG GATGCGGCCT GGTCGATCCA GCGCGTCATC
AAGATGGGCC AGGTCGGCTC CACCGACGTG GCGCTTTGGG GCTTCAAGCC CGATAACGTC
GAAAAACTCG TTCGGGCAAA GGACGAGCAT ACGCTCGAAA TCGAGCTGCC GCAGTCGGTG
AATACCGATC TGGTGCTTTA TTCGCTGGCA GGCTCGTCGA TCGGCATCGT CGACAAGAAG
ACGGTGCTGT CGCATGAGGC GAACGGCGAT TTCGGCGGCG CCTGGCTTTC GGCCAATTCC
GCCGGCAGCG GACCATTCAG CCTGGCGCAA TGGCGGCCGA ACGACGTGGC GATCTTCAAT
GCGCAGCCGA AATACTGGGG TGGCAAGCCC GCCATGGCCC GCGTCGTCGC CCGCCACATC
CCTGAATCCG GCAATCTGCG ACTGCAGCTC GAAGCCGGCG ACGTCGATGT CGGCCAGTAC
GTGGCAAGCG GTGATCTCGA TGCGCTGTCC ACCAACAAGG ATATGGTCAT CGACAATGTG
CCGGGTCTCG GCTTCTATTA TATCGCCCTC AATCAGAAAG ACCCGGATCT GCAGAAGCCG
AAGGTTCGCG AGGCCTTCCA GCACGCCTTC GACTGGAAGG CGATCTCCGG CAACATCATG
CGCTATACGG GCTTTCCCTG GCAGTCGATG ATTCCGCGCG GCATGATCGG CGCACCCGAC
GAGGCCGCCG CCCGCTACGA CTACGATCCC GCCAAAGCCA AGCAGTTGCT GGCGGAGGCC
GGATATCCGA ACGGCTTGAA GAAGGTGCTC AATCCGTCGG GGGCAGCGAC CCTGCCCTTC
GCCGAAGCGC TGCAGGCGAG CGCGCGGGCC GCCGGCCTCG ATCTCGATCT GGTGCCAGGC
GAGTTCACGC CCGCCTTCCG CGAACGCAAA TTCGAAGTGC TGCTCGGCAA TTCCGGCGCC
CGCCTGCCGG ATCCCTTTGC TGTCGCCACG CAATATGCCT TCAACCCCGA CAATAGCGAC
GAGGCGCGCC TCGGCAGCTA TTATCTCTGG CGCACGGGCA TGAAGGTGGA CGACCTCAAC
ACTCTCATCG ATCAATCGAT GAAAGAGCGC GATACGGCCA AGCGCACGGA TATCTTCAAG
AAGATGGACG GCATCTATGC CGGCATGGCC GCCCCGCTCG TCATCTTCTT CCAGCGAACC
GACCCCTATG TCATGCGCGC CAACGTCAAG AATTATCACG GGCACACGAC CTGGTCGACG
CGCTGGCACG ACGTGACCAA GGAGTAG
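
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any calculation. The following is a minimal sketch of how a GC percentage like the one in the GC content field could be computed from such text. The helper names are hypothetical, and the example runs on the first printed line only, not the full gene, so its result is not expected to match the listed figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGCTGAAAC AGCTACTGAC AACGATCGCA ATTACCGGCG CCCTGATGAC GACGCAGCCT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the whole sequence block through clean_sequence gives the value to compare against the record.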
 
Protein sequence
MLKQLLTTIA ITGALMTTQP TWAASPPNML VIGTNLTGIR TLDPAQNNAR TVSELISNLY 
DNLVQLSPDD LKTLKPMLAT QWSVSDDGKI ITLTLRDDAV FQSGNKVTAE DAAWSIQRVI
KMGQVGSTDV ALWGFKPDNV EKLVRAKDEH TLEIELPQSV NTDLVLYSLA GSSIGIVDKK
TVLSHEANGD FGGAWLSANS AGSGPFSLAQ WRPNDVAIFN AQPKYWGGKP AMARVVARHI
PESGNLRLQL EAGDVDVGQY VASGDLDALS TNKDMVIDNV PGLGFYYIAL NQKDPDLQKP
KVREAFQHAF DWKAISGNIM RYTGFPWQSM IPRGMIGAPD EAAARYDYDP AKAKQLLAEA
GYPNGLKKVL NPSGAATLPF AEALQASARA AGLDLDLVPG EFTPAFRERK FEVLLGNSGA
RLPDPFAVAT QYAFNPDNSD EARLGSYYLW RTGMKVDDLN TLIDQSMKER DTAKRTDIFK
KMDGIYAGMA APLVIFFQRT DPYVMRANVK NYHGHTTWST RWHDVTKE
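
The protein sequence can be cross-checked against the gene sequence by translating codons. The sketch below builds a codon table by hand instead of using a library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; the two differ only in which codons may act as start codons, which this sketch does not handle (this gene starts with ATG, so that does not matter here). The function name is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order, one row per first base.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
assert len(AMINO_ACIDS) == 64

CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a cleaned DNA string codon by codon, stopping at the first stop."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten and last three codons of the gene sequence in this record.
print(translate("ATGCTGAAACAGCTACTGACAACGATCGCA"))  # prints: MLKQLLTTIA
print(translate("AAGGAGTAG"))                       # prints: KE
```

The two outputs match the first ten and last two residues of the protein sequence shown above.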