Gene Rleg2_5763 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5763 
Symbol 
ID6977153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011366 
Strand
Start bp167872 
End bp169398 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content61% 
IMG OID643393219 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002278037 
Protein GI209546147 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000235373 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.926009 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAC TTGTCGCATT CCTGCTCGGC ACCGCGCTCG TCGCCCTGCC TTCGACCTTG 
CTCGCCCAGG AAAAGGGCGG CGTCATCAAT GTCGCGACGA TCGGCGAGCC GCCGACGCTC
GATCCGATGT CGTCGACGGC CGATCTCGTC GGCATCGTCA CGCAGCATAT TTTCGAAACC
CTCTACACTT TCGACAAGAG CTGGAACGTC ACACCGCTGC TGGCCGAAAG CCTGCCTGAG
ATCAGCGCCG ACGGCAAAAC CTATACGATC AAGCTCAGGA CCGGCATCAA GTTCCACGAC
AATAGCGACA TGACCTCGGA CGATGTCGTC GCCTCGCTTG GCCGCTGGAT GAAGATCGCC
TCGCGCGGCA AGCAGGTGGC CGGCTTCATC GACAAGATTA CCGCCGCTGA TGCCTCGACA
GTGACCATCA CGCTGAAGCA GCCCTATGCG CCGCTGACCT CGCTGCTTGC CTTCAACAAT
TCGGCGGCAA TCATCATCCC TGCCGAAAAG CAGGACGAGC CGATGAAGGA CTTCATCGGC
ACCGGTCCCT ATATGCTGAA GGAGCGCAAG GCCGACCAAT ATATCCAGCT CGTCCGCTTC
GACGGCTACA AGTCCCGCGA AGGCGACAGC AATGGGTATG GCGGCGCCCG CCATCAATAT
CTCGATGAAA TCCGCTTCGT GCCGGTGCCG GATCCGAACA CCCGCGTCGA GGCCGCCATC
TCAGGCCAGT ATGATTATGT CGACTCGATC GCGGTCGAAT CCTACGACAA GCTGAAAGCT
TCCAACGCCT CGCAGCCGGT CATGTTGAAG CCCTTCGGCT ACCCGGTCTT CGTCTTCAAT
ACCAAGGAAG GTGTGGCCGG GAATGTCGAG GTTCGCAAGG CGATCCGCCA GGCGCTCAGC
ATGGAAGACA TGCTGGCCGC CGCCTTCGGC AGCAAGGATT TTTATGCGCT CGACGGCGCC
ATCTATCCGA AGACCTTTTC CTGGTCGACG GATGCCGGCG TCGAGGGCGC CTATAACGTC
GCCGATCCGG AAGGGGCTGC CGCTGCCGCC AAGAAGGCCG GCTACAACGG TGAGCCGATC
CGGATTCTGA CCAGCCGCCA GTACGAATTC CACTACAAGA TGGCGCAGGT CGCCGCCGAA
TATCTGAAGC TTGCCGGCTT CACCGTCGAT ATGCAGGTGG TGGACTGGGC GACGCTGACG
CAGCGCCGCA CCGACCCGAA GCTCTGGGAT ATCTACATCA CCCACAGCCC CTTCCTGCCG
GAGCCGGCGC TGATCGGCTC GCTCTCGACC AGCTCGCCCG GCTGGTGGGA TACGCCGGCC
CGCAAGGCCG CCGTCGATGC CTTCACCTCC GAGGTCGACC CGAAGAAGCG CGTGGCACTC
TGGGCCGATG TCCAGAAGGC TATCTATACA GACGCGCCCT TCATGAAGAT CGGCGATTTC
AACGCCGTCG CGGCAAAGTC GGTCAAGCTT GAAGGCGTCG ATGCGGCCCC GTGGCCATAT
TTCTGGAACG CTTCGATCAA GAAGTAA
 
Protein sequence
MKTLVAFLLG TALVALPSTL LAQEKGGVIN VATIGEPPTL DPMSSTADLV GIVTQHIFET 
LYTFDKSWNV TPLLAESLPE ISADGKTYTI KLRTGIKFHD NSDMTSDDVV ASLGRWMKIA
SRGKQVAGFI DKITAADAST VTITLKQPYA PLTSLLAFNN SAAIIIPAEK QDEPMKDFIG
TGPYMLKERK ADQYIQLVRF DGYKSREGDS NGYGGARHQY LDEIRFVPVP DPNTRVEAAI
SGQYDYVDSI AVESYDKLKA SNASQPVMLK PFGYPVFVFN TKEGVAGNVE VRKAIRQALS
MEDMLAAAFG SKDFYALDGA IYPKTFSWST DAGVEGAYNV ADPEGAAAAA KKAGYNGEPI
RILTSRQYEF HYKMAQVAAE YLKLAGFTVD MQVVDWATLT QRRTDPKLWD IYITHSPFLP
EPALIGSLST SSPGWWDTPA RKAAVDAFTS EVDPKKRVAL WADVQKAIYT DAPFMKIGDF
NAVAAKSVKL EGVDAAPWPY FWNASIKK