Gene Rleg2_1692 details


Gene Information

Locus tag: Rleg2_1692
Symbol:
ID: 6980429
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 1722237
End bp: 1723832
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 59%
IMG OID: 643396416
Product: extracellular solute-binding protein family 5
Protein accession: YP_002281206
Protein GI: 209549289
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
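
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the unspliced CDS includes its terminal stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that includes its stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 1722237, 1723832
length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1596
print(protein_length_aa(length))  # 531
```

Both results agree with the Gene Length (1596 bp) and Protein Length (531 aa) fields of the record.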


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.122173
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCATT TTTCTAAAGG CTTGTTCGTC GGCGCCGTTT TCGGCGCCCT GACAATCGCC 
GCGCCGCAGC TGCAGGCCGC CACGCCGCAG GATCAATTGG TGATCGGGAC ATCGCTGGCC
CAGGTTCTGT CCCTCGATCC GCAGCAGGCG ACCGAAGGCA AGGCCGTCGA GATCATGTCC
AATCTTTACG ACCGGCTGGT CGCCAGCACG GCTGATGGTA AGATCCTTCC GCAGCTGGCG
GAAAGCTGGA AGGTTGACGA CAAGGGCATC ACCTTCACGC TGCGCAAGGC CAATTTCGCC
TCCGGCAATC CGGTGACTTC GAAGGACGTC GTCTATTCGC TGGCGCGGCT CTTGAAGATG
GATCAGGCCG CCGCCGCCAA CCTCAAGCGC GTCGGCTACG ATAAGAACAA TGTCGATAAG
CTCGTCAAGG CGGTCGACGA CCAGACGGTG CGCATCGATC TTTCCGACCA GGTGACGGCA
GAGCTTCTGC TCTACCGGCT GACGACGACC ACCACCAGCG TGGTCGACAG CGTCGAGGTC
GAAAGCCACG CCGTCGACAA TGACTACGGC AACGCCTGGA TGCGCACGCA TTCGGCCGGC
TCCGGCCCGT TCACCCTCAA TCGCTGGTCT CCGAACGAAT TGGTCATTCT CGACGCCAAC
AAGAATTATA TGACCGGCGC GCCGAAGATG AAGCGCGTCA TCGTTCGCCA TGTGCCGGAA
AGCCAGGTCG AGCGGCTGAT GCTGGAGCGC GGCGATATCG ATATTGCCAG CGCCCTGACA
GCCTCCGATC TCGCGACATT CCAGGCCAAG CAGGGCTTTG CCATCCAGCG CATTCCGACC
GGCGGCTTCT ACGTGCTGTC GATGAATGCC GGCAACCAGT ACCTTTCCAA TCCCAAGGTT
CGGGAAGCGA TTGCCTATGG CATCGATTAC AAGGGCATCG AAAAGACGAT CATGGGACCT
TACGGACGGG CAAGAACCGT TCCCGTTCCG GAGAACTTCG AATATGCGAT CCCAAGCCCG
GATTGGCAGC TCAACGTTGA AAAGTCCAAG CAGCTGCTGA GCGAGGCAGG CTTCAAGGAC
GGCTTCTCGC TGACGCTGAA GACCATTGCG CAAACGCCGC GCATCGATCT TGCCACCGCC
ATCCAGGCAT CGCTTGCCCA GGTCGGCATC AAGATCGACA TCCAGCAGGG CAACGGTTCG
GAAATCATCG CCGCCCATCG CGCCCGGGAT TTCGATCTGC TGATCCCGCA GACCAGCGCC
TATATGCCGA ATGTGCTCGG CTCGATGGAG CAGTTTTCCT CCAATCCGGA CAACTCGAAA
GAGGCCAACA ATGCCGGCAA TTTCGTCTGG CGCTCGGCTT GGGACATTCC AGAGCTGACA
GCCCTGACCG CAAAAGCATC GATGGAGCCG GACGCCAAGA AGCGCGGCGA ACTCTACGTT
CAGATGCAGA AGATGTTCGT CGAACAGAAG CCGGCCGTGC TGCCGATGTT CGAGCGCTTT
GAGCCGATCG TCCTCACCGG CAGGGTCCAG GGATATGTCG GACATCCGTC GCAAATGACG
CGTCTCGAGA ACGTGACCAA GGTCGAAACC CAGTAA
 
Protein sequence
MKHFSKGLFV GAVFGALTIA APQLQAATPQ DQLVIGTSLA QVLSLDPQQA TEGKAVEIMS 
NLYDRLVAST ADGKILPQLA ESWKVDDKGI TFTLRKANFA SGNPVTSKDV VYSLARLLKM
DQAAAANLKR VGYDKNNVDK LVKAVDDQTV RIDLSDQVTA ELLLYRLTTT TTSVVDSVEV
ESHAVDNDYG NAWMRTHSAG SGPFTLNRWS PNELVILDAN KNYMTGAPKM KRVIVRHVPE
SQVERLMLER GDIDIASALT ASDLATFQAK QGFAIQRIPT GGFYVLSMNA GNQYLSNPKV
REAIAYGIDY KGIEKTIMGP YGRARTVPVP ENFEYAIPSP DWQLNVEKSK QLLSEAGFKD
GFSLTLKTIA QTPRIDLATA IQASLAQVGI KIDIQQGNGS EIIAAHRARD FDLLIPQTSA
YMPNVLGSME QFSSNPDNSK EANNAGNFVW RSAWDIPELT ALTAKASMEP DAKKRGELYV
QMQKMFVEQK PAVLPMFERF EPIVLTGRVQ GYVGHPSQMT RLENVTKVET Q
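
The sequences above are printed in space-separated blocks of 10 characters, 60 per line. A minimal sketch of how such a listing could be collapsed into a contiguous string and its GC content estimated is given below. The helper names are hypothetical, and the GC calculation assumes an unambiguous sequence containing only A, C, G and T.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocking."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; assumes only A, C, G, T are present."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listing in this record.
first_line = "ATGAAGCATT TTTCTAAAGG CTTGTTCGTC GGCGCCGTTT TCGGCGCCCT GACAATCGCC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applied to the full gene sequence, the same two functions should give a length matching the Gene Length field and a GC percentage comparable to the GC content field.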