Gene Rleg2_5668 details


Gene Information

Locus tag: Rleg2_5668
Symbol:
ID: 6977059
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011366
Strand:
Start bp: 60142
End bp: 61815
Gene length: 1674 bp
Protein length: 557 aa
Translation table: 11
GC content: 60%
IMG OID: 643393125
Product: extracellular solute-binding protein family 5
Protein accession: YP_002277943
Protein GI: 209546053
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
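
The start, end, gene length and protein length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The record does not state either point, and the function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive coordinates: count both ends
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the last codon is the stop, not a residue
    return gene_len, protein_len


# Values taken from the record: Start bp 60142, End bp 61815
print(cds_lengths(60142, 61815))  # (1674, 557)
```

Under those assumptions the result agrees with the listed gene length of 1674 bp and protein length of 557 aa.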


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTCACA ACATTTCCGT GCACGCCGGC TTAGCAGCGG CAACCGCGCT GACGGCGCTC 
GTCGCTTTCG GGGCGGGCAG TGCGGCGGCT GAATCGGTGC TGACGATGCA CATCGAAGAG
CAGACCAGTT GGGTTCAGAA CTTCAATCCG TTCGATCTTG CCGGCCGTCG GCAGAGCACG
ATGGATTTCA TCTACGAGCC GCTGGTCATC TTCAATGCCG AGGATGGCGG CAAGCCGGTT
TTCCGCCTGG CGACCGCTTA CAAATTCTCC GAGGATATGA AATCGGTCAC CTATACGCTG
CGACCCGGTG TGAAATGGTC CGACGGCCAG CCGCTGACCT CGGCCGATGT GAAATATACG
ATCGACCTGA TGCTGAAGAA CGCAGCCCTC GACACGGTCG GCGTCGGTCA AACGGTTGCC
TCGGTCGAGA CGCCGTCGGC GACCGAGGTG AAGGTCGATC TCAAGGCCGT CAACTCCGAT
TTCCCGGAAA CGCTGGCGGA CCTCGCCATC GTTCCCGAGC ATATCTGGAA GGACGTGTCC
GATCCCGTTG CCTTCAAGAA CGAGAAGCCG GTCGGTTCCG GCCCGATGAC CGAGCTGCGC
CGCTTCACGC CGCAGGTCTA CGAGCAGTGC CGCAACCCGA ACTACTGGGA TGCCGCCTCG
CTGCATGTCG ATTGCCTGAG ACTGCCGCAG ATCTCCGGCA ACGACCAGAT GCTTGCCATC
CTGCCTGAGG GCAATATGGA CTGGATCGGC TCCTTCATTC CCCAGATCGA CAAGACTTTC
GTCGGGCTCG ATGCCGACCA TAACGGCTAC TGGCAGCCGC CGGCCGAAAC CGTCGCTTTC
CAGATGAATT TCAAGAGCGG CAATGACGGC AACCTCGAGG CCTATAAGGA CCTTAACTTC
CGCCATGCCT TCAGCCTCGC GATGGATCGC AAGTCCATGG TCGATATTGC CGGCTTCGGC
TATCCCGTCG TCAACGAACA TGCGACCGGC CTGCCGCCGC GTTTCGAAAG CTGGCGCAAC
AAGGATGCCG AGGGCGGCAA GGACGCCTTC ATGGGCTTCG ACACCGAAAA GGCCAGCAAG
ATTCTCGACG ATGCCGGCTA CAAGAAGGGC GCCGACGGCT TCCGCACGAC GCCGAGCGGC
AAACCGATCG CCTTCCCGAT CATCGTTCCG AACGGCTGGA CGGACTGGAT CGATGCGGTC
CAGATCGCGG TCGAAGGCCT GCGCGCCGCC GGCATCAATG CGTCGGTCGC CACGCCCGAA
TATGAACAAT GGCGCAAAGA GATCATCGAC GGCAGCTTCG AGGTCGTCAT GAACTCCCGC
GCCGACGGCG CAACGCCGTT CCGCGGCTAT TACCAGAGCC TTTCCACAGC CTATGGCGGG
CGCATCACCG GCGCGCCCTC GCGTTATTCG AACCCGAAAC TGGACGCGCT TTTCGATCAA
TATCTGCAGG CAACGTCGGA CGACGATCAC AAGAAGATCT TCAACGACAT TCAAATGCTG
ATCGCCGACG ACTTCCCCGT CGTTCCCGTC TTCAACGGGC CGACCTGGTA TCAGTTCTCC
AGCAAGCGCT TCACCGGCTG GGTCACCGAC AAGGATCCGG TGATGAATCC CGAGGATCAC
GACAACAACC GCATGCGCCT GATGCATCTC TTGCGTCTCA AGCCGGTCAG CTAA
 
Protein sequence
MFHNISVHAG LAAATALTAL VAFGAGSAAA ESVLTMHIEE QTSWVQNFNP FDLAGRRQST 
MDFIYEPLVI FNAEDGGKPV FRLATAYKFS EDMKSVTYTL RPGVKWSDGQ PLTSADVKYT
IDLMLKNAAL DTVGVGQTVA SVETPSATEV KVDLKAVNSD FPETLADLAI VPEHIWKDVS
DPVAFKNEKP VGSGPMTELR RFTPQVYEQC RNPNYWDAAS LHVDCLRLPQ ISGNDQMLAI
LPEGNMDWIG SFIPQIDKTF VGLDADHNGY WQPPAETVAF QMNFKSGNDG NLEAYKDLNF
RHAFSLAMDR KSMVDIAGFG YPVVNEHATG LPPRFESWRN KDAEGGKDAF MGFDTEKASK
ILDDAGYKKG ADGFRTTPSG KPIAFPIIVP NGWTDWIDAV QIAVEGLRAA GINASVATPE
YEQWRKEIID GSFEVVMNSR ADGATPFRGY YQSLSTAYGG RITGAPSRYS NPKLDALFDQ
YLQATSDDDH KKIFNDIQML IADDFPVVPV FNGPTWYQFS SKRFTGWVTD KDPVMNPEDH
DNNRMRLMHL LRLKPVS
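
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own method. It assumes the sequence is read 5' to 3' on the coding strand starting at the first base, and it uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in which alternative start codons it permits). The names `translate`, `gc_content` and `clean` are hypothetical helpers.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence, as formatted in the record
print(translate("ATGTTTCACA ACATTTCCGT GCACGCCGGC"))  # MFHNISVHAG
```

Passing the full gene sequence to `translate` should give a string comparable with the protein sequence listed above, and `gc_content` can be compared with the GC content field in the same way.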