Gene Rleg_6684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6684 
Symbol 
ID8022594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012858 
Strand
Start bp115007 
End bp116653 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content59% 
IMG OID644833551 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002984685 
Protein GI241666601 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.146405 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.692475 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAT ATCTTCTTGC CGCCGCCGCA CTAACGCTGC TTTCGGGATC TGCCATGGCG 
CAAACGATCC TCACAGTGAA TATCGAACCG GCGACGACCT GGGTCCGCAA CTTCAACCCG
TTCAACCAGA CCTCGTCGCG TCAATCGACA CTCGACTTCA TCTACGAGCC GCTGGTCGTC
TTCAATCGCT TCGACAGCAA CAAGCCGGTC TATCGCCTGG CGGAAAGCTT CAAACTCTCC
GACGATCTGA AGAGCATCGA TTTCAAGCTG CGCCCGAACC TGAAATGGTC GGACGGTAAG
CCGCTGACCG CAGCCGACGT CAAGTTCACC TATGATTACC TGAAGAAATT TCCGGCGCTC
GACTTCGTCA GCATCTGGAC CTTCATCACC GATGTGCAGG CCGTCGACGG CCAGACGGTG
CGCTTCACGC TCGCCAATCC GAGCTCGCTC GCCGCCGAGC AGATCTCGCA ACTGCCGATC
GTTCCGGAAC ATGTCTGGAA GGACGTTGCC GATCCCGTCA CCTTCGCCAA CGAGACACCT
GTTGGCAGCG GCCCGCTGAC GGAAGTGCCG CGCTTCACCG GCCAGACTTA CGACCAGTGC
CGCAACCCGA ACTACTGGGA CAACGAGCAC CTGAAAGTCG ATTGCATGCG CTTCCCGCAG
CTCGCCGACA ACAATCAGAT GCTGACGGCA ACGGCCGACG GCACGCTCGA CTGGGGCGTC
TCCTTCATCC CCGATATCGA CAATGTCTAT GTTTCCAAGG ACCCGGCGCA TTTCCACTAC
TGGTATTCGC CAAGCAGCAT GGTCGCCTTC CTGTTCAACC TGGAAACGGC GAACGAGAAC
AACAAGAAGG CCTTCAACGA CCTGAAGTTC CGCCGTGCCG TCTCGATGGC ACTCGACCGC
AAGACGATGA TCGACGTCGC AGGCTACGGC TATCCGACGC TGAACGAAGA CCCCGGCCTG
ATGGGCGAGC TCTACAAGAG CTGGGCGGAC CCGTCCGTCA AGGCCGACTT CGGCAAGTTC
GCGACCTATG ATGCCGATGC TGCCAAGGCC TTGCTCGACG AGGCGGGCTA CAACGACAAG
GACGGCGACG GCTTCCGCGA CAATCCCGAC GGCACCAAGA TCTCCTTCTC GATCATCGTC
CCCAGCGCCT GGACGGACTG GATCGATACC GTCAACCTCG CGGTCGAGGG CATGCAGGCG
GTCGGGATCG ACGCCAAGAT CGAAACGCCG GAGGAAGCCG TCTGGACCGG AAACCTCATC
AACGGCACCT TCGATGCGGC GATCAACAGC CTGCCGGCAT CGGCCTCGCC CTATTACCCC
TACAAGCGCG CTTTCAGTGC TTCGGATAAG GGCAAGACCC GCTTCACCGC GCAGCGCTGG
TTCAATCCGG AGGTCGAAAA ACTCGTCACC GAGTTCACCC ATACCGCCGA CCTTGCCAAG
CAGAAGGATG CGATGAACAA GGCGCAGCGC ATCGTCGCCG AAAACATGCC TGTGATTCCG
GTGTTCAACA ATCCGAACTG GTATCAGTAC AACACCAAGC GCTTCACCGG CTGGTCGACC
AAGGAAAACC CCTTCGTCAA TCCGTCGATC TCGCGGACCA ATCCGGCACG CCTGCTGAAC
CTGCTGGCCC TCGAGCCGGT CAAGTAA
 
Protein sequence
MKKYLLAAAA LTLLSGSAMA QTILTVNIEP ATTWVRNFNP FNQTSSRQST LDFIYEPLVV 
FNRFDSNKPV YRLAESFKLS DDLKSIDFKL RPNLKWSDGK PLTAADVKFT YDYLKKFPAL
DFVSIWTFIT DVQAVDGQTV RFTLANPSSL AAEQISQLPI VPEHVWKDVA DPVTFANETP
VGSGPLTEVP RFTGQTYDQC RNPNYWDNEH LKVDCMRFPQ LADNNQMLTA TADGTLDWGV
SFIPDIDNVY VSKDPAHFHY WYSPSSMVAF LFNLETANEN NKKAFNDLKF RRAVSMALDR
KTMIDVAGYG YPTLNEDPGL MGELYKSWAD PSVKADFGKF ATYDADAAKA LLDEAGYNDK
DGDGFRDNPD GTKISFSIIV PSAWTDWIDT VNLAVEGMQA VGIDAKIETP EEAVWTGNLI
NGTFDAAINS LPASASPYYP YKRAFSASDK GKTRFTAQRW FNPEVEKLVT EFTHTADLAK
QKDAMNKAQR IVAENMPVIP VFNNPNWYQY NTKRFTGWST KENPFVNPSI SRTNPARLLN
LLALEPVK