Gene Rleg_6643 details


Gene Information

Locus tag: Rleg_6643
Symbol:
ID: 8022893
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012858
Strand:
Start bp: 73169
End bp: 74713
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 58%
IMG OID: 644833510
Product: extracellular solute-binding protein family 5
Protein accession: YP_002984644
Protein GI: 241666560
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
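
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 73169
end_bp = 74713

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop
# codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1545
print(protein_length)  # 514
```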


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.19211
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACAAGC TTCTTCTTGC AGGCACCATG TTCATGGCGA TGACCGGCGT GATCCACGCC 
CGCGACATCG TCGTGGCTCA AAGTTCCGAT CTGCGCAGCA ACAATCCGGG CGTCAATCGC
GACGGCAATA CCGATGGCGT CATCCTGCAT ATCGTCGAAG GGCTCGTCGG CTATGCCAAC
AACGGCGAGG TCAAGCCGCT GCTGGCAAAG AGCTTCGAAG TCTCGGCCGA TGGGCTGACC
TACAGCTTCA AACTGCGTGA CGACGTCAAA TTCCACAACG GCAAGACATT GACCGCCGAT
GACGTCGTTT GGAACTGGAA CCGCTATCTC AAGCCCGAAA CGAAATGGAC CTGCCTTCCT
GACTTCGACG GCAACGGCAG CGTGCATGTT ACGGGGGTCA AGGCAGTCGA TGCGTCGACC
GTCACCATCA CGCTGGAAAA GCCATCCGCG GTTTTCCTTG GCCTGATGTC GCGCCCCGAA
TGCGGTTACA CCGGGATCAT TTCCCCGGAA TCGGTCGGCG CAGACGGAAA TTTCGTCAAG
CCGATCGGCA CCGGTCCCTT CAAATGGGAT GAATGGAAAA AGGGCGAGTA TATCCATCTC
GCCAAGTTCG ACGATTATGT CTCGCCAGAG AATGACGGCA AGCCCGACGG CATGGTCGGC
TCCAAACGCC CTCTCGTCGA TGGCATCAAG TTCATGGTGA TTCCCGATGC TTCGACCGTA
AAGGCCGGCC TTCAGTCCGG GGTGCTCGAT ACCGCGGAGA TTTCGCCGGA TCTCATTCCC
GAATTCAAGA CGAGCGACAC GATGCAATTG ATCGTGGCGC GCAACAACGG CAAAAACCTC
TTCTACATCC AGACGCGCGA CAAGGTTCTG AGCAATCCCG GCGTGCGCCG CGCCATGGCA
ATGGCGCTCG ATCTCGACCA ACTCGTCGAG GCCGCCTCCA ATGGCACCGG CGCAGCCAAC
GGTTCGATGG TTTCGCAAGA CTCGCTCTAT TTCGACGATG TCCAGAAGGA GCGTCTGCCC
TACGACGTCG AGGCCGCGAA GAAAGAACTT GCGACGGCCG GCTATAAGGG CGAGCCGATT
ACCATCATCG CCAACAAGCG CAGCAACGTG CCAAGCTTCC CGGCCGCGGT GATGGCGCAG
GCCATGATGC AGCAGGCAGG TCTCAATGTG CAGATCGAGG TGCTCGACTA TGCAACGCAG
GTCGATCGCC GCCGGTCCGG CAACTACCAG ATCATCTCGC AATCGGTCGC GCCGCGGCTC
GATCCGGCGC TGATGTACGG CTTCTATGTC GGCAACAAGG ACAAGAATGC GTCGTTGATG
TGGGATGATC CAAAGGCCGT CGAGTTGATG AAGGCCGCCT ATGCGGAACC CGACCAGACG
AAGCGTCAAG CGATCTTCGA CGAGTTTCAC ACGCTGATGC TCAAGGAAAT GCCGGGCATC
TTCCTCTATG ACATGGTCGA TGTCTGGGGC GCGACCAAGA AGCTGAAGGG CCAGCCCGTC
TGGCAATCGA ATGCCCGTCT TTGGGAAGTT TCGCTCGACA ACTGA
 
Protein sequence
MHKLLLAGTM FMAMTGVIHA RDIVVAQSSD LRSNNPGVNR DGNTDGVILH IVEGLVGYAN 
NGEVKPLLAK SFEVSADGLT YSFKLRDDVK FHNGKTLTAD DVVWNWNRYL KPETKWTCLP
DFDGNGSVHV TGVKAVDAST VTITLEKPSA VFLGLMSRPE CGYTGIISPE SVGADGNFVK
PIGTGPFKWD EWKKGEYIHL AKFDDYVSPE NDGKPDGMVG SKRPLVDGIK FMVIPDASTV
KAGLQSGVLD TAEISPDLIP EFKTSDTMQL IVARNNGKNL FYIQTRDKVL SNPGVRRAMA
MALDLDQLVE AASNGTGAAN GSMVSQDSLY FDDVQKERLP YDVEAAKKEL ATAGYKGEPI
TIIANKRSNV PSFPAAVMAQ AMMQQAGLNV QIEVLDYATQ VDRRRSGNYQ IISQSVAPRL
DPALMYGFYV GNKDKNASLM WDDPKAVELM KAAYAEPDQT KRQAIFDEFH TLMLKEMPGI
FLYDMVDVWG ATKKLKGQPV WQSNARLWEV SLDN
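
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before they can be used. The following is a minimal sketch of how the gene sequence might be cleaned, translated, and checked for GC content. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tooling. The codon table is built from the standard NCBI ordering; translation table 11 assigns the same amino acids to codons as the standard table and differs only in its permitted start codons, which does not matter here because the gene starts with ATG.

```python
# Standard genetic code in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from the record (60 bases, 20 codons).
first_line = "ATGCACAAGC TTCTTCTTGC AGGCACCATG TTCATGGCGA TGACCGGCGT GATCCACGCC"
print(translate(first_line))  # MHKLLLAGTMFMAMTGVIHA
```

The printed peptide matches the first two blocks of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.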