Gene Rleg_6670 details

Gene Information

Locus tag: Rleg_6670
Symbol:
ID: 8022580
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012858
Strand:
Start bp: 100181
End bp: 101707
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 61%
IMG OID: 644833537
Product: extracellular solute-binding protein family 5
Protein accession: YP_002984671
Protein GI: 241666587
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
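
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(100181, 101707)
print(length, protein_length_aa(length))  # 1527 508
```

Under those assumptions the computed values agree with the Gene Length (1527 bp) and Protein Length (508 aa) fields above.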


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000010094
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.0327722
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACAC TGGTCGCATT CCTTCTTGGC ACTGCGCTCG TCGCCCTGCC TTCGACGCTC 
CTTGCCCAGG AAAAGGGCGG CGTCATCAAT GTCGCGACGA TCGGCGAACC GCCGACGCTC
GATCCGATGT CGTCGACGGC CGATCTGGTC GGCATCGTCA CGCAGCACAT TTTCGAAACG
CTCTACACCT TCGACAAGAG CTGGAACGTC ACGCCGCTTC TGGCCGAAAG CCTGCCGGAG
ATCAGCGCCG ACGGCAAAAC CTATACGATC AAGCTCAGGA CCGGCATCAA GTTCCACGAC
AATACCGATA TGACCTCGGA AGATGTCGTC GCCTCGCTTG GCCGCTGGAT GAAGATCGCT
TCGCGCGGCA AGCAGGTGGC CGGCTTCATC GATAAGGTCA CCGCCGTCGA TCCCTCAACC
GTCACGATCA CGCTGAAGCA GCCCTATGCG CCGCTGACCT CGCTGCTCGC CTTCAACAAT
TCGGCCGCCA TCATCATCCC ATCCGAGAAG CAGGACGAGC CGATGAAGGA CTTCATCGGC
ACCGGTCCCT ACATGCTGAA GGAGCGCAAG GCCGACCAGT ATATCCAGCT TGTCCGCTTC
GATGGCTACA AGTCACGTGA AGGCGACAGC GATGGCTATG GCGGCGCCCG CCACCAGTAT
CTCGATGAGA TCCGCTTCGT GCCGGTGCCG GATCCGAACA CCCGCGTCGA GGCTGCCGTT
TCCGGCCAGT ACGACTACGT CGACTCGATC CCGGTCGAAT CCTACGACAA GCTGAAGGCC
TCCACCGCCT CGCAGCCGAT CATCCTGAAG CCCTTCGGTT ATCCCGTCTT CGTCTTCAAT
ACGAAGGAAG GTATTGCTGG GAATGTCGAG GTTCGCAAGG CGATCCGCCA GGCGCTCAGC
ATGGAAGACA TGCTGGCGGC GGCTTTCGGC AGCACGGATT TCTACGCGCT CGACGGCGCC
ATCTATCCCA AGACCTTTGC CTGGTCGACA GATGCTGGCG TCGAGGGCGC CTATAACGTC
GCCGATCCGG AAGGGGCGGC GGCTGCCGCC AAGAAGGCCG GCTACAACGG CGAACCGATC
CGCATCCTGA CCAGCCGCCA GTATGAGTTC CACTACAAGA TGGCGCAGGT CGCCGCCGAA
TATCTGAAGC TTGCCGGCTT CACCGTCGAT ATGCAGGTTG TGGACTGGGC GACGCTGACG
CAGCGCCGTA CCGATCCAAA GCTCTGGGAT ATCTACATCA CCCATAGCCC CTTCCTGCCG
GAGCCTGCCC TGATCGGCTC GCTCTCGACC AGCTCGCCCG GCTGGTGGGA TACCCCGGCC
CGCAAGGCCG CCGTCGATGC CTTCACCTCG GAAGTCGATC CGAAGAAGCG CGTGGCGCTC
TGGGCCGATG TCCAGAAGGC GATCTATGCC GACGCCCCCT TCATGAAGAT CGGCGACTTC
AACGCCGTTT CGGCAGAATC GACCAAGCTT GAGGGCGTCG ATCCGGCTCC GTGGCCGTAT
TTCTGGAATG CTTCGATCAA GAAGTAA
 
Protein sequence
MKTLVAFLLG TALVALPSTL LAQEKGGVIN VATIGEPPTL DPMSSTADLV GIVTQHIFET 
LYTFDKSWNV TPLLAESLPE ISADGKTYTI KLRTGIKFHD NTDMTSEDVV ASLGRWMKIA
SRGKQVAGFI DKVTAVDPST VTITLKQPYA PLTSLLAFNN SAAIIIPSEK QDEPMKDFIG
TGPYMLKERK ADQYIQLVRF DGYKSREGDS DGYGGARHQY LDEIRFVPVP DPNTRVEAAV
SGQYDYVDSI PVESYDKLKA STASQPIILK PFGYPVFVFN TKEGIAGNVE VRKAIRQALS
MEDMLAAAFG STDFYALDGA IYPKTFAWST DAGVEGAYNV ADPEGAAAAA KKAGYNGEPI
RILTSRQYEF HYKMAQVAAE YLKLAGFTVD MQVVDWATLT QRRTDPKLWD IYITHSPFLP
EPALIGSLST SSPGWWDTPA RKAAVDAFTS EVDPKKRVAL WADVQKAIYA DAPFMKIGDF
NAVSAESTKL EGVDPAPWPY FWNASIKK
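
The sequences above are printed in blocks of 10 characters, so they need to be joined before use. The following is a minimal sketch, using only the Python standard library, of joining the blocks, computing GC content, and translating the gene. It assumes the standard amino-acid assignments, which translation table 11 shares for elongation codons, and it ignores alternative start codons. All function and variable names are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest) paired with
# the standard amino-acid assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = join_blocks("ATGAAAACAC TGGTCGCATT CCTTCTTGGC")
print(translate(first_blocks))  # MKTLVAFLLG
```

Passing the full gene sequence through `join_blocks` and `translate` should give a protein that can be compared with the listed protein sequence, and `gc_percent` can be compared with the GC content field.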