Gene Rleg_4682 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4682 
Symbol 
ID8007158 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp46020 
End bp47618 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content60% 
IMG OID644821616 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002972876 
Protein GI241113041 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGAC TGTCTAGATT ATCCGTGATT GCGCTTGGCG CCCTGCTGTC GACGGCTGCC 
GTTCCAGCTC TTGTCGTTTC GGGCGTTGCA ATAGAGGCCC AGGCAGCCAC GCTATCGGGC
GGCTTCGATG TCGGTCCCGG AGGTTTCCAG GGCAACTTCA ATCCGCTCGC CGCGACCGCC
GGCTTCACCT GGCTCAGCAT CTATTACGAA CCGCTGATCA CTTATGACGA GAAGCTGCAG
AAGGTCGTCG GCGCGCTGGC AAGCTCCTAC GAGGTCAGCT CCGACCAGAT GACCTACACG
TTCAAGCTGG TGGACGCCAA ATGGCATGAC GGCAAACCGT TCACTGCCAA GGACGCAAAG
TTCACCATGG CCCTTGCGAT GGACGCGAAA ACCGGCTCGG TGCTCGCCGC CCGGCTGAAG
GGCATATCGT CCGTCGAGAC GCCGGATGAG CACACTGTTG TCATCAAGCT CAGCGCCCCC
AGCAGCAGTT TTCCCGACAC GATGACCAAA GTGATGATGC TGCCCGAGCA TGCGCTCTCC
TCGATCCCGG CCGACCAGCT GACGAAGAAC ACCTGGTGGT CCACAGCTCC GATCGGCACC
GGTCCGTTCA AATTCACCAA ATACGTCTCG GATCAATATG TCGAACTTGC CGCAAACACC
GATTATCGCG GTGGCAAACC CGCACTGGAA CGCGTCATCA ATCGCTATTT CGCCAACCCG
GCCGCAGCAA TCGCTGCGCT GAGATCCGGC GAAATCCAGT TCACCTATGT CGATTCCAAC
GACGTGCCGA CCTTCAAGGA CAACAAGGAC TTCCAGGTCA TAGAAGGCAA CTCTTTCGTC
GTCAACTACC TGGGCTTCAA CCACGAATCC CCGCTCTGGA AGGACGTGCG CGTCCGCCAG
GCGGTGATGT ACGCGATCAA TCGCGATGCC ATCATCCAGA GCCTTTATGG CGGTGCGGCC
AAGCCTGCCA ACTGCGCCTA TGTCGCCGAA CAGCTGATAC CCCCTGATAT CGACAGCTAT
GCCTATGATC CCGAGAAGGC CAAGCAGTTG TTGACGGAAG CCGGCTGGGA CCAGATCAAC
GGCGGCAAGC AGATCACCCT TCTGACCTAT TACACCACGC CGCTGGCGAC CAACGTGCTT
GCCGCAGTCC AGGCGATGCT TGCCCAGGTC GGCATCAACA TCGTCCCGCG CGCCGTCGAT
GCGCCGACCT ATAACAGCAT CGTGCTCAAT GCGACGCCGG ATATCGCCCA GTTCCAGTTG
GTTTATGCCG GGCTGCAGAA CGGGCCGGAT GCCGGAAGCA TCAATGTCGG CCTCAACGAG
AAGCAGATCC CTCCGGCCGG GCCGAATGTC GCCAGGGTTC GCATGCCTGA CCTCACCAAG
GCGCTCGATA GCGCCTTGGC CGAGCCTGAT AGCACCAAGC GGGATGCCGC CTACCAGGAC
GTCTGCAAGG TGATGAACAC CAACCTGCCC TGGGCGACGC TCTGGGTGGC AAACCGCTAT
GGCATCGTCT CGACAAAGGT GAAGGATTTC GTCTGGACGC CAGCGCCGGG CGGCGGCCCC
TACCAGGCCA ATCCGCAGAA ATGGTCGATC GCCGAATAG
 
Protein sequence
MKRLSRLSVI ALGALLSTAA VPALVVSGVA IEAQAATLSG GFDVGPGGFQ GNFNPLAATA 
GFTWLSIYYE PLITYDEKLQ KVVGALASSY EVSSDQMTYT FKLVDAKWHD GKPFTAKDAK
FTMALAMDAK TGSVLAARLK GISSVETPDE HTVVIKLSAP SSSFPDTMTK VMMLPEHALS
SIPADQLTKN TWWSTAPIGT GPFKFTKYVS DQYVELAANT DYRGGKPALE RVINRYFANP
AAAIAALRSG EIQFTYVDSN DVPTFKDNKD FQVIEGNSFV VNYLGFNHES PLWKDVRVRQ
AVMYAINRDA IIQSLYGGAA KPANCAYVAE QLIPPDIDSY AYDPEKAKQL LTEAGWDQIN
GGKQITLLTY YTTPLATNVL AAVQAMLAQV GINIVPRAVD APTYNSIVLN ATPDIAQFQL
VYAGLQNGPD AGSINVGLNE KQIPPAGPNV ARVRMPDLTK ALDSALAEPD STKRDAAYQD
VCKVMNTNLP WATLWVANRY GIVSTKVKDF VWTPAPGGGP YQANPQKWSI AE