Gene Rleg_4491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4491 
Symbol 
ID8015253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4622177 
End bp4623706 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content61% 
IMG OID644827067 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002978268 
Protein GI241207172 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAAGT TTTCGCTTGC GCCTTCAGCG CGTTTCGCCC GGCGGCTTTC GCTCGGTGCC 
GCACTTTCGG CCGGCCTGGT GATGACGGCG ATGACGCCGG CCGAGGCGGC AAAGACTACC
CTCAATCTCG GCATGAGCGT CGAGCCGGCC GGTCTCGACC CGACGATCGC AGCACCGGTC
GCGATCGGCC AGGTGACCTG GCAGAACGTG TTCGAAGGGC TGGTGACGAT CGACCAGTCC
GGCAAGATCC AGCCGCAGCT GGCAAAAAAC TGGGAGATCT CTCCCGATGG CCTGACCTAT
ACGTTCAAGC TGCAGACCGG CGTCAAATTC CATGACGGCG AGGCCTTCGA TGCCACTGCC
GCCAAGTTTT CGCTCGACCG TGCCCGTGGC GCCGATTCGG TCAATCCGCA GAAGCGCTTC
TTCGCTTCGA TCGCCTCGAT CGATACGCCG GATGCCGAAA CGCTGGTGCT GCATCTCTCT
GCGCCGACCG GCAGCCTGAT CTACTGGCTC GGCTGGCCAG CCTCTGTGAT GGTCGCACCG
AAGACGGCTG CCGACGACAA GACGACGCCA GTGGGGACCG GCCCCTTCAG TTTCGCCAGC
TGGGCGAAGG GCGACAAGGT CGAACTCACC AGGAATGCCG ATTATTGGAA CAAGGATGCG
GCCGCCAAGC TCGACAAGGT GACCTTCCGC TTCATCGCCG ATCCGCAGGC GCAGGCGGCA
GCGCTGAAAT CCGGCGATCT CGATGCCTTT CCGGAATTTG CCGCGCCTGA GCTGATGAGT
TCTTTCGACG GCGATGCGAG GCTCGTCACC AAGATCGGCA ATACCGAGCT CAAGGTCGTT
GCCGGCATGA ACACTGCCAA GAAGCCGTTC GACGACAAAC GCGTCCGCCA AGCGCTGATG
ATGGCGATCG ACCGCAAGAC GGTGATCGAC GGCGCATGGT CGGGCCTCGG CACGCCGATC
GGCAGCCACT ACACGCCGAA CGATCCGGGC TATCAGGACA TGACAGGCGT GCTGCCTTAC
GACGTCGAGA AGGCGAAGGC GCTGCTTGCC GAAGCAGGCT ACCCCAACGG TTTCACCTTC
ACGATCAAAT CGCCGCAGAT GGCTTATGCG CCGCGCAGCG CCCAGGTGAT GCAGGCGATG
TTTGCCGAGA TCGGCGTGAC GATGAATATC GAGCCGACGG AATTTCCGGC GAAATGGGTC
CAGGACATCA TGAAGGACCG CAACTTCGAC ATGACGATCG TCGCCCATGC CGAACCGCTC
GACATCGACA TCTATGCGCG CGATCCCTAT TATTTCAATT ATAAGAACCC CGCTTTCAAC
GCGCTGATGA AGAAGGTTCA GGAGACGACC GATCCCGCCG CGCAGAATGC GATCTATGGC
GAAGCGCAGA AGATCCTCGC CGAGGACGTG CCGGCGCTCT ACCTCTTCGT CATGCCGAAA
CTCGGCGTCT GGGACAAGAA GCTGAAGGGC CTGTGGGAGA ACGAGCCTAT CCCTTCCAAC
GTGCTGACTG GTGTTTCCTG GGACGAGTGA
 
Protein sequence
MIKFSLAPSA RFARRLSLGA ALSAGLVMTA MTPAEAAKTT LNLGMSVEPA GLDPTIAAPV 
AIGQVTWQNV FEGLVTIDQS GKIQPQLAKN WEISPDGLTY TFKLQTGVKF HDGEAFDATA
AKFSLDRARG ADSVNPQKRF FASIASIDTP DAETLVLHLS APTGSLIYWL GWPASVMVAP
KTAADDKTTP VGTGPFSFAS WAKGDKVELT RNADYWNKDA AAKLDKVTFR FIADPQAQAA
ALKSGDLDAF PEFAAPELMS SFDGDARLVT KIGNTELKVV AGMNTAKKPF DDKRVRQALM
MAIDRKTVID GAWSGLGTPI GSHYTPNDPG YQDMTGVLPY DVEKAKALLA EAGYPNGFTF
TIKSPQMAYA PRSAQVMQAM FAEIGVTMNI EPTEFPAKWV QDIMKDRNFD MTIVAHAEPL
DIDIYARDPY YFNYKNPAFN ALMKKVQETT DPAAQNAIYG EAQKILAEDV PALYLFVMPK
LGVWDKKLKG LWENEPIPSN VLTGVSWDE