Gene Rleg2_6274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_6274 
Symbol 
ID6983347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011370 
Strand
Start bp220856 
End bp222391 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content55% 
IMG OID643399283 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002284039 
Protein GI209552123 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0245039 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTACC TGAATCGCAG ACGTTTCATG CAACTCTCCG CTGCCGTGGC AGCGTCGGGA 
TTTGCGCCTG GATTTGCAAG GGCCGAAGGT AAGCGCGGCG GCCATCTCCG CGTGGGTTTG
GCCGGAGGCT CGTCACAGGA TACCCTTGAT CAGCTCACCT ACGTGTCCGA TGCGACTTGG
ATATTTTCCA GCAACGTAAG AAACAACCTC GTTGAAATTG ATGAACTGAA CCAACAGGTG
CCTGGTCTCG CGGAGCGCTG GGAGGTTTCA CCGGACGCGA CCCGGTTCTC CTTTTTTATA
CGAAAAGGCG TGACCTTCCA CAGCGGAAAA ACTCTCACCG CCGACGATGT CGTTGCCTCT
CTCAATATTC ATCGGGGAGA GACATCCCAG TCGCCGGCAA AGGAGGAAAT GCGCGATGTT
GTCGACATCA AGGCTGACGG ATCCCGCGTC GATGTCACGT TCTCTACCCC CAACATCGAT
TTTCTTAGTC TCGTCACCAC GTTCAATTTT GGTATCCTGC CCGTTGCCGA TGGCAAGATC
GACAGGCTGA CGAAGGACGG CACCGGCCCT TACATGCTTG AAAGCTTCGA ACCCGGTCAA
AGCATAATCC TGAAACGAAA TCCCAATTAC TGGAAACCGG AGGCGGGCTT TTTCGATACC
GCTGAAGTGA CGTTCATCGA AGATGATGCC GCTCGCATGA ACGCTATTAG GACAGGCTTG
GTCGACGTCG TGAACAAGGT CGACTTAAAG ACCGCATCGG TGCTGAAGCG CGTCAAAGGT
ATCCGGGTGG AGGATATCAA GACTGAGCAA TTCAACTCCT TCGCCATGAT GATCGACACC
GCCCCGTTCA ATGACAACAA TATCCGGCTG GCATTGAAAT ACGGGGTCAA CCGCGAGGAA
CTGGTCAAAA AAATTCTCCT AGGTTACGGC TCGATCGGCA ATGACCATCC GGTCGGCGTC
ACCAACAAGT TTTTCAATTC ACAAATACAG CAGACCGAGT TCGACGCCGA CAAGGCGAAA
TACTACTTGA AGCAAGCTGG CCTCACGAGG CTCGACGTGT CGCTCAGCGC CTCAGATGCA
GGCTTTCCCG GCGCCGTCGG ATCCTCCTCC CTTTACCAGT CGTCCGCTGC GGCGGCGGGC
ATCAACATCA ACGTCGTCCG GGAACCCAAT GACGGCTTCT ATGAGAATGT CTGGTTGAAG
AAGCCGTTTG CGACTGTCTT CTGGGGGAAG CTCGCATCGG TCGGGCTGCA GTTTTCGCAA
GCGTATCTGC CCGGAGCCAC GTGGAACGAA ACCCATTGCA ATCTACCGCA GGTCACGGAG
TTGATCCGCA CCGCCCGCGG AATCGTGGAC GAAACGAAGC GAGGCGAGAT CTATCACGAA
CTCCAGTCGG TCATTCACGA ACAAGGGGGA TCGATCATCC CGATGTTCAC CAATTTCGTC
TGGGCCGTGC GAGACAACGT GCAGCACGGA CCGAACTTGC AGAACGATCT GACACTGGAC
GGTCTGAAAT GCTTTCAGCG TTGGTGGTTC GCCTAG
 
Protein sequence
MQYLNRRRFM QLSAAVAASG FAPGFARAEG KRGGHLRVGL AGGSSQDTLD QLTYVSDATW 
IFSSNVRNNL VEIDELNQQV PGLAERWEVS PDATRFSFFI RKGVTFHSGK TLTADDVVAS
LNIHRGETSQ SPAKEEMRDV VDIKADGSRV DVTFSTPNID FLSLVTTFNF GILPVADGKI
DRLTKDGTGP YMLESFEPGQ SIILKRNPNY WKPEAGFFDT AEVTFIEDDA ARMNAIRTGL
VDVVNKVDLK TASVLKRVKG IRVEDIKTEQ FNSFAMMIDT APFNDNNIRL ALKYGVNREE
LVKKILLGYG SIGNDHPVGV TNKFFNSQIQ QTEFDADKAK YYLKQAGLTR LDVSLSASDA
GFPGAVGSSS LYQSSAAAAG ININVVREPN DGFYENVWLK KPFATVFWGK LASVGLQFSQ
AYLPGATWNE THCNLPQVTE LIRTARGIVD ETKRGEIYHE LQSVIHEQGG SIIPMFTNFV
WAVRDNVQHG PNLQNDLTLD GLKCFQRWWF A