Gene Rleg2_5891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5891 
Symbol 
ID6977280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011366 
Strand
Start bp301634 
End bp303538 
Gene Length1905 bp 
Protein Length634 aa 
Translation table11 
GC content61% 
IMG OID643393346 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002278164 
Protein GI209546274 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACAGT TGAAGAGGCT CGCATTTGGC GTCGCGCTGG CCGCACTCGG CCTGACGGCG 
GCGGCCAAAG CCGATGACTA TACCTCATTG CCGCGCAAGG AGACGCTCAT AGTCGAAAAT
CCGGAAGGGA CGATCAAAAA TCCCGGCTGG TTCAACATCT GGGTCAATGC CGGCGCCGGT
GTCTCCACCG GTCTGCAGCA GCTGACCATG GATACGCTCT GGTATATCGA CCCCGAACAA
GGGCTCGGCG GCGCGACCTG GGATAATTCG CTGGCCGCCG ACAAGCCGCA ATATAATGCC
GACTTTACCG AAATGACCGT GAAACTGCGC AAGGGGCTCT TCTGGAGCGA CGGCGTCGAG
TTCACGGCCG ACGACGTGGT CTATACCGTC AAGACGCAGA TGGATCATCC CGGCATGGTC
TGGAGTGCAG CCTTCTCGGT GCAGGTGGCA AGCGTCGAGG CGACCGATCC TCAGACCGTG
GTGTTCAAGC TGAAGAAGCC CAATTCGCGC TTCCACGCCC TTTTCACCGT TCGCTGGAAC
GGCGCATGGA TCATGCCCAA GCATGTGTTC GAGAAGGTCG CCGATCCGCT TCGCTACGAT
TTCGCCAATC CGGTTTCGCT CGGCGCCTAC AAGCTCAAGG CCTACGATCC TCAGGGCAAG
TGGTACACCT GGGAGAAACG CGACGACTGG CAGAAGACAT CGCTTGCCCG CTTCGGCGAG
CCGGCCCCGA AATATGTAAC TTACGTCGAC CCCGGCCCGC CGGATAAACG CACCATCGCC
CAGCTCGAGC ACAATCTCGA TATCATCCAC GACAATACGC CTGAGGGCAT GTTCACCCTC
AAGGAGAAAT CCAAGTCGGT CGAGACCTGG TTCCCGGGCT TCCCCTTCGC CCATCCGGAT
CCGACGCTGC CGGCTGTCAT TTTCAACACC CAGGATCCGA CCTTCAACAA TCCTGACGTG
CGCTGGGCGC TGGCCCTGCT GATCGACATC AAGGCCGTCG ACATGGCGAG CTATCGCGGC
ACCGCCACGC TCTCGGCACT CGGTGTGCCG CCGACGGCGA TCGCCATGAA AGACTATCAG
GCGCCGATGC AGGATTGGCT GAAGGATTTC GAGATCGACA CCGGCAAGCA GAAGATCAAG
CCTTATGACC CGACGATCGG GCAGCAGGTC GCCGATCTCC TGCGCAAGCA GCCGAAGTTC
AAGGATCAGA TCCCGACCGA TCCTAAGGCG ATCAGCACGG CCTTCGGCTA TGGCTGGTGG
AAGCCCAACC CGCAGGCAGC CGCCGAGCTG CTTCAAAAGG CGGGCTTCAA GAAGAGCGGC
GGCAAATGGA TGACCCCTGA TGGCCAGCCG TTCAGGATCC GGATGACCGT CGAGGGCGAC
ACCCGCTCCG TCTTCACCCG GGCAGGCACG CTGATCGCCC AGCAATGGGC CGCCTTCGGC
ATCGATGCCA AAGCCGTACC GACGACCAAC CTGTGGCAGG TTGCACTGCA GCCTGGCGAT
TTCCAGGTGG CGATTGCCTG GAGCGTCGAA ACCTGGGGCG GCGATCCCGA CCTGTCCTTC
TTCCTCGACA GCTGGCACTC GCAGTTCGTG GCCAAGAAGG GTGAGAACCA GCCGCCGCGC
AACTGGCAGC GCTGGAGCAA TCCGGAGCTC GACAAGATCA TCGAGACCAT CCGCGGCATC
AGCGCCGACG ATCCGAAGGG CATCGAACTC GGCAAGGATT ATCTGAAGCT GGTTGCCCGC
GAAATGCCGA CGATCCCGCT GATGTCCTAT AACGTCTTCA CCTCGATGGA TACGACCTAT
TGGACCGGTT ATCCAACGAT CAAGGACCCC TATACCGACC CGGTGCCGAA CTGGGCCAAC
TCCAGGCTGA TGATGGTCAA GCTGAAGCCG GCCCAACCGA AATAA
 
Protein sequence
MQQLKRLAFG VALAALGLTA AAKADDYTSL PRKETLIVEN PEGTIKNPGW FNIWVNAGAG 
VSTGLQQLTM DTLWYIDPEQ GLGGATWDNS LAADKPQYNA DFTEMTVKLR KGLFWSDGVE
FTADDVVYTV KTQMDHPGMV WSAAFSVQVA SVEATDPQTV VFKLKKPNSR FHALFTVRWN
GAWIMPKHVF EKVADPLRYD FANPVSLGAY KLKAYDPQGK WYTWEKRDDW QKTSLARFGE
PAPKYVTYVD PGPPDKRTIA QLEHNLDIIH DNTPEGMFTL KEKSKSVETW FPGFPFAHPD
PTLPAVIFNT QDPTFNNPDV RWALALLIDI KAVDMASYRG TATLSALGVP PTAIAMKDYQ
APMQDWLKDF EIDTGKQKIK PYDPTIGQQV ADLLRKQPKF KDQIPTDPKA ISTAFGYGWW
KPNPQAAAEL LQKAGFKKSG GKWMTPDGQP FRIRMTVEGD TRSVFTRAGT LIAQQWAAFG
IDAKAVPTTN LWQVALQPGD FQVAIAWSVE TWGGDPDLSF FLDSWHSQFV AKKGENQPPR
NWQRWSNPEL DKIIETIRGI SADDPKGIEL GKDYLKLVAR EMPTIPLMSY NVFTSMDTTY
WTGYPTIKDP YTDPVPNWAN SRLMMVKLKP AQPK