Gene Rleg2_6448 details

Gene Information

Locus tag: Rleg2_6448
Symbol:
ID: 6983519
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011371
Strand:
Start bp: 110113
End bp: 111750
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 62%
IMG OID: 643399445
Product: extracellular solute-binding protein family 5
Protein accession: YP_002284201
Protein GI: 209552286
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
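
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (the record does not state this, but the reported gene length matches that convention) and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS ending in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(110113, 111750)
print(length)                     # 1638
print(protein_length_aa(length))  # 545
```

Both values agree with the Gene Length and Protein Length fields of this record.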


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.00292129
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.318047
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTTTTG ATCCATCTGA TAAAACCAAT CTATCGCGCC GCAACGCGCT GAAGCTCGGT 
CTGGCGGCCG GCGTCGGCCT CACCGTGTTC GGGATGAATG CCCGCATCGT GATGGCCGAT
GAAGGCCAGG TCCTGAAGGT CGCACATCCG GCCTTCGACC AGGACTGGTC GCCGCTGCGC
GGCGGCGGCA GGACGTTCCG CTGGAATTCG ATCTGGTGGG CTTCGCCGAT GTATTTCGAC
AGCCAGGGCA ATATCAAGCC TTACGTCTTC GCCAGCTGGG AATCGGCCGA CAACACGGTG
TGGACCTTCA AGATCGACCC GAAGGCCGTC TTCTCCGACG GCAGCAAGAT CACCTCGGCC
GACGTCAAGG GATCGTGGGA AGTCGCCTCG ATGCCGAACA CCAAGAGCCA GCGCGCCGAC
CAGGTGCTGA GCAAGGTCAA GGGTTACGCC GAAATCGCCG CCGGTTCCGG CAAGGAGCTG
ACCGGTGTGG CGACTCCTGA TGAGGGAACA GTCGTGGTGA CGCTCGCCGC TGCCGATCCG
ATCTTCTTCA TGCGTCTCGC AAACCACATC GCGCCGATCA CCAAAGCGTC GCAATCGCGC
GGCAGCGACG GCGAGGAAAT CATCGACTGG TATAAGCCCG AAAACAAGCC GGTCTTCTCC
GGCCCCTTCA AGCTGACGAG CATCGATATC GATGCCGGCA AGATCACATT CGAGCCGAAT
GAAAACTTCT TTGGGTCGAA GCCGAAGCTT GCCCGCATCG ACATCACCTC GATCGAGGAC
AATGTGACGG CGACCTCGCT GATCAAGTCC GGCGAGTTCA ACGCCCATAC CGAACTCGTT
ACCTCGACGA TCATCCAGGA TCTCGGCCCA GAATTCTCGG CCGGCCCGCT GATCCCGACC
AGCCAGCACT TCTGGTTCAA CATCTCCCGC GCGCCGATGG ACGATCCGAA GGTCCGCCAG
GCGCTGATCA TGGCGGTCGA TCGCGACGGC CTGTTCAAGG CGTCCTATCC CGATGGGCCG
CACAAGAAGG CCGATCAGAT CCTCAATTCG GTTCCCGGCG CCGACAATTC CGGCTTCGAG
TCCTTTCCCT ATGATCCGGC AGCCGCCAAG AAGCTGCTTG CCGAATCGAG CTATGGCGGG
CCCGAGCGCC TGCCGAAGAT CCTGTTCGTC GGCATTTCGG CGCCGGCCAT TCAGGCCGCC
GCCCAGTTCA TCGCCGAGCA GTGGCGCCAG AATCTCGGCA TCACGGCCGT CGACATGAAA
CCGCAACAGG ACGCCTATGC CGGCCCGGAC CAGAACTCGG TGCAAATCTT CCGCGACGAC
GTCGGCACCC GTGTCCCCGA CGCCGTTTCG TATCTGGCGG GCAGCATCGC CTCGACCTCG
TCGAACGCGC AGAACAAGCT CGGCGGATAC AAGAACGACA AGGTCGACAG CGCCCTTGCC
GAAGCGGCGA CCAAGGCTGC GGACGATCCG CAGCGCATCT CTCTCGCCCA GGAGGCCCAG
AAGGCGTTCC GCGACGATTG GGCCTTCATC CCGTGGTATT CTCAGGCGAT GTCGCGCTGG
GCCACCAAGG AGGTCAAGGG CATGGAGAAG AACCTCGACT GGCAGATAGC CGAACCCTGG
AACATTTCGA TCGGTTGA
 
Protein sequence
MSFDPSDKTN LSRRNALKLG LAAGVGLTVF GMNARIVMAD EGQVLKVAHP AFDQDWSPLR 
GGGRTFRWNS IWWASPMYFD SQGNIKPYVF ASWESADNTV WTFKIDPKAV FSDGSKITSA
DVKGSWEVAS MPNTKSQRAD QVLSKVKGYA EIAAGSGKEL TGVATPDEGT VVVTLAAADP
IFFMRLANHI APITKASQSR GSDGEEIIDW YKPENKPVFS GPFKLTSIDI DAGKITFEPN
ENFFGSKPKL ARIDITSIED NVTATSLIKS GEFNAHTELV TSTIIQDLGP EFSAGPLIPT
SQHFWFNISR APMDDPKVRQ ALIMAVDRDG LFKASYPDGP HKKADQILNS VPGADNSGFE
SFPYDPAAAK KLLAESSYGG PERLPKILFV GISAPAIQAA AQFIAEQWRQ NLGITAVDMK
PQQDAYAGPD QNSVQIFRDD VGTRVPDAVS YLAGSIASTS SNAQNKLGGY KNDKVDSALA
EAATKAADDP QRISLAQEAQ KAFRDDWAFI PWYSQAMSRW ATKEVKGMEK NLDWQIAEPW
NISIG
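
The sequence blocks above are printed in groups of 10 characters, so they need to be joined before any computation. The following is a minimal sketch of how a GC content figure like the one in the Gene Information fields could be derived from the gene sequence. It assumes GC content means the fraction of G and C bases over the full sequence length; the record does not say how its value was computed or rounded. The helper names are hypothetical, and the example input is only the final line of the gene sequence, not the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Final line of the gene sequence block, used here only as a short input.
last_line = "AACATTTCGA TCGGTTGA"
seq = clean_sequence(last_line)
print(len(seq))            # 18
print(seq.endswith("TGA"))  # True: the CDS ends in a stop codon
print(gc_fraction(seq))     # GC fraction of this fragment only
```

Passing the full gene sequence block through `clean_sequence` and `gc_fraction` gives the value for the whole gene, which can then be compared with the reported GC content.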