Gene Rleg2_6559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_6559 
Symbol 
ID6983629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011371 
Strand
Start bp233025 
End bp234635 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content63% 
IMG OID643399555 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002284311 
Protein GI209552396 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.67626 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.403463 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATTC CAGAAATCTC GCGGCGTACG CTGATGAAGG GCACCGCCCT TCTGCTTGCC 
TCGACGGCGC TCACCCGACA GGTGCTGGCG CAGGCCGCGC CCGCCGGCGG CCGGCTGATT
GTTGCGGCCG ATTCCGAACC GAAGAATCTC AATCCTGCCA TCGTCGCCTC GAACGGCGTC
TTCTTCATCG CCAGCAAGGT GATCGAACCA TTGGCCGAAG CCTCGTTCGA GGGCAAGGAC
GGGCTTGCGC CGCGGCTTGC CACCTCCTGG GAGGGTTCGG CGGACGGTCT CTCCGTTACC
TTCAAGCTGC GCGACGGCGT TACTTGGCAC GACGGCAAGC CGTTCACCTC ACTCGATGTC
GCCTTCTCCG CCCTCAACAT CTGGAAGCCG CTGCAGAATC TCGGCCGCCT GGTCTTTGCC
AATCTCGAAG CCGTCGATAC TCCTGACGAT TACACCGCCG TGTTCCGCTT CTCCAAGCCG
ACGCCGTTCC AGCTCATCCG CAACGCCCTG CCCGTCGTCA CCAGCGTCGT TGCCAAGCAC
ATCTTCGACG GCAGCGATAT CGCCGCCAAT CCGGCCAACA ACACGCTCGT CGGCACCGGT
CCGTTCAAAT TCGCCGAATA CAAGCCCGGC GAATATTACC GCCTGACGCG CAACGAGAAC
TACTGGGACA ACGATCAGCC GAAGCTCGAC GAGATCGTCT TCCGGGTGCT GCCCGACCGC
GCGTCGGCCG GGGCGGCGCT CGAAGCCGAC GAAATCCAGC TGGCTGCCTT CTCGGCGGTG
CCGCTGGCCG ATCTCGACCG CATCTCCAAA GTCGAGGGCA TCAAGGTGAT CTCGAAGGGG
TATGAGGCCT TGACCTACCA GCTCGTCGTC GAGATCAATC ACCGCCGCAA GGAACTCGCC
GACCTCAGGG TCCGTCAGGC GATCGCGCAG GCGATCGACA AGAAATTCGT GGTCGACACG
ATCTTCCTCG GTTATGCCGC CGCCGCCACA GGCCCCGTGC CGAAGAATGC GCCGGAATTC
TATACCTCCG ATGTCGCGAG TTATGATTTC AATCCTGCCG CCGCCAACGA TATTCTCGAC
AAGGCCGGGT ACAAACAGGG AGCGGACGGC AACCGTTTCA AGCTGAAGCT TCGCCCCGCG
CCCTATTTCA ACGAGACCCG CCAATTCGGC GATTACCTTC GCCAGGCGCT TGCGGTGATC
GGCATCGATG CGGAGATCGT CAACGCCGAC GCGGCCGCCC ATCAGAAGGC TGTTTATACC
GACCACGATT TCGACCTCGC CATCGGCCCG CCGGTCTTCC GCGGCGATCC GGCGATCTCC
ACCACCATTC TCGTCCAATC CGGCACGCCT GCTGGTGTGC CCTTCTCCAA CCAGGGCGGC
TACGTCAATC CGGAGCTCGA CAAGATCATC AAGCAGGCCT CCGAGACCGT CGACACGGCG
GCGCGCACCG ATCTCTACCG CAAGTTCCAG CAGCTCGTCG CCGCCGACTT GCCGCTGATC
AACGTGGCGG AATGGGGCTT CATCACCGTT GCCCGCGACA CCGTGCTCAA CGTCTCCGAC
AATCCGCGCT GGGCCGTCTC GAACTGGGGC GATACCGCGC TGCAGTCGTG A
 
Protein sequence
MTIPEISRRT LMKGTALLLA STALTRQVLA QAAPAGGRLI VAADSEPKNL NPAIVASNGV 
FFIASKVIEP LAEASFEGKD GLAPRLATSW EGSADGLSVT FKLRDGVTWH DGKPFTSLDV
AFSALNIWKP LQNLGRLVFA NLEAVDTPDD YTAVFRFSKP TPFQLIRNAL PVVTSVVAKH
IFDGSDIAAN PANNTLVGTG PFKFAEYKPG EYYRLTRNEN YWDNDQPKLD EIVFRVLPDR
ASAGAALEAD EIQLAAFSAV PLADLDRISK VEGIKVISKG YEALTYQLVV EINHRRKELA
DLRVRQAIAQ AIDKKFVVDT IFLGYAAAAT GPVPKNAPEF YTSDVASYDF NPAAANDILD
KAGYKQGADG NRFKLKLRPA PYFNETRQFG DYLRQALAVI GIDAEIVNAD AAAHQKAVYT
DHDFDLAIGP PVFRGDPAIS TTILVQSGTP AGVPFSNQGG YVNPELDKII KQASETVDTA
ARTDLYRKFQ QLVAADLPLI NVAEWGFITV ARDTVLNVSD NPRWAVSNWG DTALQS