Gene Rleg_6144 details


Gene Information

Locus tag: Rleg_6144
Symbol:
ID: 8016101
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012852
Strand:
Start bp: 187980
End bp: 189581
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 63%
IMG OID: 644827450
Product: extracellular solute-binding protein family 5
Protein accession: YP_002978650
Protein GI: 241258766
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
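
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(187980, 189581)
print(length, "bp")                      # gene length
print(protein_length_aa(length), "aa")   # protein length
```

With the values in this record, the gene length comes out to 1602 bp and the protein length to 533 aa, matching the listed fields.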


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.5133
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.00348679
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCGTAT CATCTATCTC GCGGCGCACG CTGATGAAGG GCACTGCCCT GCTGCTTGCT 
TCGACGGCGC TCGCGCGACA GGCGCTTGCG CAAGCTGCTC CCGCCGGCGG CCGGCTGATC
GTTGCGGCCG ATTCCGAGCC GAAGAACCTC AATCCTGCGA TCGTCGCCTC GAACGGCGTC
TTCTTCGTCG CAAGCAAGGT GATCGAGCCG CTGGCCGAAG CGTCGTTCGA CGGCAAGGAC
GGGCTTGCGC CGCGCCTTGC CACCTCCTGG GAGGGCTCGG ATGACGGCCT CTCCGTCACC
TTCAAGCTGC GCGACGGCGT CACCTGGCAC GACGGCAAGC CGTTCACCTC AGTCGATGTC
GCCTTCTCCG CGCTCAATAT CTGGAAGCCG CTGCAGAATC TCGGCCGCCT GGTCTTCGCC
AATCTCGAAG CCGTCGACAC GCCCGACGAT TACACCGCCA TCTTCCGCTT CTCCAAGCCA
ACGCCGTTCC AATTGATCCG CAACGCGCTG CCTGTCGTCA CCAGCGTCGT CGCCAAGCAC
ATCTTCGACG GCACCGACAT CGCCACCAAC AACACGCTGA TCGGCACCGG CCCGTTCAAG
TTCGCCGAAC ACAAGCCTGG CGAATATTAC CGGCTGGCGC GCAACGAGAA TTATTGGGAC
AAGGACCAGC CGAAACTCGA TGAGATCGTC TTCCGCGTGC TGCCCGATCG CGAAGCGGCG
GGTTCGGCGC TCGAAGCCGA GGAAATCCAG CTTGCCGCCT TCTCGGCGGT GCCGCTGGCC
GATCTCGACC GCATCTCGAA GGTCGCCGGC ATCAAGGTGA TTTCGAAGGG CTATGAGGCT
TTGACCTATC AGCTCGTCGT CGAGATCAAT CACCGCCGCA AGGAGCTGGC CGACCTCAGG
GTCCGTCAGG CGATCGCGCA GGCGATCGAC AAGAAATTCG TGGTCGACAC GATCTTCCTG
GGTTACGCCG CCGCCGCGAC CGGCCCGGTG CCGAAGAATG CGCTGCAGTT TTATACGCCT
GACGTCGCGG CCTATGATTT CAATCCGGCT GCGGCCAACG ACATTCTCGA CAAGGCCGGA
TATAAGCAGG GCCCTGATGG CAACCGCTTT ACGCTGAAGC TCCGCCCCGC GCCCTATTTC
AACGAGACCC GCCAGTTCGG CGATTATCTT CGCCAGGCGC TGGCCGTGAT CGGCATCAAT
GCCGAGATCG TCAATGCCGA TGCGGCCGCA CACCAGAAGG CTGTTTATAC CGACCACGAC
TTCGACCTCG CCGTCGGCCC ACCGGTTTTC CGCGGCGATC CGGCGATCTC CACCACCATT
CTCGTCCAGT CCGGCACCCC AGCTGGCGTG CCCTTTTCCA ACCAGGGCGG CTACGTCAAT
CCGGAGCTCG ACAAGATCAT CAAGCAGGCC TCCGAAACCG TCGACACGGC GGCGCGCACC
GATCTCTACC GCAAGTTCCA GCAGCTCGTC GTCGCTGACC TGCCGCTGAT CAACGTCGCG
GAATGGGGCT TCATAACCGT TGCGCGCGAC ACCGTGCTTA ACGTCTCGAA CAATCCGCGC
TGGGCCGTCT CGAACTGGGG CGATACCGCG CTGCAATCGT GA
 
Protein sequence
MTVSSISRRT LMKGTALLLA STALARQALA QAAPAGGRLI VAADSEPKNL NPAIVASNGV 
FFVASKVIEP LAEASFDGKD GLAPRLATSW EGSDDGLSVT FKLRDGVTWH DGKPFTSVDV
AFSALNIWKP LQNLGRLVFA NLEAVDTPDD YTAIFRFSKP TPFQLIRNAL PVVTSVVAKH
IFDGTDIATN NTLIGTGPFK FAEHKPGEYY RLARNENYWD KDQPKLDEIV FRVLPDREAA
GSALEAEEIQ LAAFSAVPLA DLDRISKVAG IKVISKGYEA LTYQLVVEIN HRRKELADLR
VRQAIAQAID KKFVVDTIFL GYAAAATGPV PKNALQFYTP DVAAYDFNPA AANDILDKAG
YKQGPDGNRF TLKLRPAPYF NETRQFGDYL RQALAVIGIN AEIVNADAAA HQKAVYTDHD
FDLAVGPPVF RGDPAISTTI LVQSGTPAGV PFSNQGGYVN PELDKIIKQA SETVDTAART
DLYRKFQQLV VADLPLINVA EWGFITVARD TVLNVSNNPR WAVSNWGDTA LQS
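
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, translate the nucleotide sequence, and compute GC content so the listed protein and GC fields can be checked. It is an illustration only: the helper names are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for internal codons. Table 11 also permits alternative start codons, which this sketch does not handle (the gene above starts with ATG, so that case does not arise here).

```python
from itertools import product

# Standard codon-to-amino-acid assignment in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above, as printed.
first_block = "ATGACCGTAT CATCTATCTC GCGGCGCACG CTGATGAAGG GCACTGCCCT GCTGCTTGCT"
print(translate(first_block))  # compare with the start of the protein sequence
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence gives an end-to-end consistency check, and `gc_percent` can be compared against the GC content field.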