Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6144 |
Symbol | |
ID | 8016101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012852 |
Strand | - |
Start bp | 187980 |
End bp | 189581 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644827450 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002978650 |
Protein GI | 241258766 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.5133 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.00348679 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGTAT CATCTATCTC GCGGCGCACG CTGATGAAGG GCACTGCCCT GCTGCTTGCT TCGACGGCGC TCGCGCGACA GGCGCTTGCG CAAGCTGCTC CCGCCGGCGG CCGGCTGATC GTTGCGGCCG ATTCCGAGCC GAAGAACCTC AATCCTGCGA TCGTCGCCTC GAACGGCGTC TTCTTCGTCG CAAGCAAGGT GATCGAGCCG CTGGCCGAAG CGTCGTTCGA CGGCAAGGAC GGGCTTGCGC CGCGCCTTGC CACCTCCTGG GAGGGCTCGG ATGACGGCCT CTCCGTCACC TTCAAGCTGC GCGACGGCGT CACCTGGCAC GACGGCAAGC CGTTCACCTC AGTCGATGTC GCCTTCTCCG CGCTCAATAT CTGGAAGCCG CTGCAGAATC TCGGCCGCCT GGTCTTCGCC AATCTCGAAG CCGTCGACAC GCCCGACGAT TACACCGCCA TCTTCCGCTT CTCCAAGCCA ACGCCGTTCC AATTGATCCG CAACGCGCTG CCTGTCGTCA CCAGCGTCGT CGCCAAGCAC ATCTTCGACG GCACCGACAT CGCCACCAAC AACACGCTGA TCGGCACCGG CCCGTTCAAG TTCGCCGAAC ACAAGCCTGG CGAATATTAC CGGCTGGCGC GCAACGAGAA TTATTGGGAC AAGGACCAGC CGAAACTCGA TGAGATCGTC TTCCGCGTGC TGCCCGATCG CGAAGCGGCG GGTTCGGCGC TCGAAGCCGA GGAAATCCAG CTTGCCGCCT TCTCGGCGGT GCCGCTGGCC GATCTCGACC GCATCTCGAA GGTCGCCGGC ATCAAGGTGA TTTCGAAGGG CTATGAGGCT TTGACCTATC AGCTCGTCGT CGAGATCAAT CACCGCCGCA AGGAGCTGGC CGACCTCAGG GTCCGTCAGG CGATCGCGCA GGCGATCGAC AAGAAATTCG TGGTCGACAC GATCTTCCTG GGTTACGCCG CCGCCGCGAC CGGCCCGGTG CCGAAGAATG CGCTGCAGTT TTATACGCCT GACGTCGCGG CCTATGATTT CAATCCGGCT GCGGCCAACG ACATTCTCGA CAAGGCCGGA TATAAGCAGG GCCCTGATGG CAACCGCTTT ACGCTGAAGC TCCGCCCCGC GCCCTATTTC AACGAGACCC GCCAGTTCGG CGATTATCTT CGCCAGGCGC TGGCCGTGAT CGGCATCAAT GCCGAGATCG TCAATGCCGA TGCGGCCGCA CACCAGAAGG CTGTTTATAC CGACCACGAC TTCGACCTCG CCGTCGGCCC ACCGGTTTTC CGCGGCGATC CGGCGATCTC CACCACCATT CTCGTCCAGT CCGGCACCCC AGCTGGCGTG CCCTTTTCCA ACCAGGGCGG CTACGTCAAT CCGGAGCTCG ACAAGATCAT CAAGCAGGCC TCCGAAACCG TCGACACGGC GGCGCGCACC GATCTCTACC GCAAGTTCCA GCAGCTCGTC GTCGCTGACC TGCCGCTGAT CAACGTCGCG GAATGGGGCT TCATAACCGT TGCGCGCGAC ACCGTGCTTA ACGTCTCGAA CAATCCGCGC TGGGCCGTCT CGAACTGGGG CGATACCGCG CTGCAATCGT GA
|
Protein sequence | MTVSSISRRT LMKGTALLLA STALARQALA QAAPAGGRLI VAADSEPKNL NPAIVASNGV FFVASKVIEP LAEASFDGKD GLAPRLATSW EGSDDGLSVT FKLRDGVTWH DGKPFTSVDV AFSALNIWKP LQNLGRLVFA NLEAVDTPDD YTAIFRFSKP TPFQLIRNAL PVVTSVVAKH IFDGTDIATN NTLIGTGPFK FAEHKPGEYY RLARNENYWD KDQPKLDEIV FRVLPDREAA GSALEAEEIQ LAAFSAVPLA DLDRISKVAG IKVISKGYEA LTYQLVVEIN HRRKELADLR VRQAIAQAID KKFVVDTIFL GYAAAATGPV PKNALQFYTP DVAAYDFNPA AANDILDKAG YKQGPDGNRF TLKLRPAPYF NETRQFGDYL RQALAVIGIN AEIVNADAAA HQKAVYTDHD FDLAVGPPVF RGDPAISTTI LVQSGTPAGV PFSNQGGYVN PELDKIIKQA SETVDTAART DLYRKFQQLV VADLPLINVA EWGFITVARD TVLNVSNNPR WAVSNWGDTA LQS
|
| |