Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_6559 |
Symbol | |
ID | 6983629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011371 |
Strand | - |
Start bp | 233025 |
End bp | 234635 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643399555 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002284311 |
Protein GI | 209552396 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.67626 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.403463 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATTC CAGAAATCTC GCGGCGTACG CTGATGAAGG GCACCGCCCT TCTGCTTGCC TCGACGGCGC TCACCCGACA GGTGCTGGCG CAGGCCGCGC CCGCCGGCGG CCGGCTGATT GTTGCGGCCG ATTCCGAACC GAAGAATCTC AATCCTGCCA TCGTCGCCTC GAACGGCGTC TTCTTCATCG CCAGCAAGGT GATCGAACCA TTGGCCGAAG CCTCGTTCGA GGGCAAGGAC GGGCTTGCGC CGCGGCTTGC CACCTCCTGG GAGGGTTCGG CGGACGGTCT CTCCGTTACC TTCAAGCTGC GCGACGGCGT TACTTGGCAC GACGGCAAGC CGTTCACCTC ACTCGATGTC GCCTTCTCCG CCCTCAACAT CTGGAAGCCG CTGCAGAATC TCGGCCGCCT GGTCTTTGCC AATCTCGAAG CCGTCGATAC TCCTGACGAT TACACCGCCG TGTTCCGCTT CTCCAAGCCG ACGCCGTTCC AGCTCATCCG CAACGCCCTG CCCGTCGTCA CCAGCGTCGT TGCCAAGCAC ATCTTCGACG GCAGCGATAT CGCCGCCAAT CCGGCCAACA ACACGCTCGT CGGCACCGGT CCGTTCAAAT TCGCCGAATA CAAGCCCGGC GAATATTACC GCCTGACGCG CAACGAGAAC TACTGGGACA ACGATCAGCC GAAGCTCGAC GAGATCGTCT TCCGGGTGCT GCCCGACCGC GCGTCGGCCG GGGCGGCGCT CGAAGCCGAC GAAATCCAGC TGGCTGCCTT CTCGGCGGTG CCGCTGGCCG ATCTCGACCG CATCTCCAAA GTCGAGGGCA TCAAGGTGAT CTCGAAGGGG TATGAGGCCT TGACCTACCA GCTCGTCGTC GAGATCAATC ACCGCCGCAA GGAACTCGCC GACCTCAGGG TCCGTCAGGC GATCGCGCAG GCGATCGACA AGAAATTCGT GGTCGACACG ATCTTCCTCG GTTATGCCGC CGCCGCCACA GGCCCCGTGC CGAAGAATGC GCCGGAATTC TATACCTCCG ATGTCGCGAG TTATGATTTC AATCCTGCCG CCGCCAACGA TATTCTCGAC AAGGCCGGGT ACAAACAGGG AGCGGACGGC AACCGTTTCA AGCTGAAGCT TCGCCCCGCG CCCTATTTCA ACGAGACCCG CCAATTCGGC GATTACCTTC GCCAGGCGCT TGCGGTGATC GGCATCGATG CGGAGATCGT CAACGCCGAC GCGGCCGCCC ATCAGAAGGC TGTTTATACC GACCACGATT TCGACCTCGC CATCGGCCCG CCGGTCTTCC GCGGCGATCC GGCGATCTCC ACCACCATTC TCGTCCAATC CGGCACGCCT GCTGGTGTGC CCTTCTCCAA CCAGGGCGGC TACGTCAATC CGGAGCTCGA CAAGATCATC AAGCAGGCCT CCGAGACCGT CGACACGGCG GCGCGCACCG ATCTCTACCG CAAGTTCCAG CAGCTCGTCG CCGCCGACTT GCCGCTGATC AACGTGGCGG AATGGGGCTT CATCACCGTT GCCCGCGACA CCGTGCTCAA CGTCTCCGAC AATCCGCGCT GGGCCGTCTC GAACTGGGGC GATACCGCGC TGCAGTCGTG A
|
Protein sequence | MTIPEISRRT LMKGTALLLA STALTRQVLA QAAPAGGRLI VAADSEPKNL NPAIVASNGV FFIASKVIEP LAEASFEGKD GLAPRLATSW EGSADGLSVT FKLRDGVTWH DGKPFTSLDV AFSALNIWKP LQNLGRLVFA NLEAVDTPDD YTAVFRFSKP TPFQLIRNAL PVVTSVVAKH IFDGSDIAAN PANNTLVGTG PFKFAEYKPG EYYRLTRNEN YWDNDQPKLD EIVFRVLPDR ASAGAALEAD EIQLAAFSAV PLADLDRISK VEGIKVISKG YEALTYQLVV EINHRRKELA DLRVRQAIAQ AIDKKFVVDT IFLGYAAAAT GPVPKNAPEF YTSDVASYDF NPAAANDILD KAGYKQGADG NRFKLKLRPA PYFNETRQFG DYLRQALAVI GIDAEIVNAD AAAHQKAVYT DHDFDLAIGP PVFRGDPAIS TTILVQSGTP AGVPFSNQGG YVNPELDKII KQASETVDTA ARTDLYRKFQ QLVAADLPLI NVAEWGFITV ARDTVLNVSD NPRWAVSNWG DTALQS
|
| |