Gene Information |
Locus tag | Rleg_6191 |
Symbol | |
ID | 8016204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012852 |
Strand | - |
Start bp | 234806 |
End bp | 236320 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644827497 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002978697 |
Protein GI | 241258813 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
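The coordinate and length fields in this table can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are inclusive 1-based positions and that the final codon is a stop codon that does not contribute a residue. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive coordinates and a single terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1           # stop codon is not translated
    return gene_length, protein_length


# Values copied from the table above (locus tag Rleg_6191).
gene_length, protein_length = cds_lengths(234806, 236320)
print(gene_length, protein_length)
```

Under these assumptions the result agrees with the Gene Length (1515 bp) and Protein Length (504 aa) fields.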
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.172417 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.779113 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAACA GGTGGAAATC AATTGGGCTT GCAGCCTTGC TCGCCGGCCT GACGCTCAGC GCAAGCTATG CCGAGGCCGC CGGTGTACTC ACCATCGGCC GCCGCGAGGA TTCGACGACA TTCGATCCGA TCAAGACCGC GCAGAACATC GACAACTGGG TGTTCTCCAA CGTCTACGAC GTGCTGATCC GGGTCGACAA GACAGGCACG AAACTGGAGC CGGGCCTTGC CGAAAGCTGG ACTGCCTCGG ATGACGGGCT GACCTACACG TTCAAGATCC GCGACGCGAA ATTTTCCGAC GGTTCGCCGC TGACGGCGGA AGACGCGGCC TACAGCCTGC TGCGCATCCG CGACGACGCT GCCTCGCTCT GGAGCGATTC CTATAAGGTG ATCGACACGG CGGTGGCGAC CGACGCGCAC ACGCTGACGA TCAAGCTCAA GAACCCGTCC GCGCCGTTCC TTTCGACGCT GGCGCTGCCG AATGCCTCCG TCATCTCCAA GAAGGGCATG GAATCGCTGG GCGCCGACGC CTATGGCGAA AAGCCGATCG CGTCCGGCGC ATTCACCGTC GAGGAATGGC GGCGCGGCGA CCGCGTCATA CTGAAGAAGA ACCCGAATTT CTGGCAGGCG GACCGGGTGA AGCTCGACGG TGTCGAGTGG ATCTCGGTGC CTGACGACAA TACCCGCATG CTGAACGTTC AGGCCGGCGA GCTGGATACG GCGATCTTCG TGCCCTTCTC CCGCGTCGAG GAGCTGAAGA AGGACCCGAA CCTCAACGTC GATATCGACG CTTCGACCCG TGAGGACCAT CTTCTGATCA ACCATGCGCA TGGCGCGCTC GGCAAGAAGG AAGTCCGCCA GGCGCTTGAT CTCGCGATCG ACAAGAAGGC GATCGTCGAT ACCGTCACCT TCGGCCAGGG AACGGTCGCC AACTCCTATA TTCCAAAGGG CGCTCTCTAT TATTATGCCG ACAATCTGCA GCGGCCTTAT GATCCCGGAA AGGCAAAGGA GCTGTTGGCG GCCGCCGGCG CCTCGGACCT GACGCTGAAT TATCTGGTCC GTGCCGGCGA CGAAGTCGAC GAACAGACGG CCGTTCTGGT CCAGCAGCAG TTGCAGAAGG CCGGCATCAC CGCCAATCTG CAGAAGGTCG ACCCGAGCCA GGAATGGGAC ATGATCGTTG CCGGCGACTA CGACGTTTCG GTCAACTACT GGACAAACGA CATTCTCGAT CCGGACCAGA AGACCACCTT CGTCCTCGGC CACGATTCCA ACAACAATTA TTCGACCAAT TACAAGAGCG AGGCAGTGAA GGAACTGGTC GCCAAGGCGC GTCTCGAACT CGACCCGAAG AAGCGCGAGC AGATGTATGT CGATCTGCAG AAGATGGCCA AGGACGACGT CAACTGGATC GACCTCTATT ACAGCCCCTA TATCAACGTC TCGCGCAAGA ATATCGAGAA CTTCTATCAG AACCCGCTCG GCCGCTTCTT CCTGGAAGAC ACGGTCAAGA ACTGA
|
Protein sequence | MTNRWKSIGL AALLAGLTLS ASYAEAAGVL TIGRREDSTT FDPIKTAQNI DNWVFSNVYD VLIRVDKTGT KLEPGLAESW TASDDGLTYT FKIRDAKFSD GSPLTAEDAA YSLLRIRDDA ASLWSDSYKV IDTAVATDAH TLTIKLKNPS APFLSTLALP NASVISKKGM ESLGADAYGE KPIASGAFTV EEWRRGDRVI LKKNPNFWQA DRVKLDGVEW ISVPDDNTRM LNVQAGELDT AIFVPFSRVE ELKKDPNLNV DIDASTREDH LLINHAHGAL GKKEVRQALD LAIDKKAIVD TVTFGQGTVA NSYIPKGALY YYADNLQRPY DPGKAKELLA AAGASDLTLN YLVRAGDEVD EQTAVLVQQQ LQKAGITANL QKVDPSQEWD MIVAGDYDVS VNYWTNDILD PDQKTTFVLG HDSNNNYSTN YKSEAVKELV AKARLELDPK KREQMYVDLQ KMAKDDVNWI DLYYSPYINV SRKNIENFYQ NPLGRFFLED TVKN
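The sequences above are printed in space-separated blocks of 10 characters, and the gene sequence is given in coding orientation (it begins with ATG and ends with TGA, although the gene lies on the minus strand). The sketch below shows one way to strip the block spacing, compute a GC fraction, and translate codons. It is an illustration only: the helper names are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares. Alternative start codons permitted by table 11 are not handled, which does not matter here because the gene starts with ATG.

```python
# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = "ATGACAAACA GGTGGAAATC AATTGGGCTT"
print(translate(prefix))   # MTNRWKSIGL
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.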