Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_6371 |
Symbol | |
ID | 6983445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011371 |
Strand | - |
Start bp | 14374 |
End bp | 15888 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643399371 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002284127 |
Protein GI | 209552212 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.37487 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAATG GGTGGAAATC AATAGGGCTC GCAGCCTTGC TCGCCGGCCT GACGCTCAGC GTAAGCTATG CCGAGGCCGC CGGCGTGCTC ACCATCGGCC GCCGCGAGGA TTCGACGACG TTCGATCCGA TCAAAACAGC CCAGAACATC GACAACTGGG TATTCTCAAA CGTCTACGAC GTGCTGATCC GCGTCGACAA GACAGGCACC AAGCTGGAGC CGGGCCTTGC CGAAAGCTGG GCCGCCTCGG ATGACGGGTT GACCTATACG CTCAAGATCC GTGACGCGAA ATTCTCGGAC GGTTCGCCGC TGACGGCGGA GGACGCCGCC TACAGCCTGC TGCGCATCCG CGACGATGCC GCCTCGCTGT GGAGCGATTC CTACAAGGTG ATCGACACGG CGGTCGCCAC CGACGCGCAT ACGCTGACGA TCAAGCTGAA GAACCCGTCC GCACCGTTCC TGTCGACGCT GGCGCTGCCG AATGCCTCCG TCATCTCCAA GAAGGGCATG GAATCGCTGG GCTCCGACGC TTATGGCGAA AAGCCGATCG CATCCGGCGC GTTCACCGTC GAGGAGTGGC GGCGCGGCGA CCGGGTCATT TTGAAGAAGA ACCCGAATTT CTGGCAGGCC GACCGCGTTA AGCTCGACGT CGTCGAGTGG ATCTCGGTGC CCGACGACAA TACCCGCATG CTGAACGTCC AGGCCGGCGA ACTGGATGCG GCGATCTTCG TGCCCTTTTC CCGCGTCGAG GAGCTGAAGA AGGACCCGAA CCTCAACGTC GATATCGACG CGTCGACCCG TGAGGATCAT CTGCTGATCA ACCATGCGCA TGGTGCGCTC GGCAAGAAGG AAGTCCGCCA GGCGCTGGAT CTGGCGATCG ACAAGAAGGC GATCGTCGAT ACCGTCACCT TCGGCCAGGG CACGGTCGCC AATTCCTATA TTCCGAAGGG CGCCCTCTAT TATTACGCCG ACAATCTGCA GCGGCCCTAC GATCCCGCGA AGGCCAAGGA GATGCTGGCC GCGGCCGGCG CTTCCGACCT GACGCTGAAT TACCTGGTGC GCGCTGGCGA CGAAGTCGAC GAACAGACGG CGGTGCTGGT CCAGCAGCAG CTGCAGAAGG CCGGCATCAC CGCCAATCTG CAGAAGGTCG ATCCGAGCCA GGAATGGGAC ATGATCGTCG CCGGCGACTA TGACGTCTCG GTCAACTACT GGACTAACGA CATTCTCGAT CCGGACCAGA AGACCACCTT CGTGCTCGGC CACGATTCCA ACAACAACTA TGCGACCAAC TACAAGAACG AGGCCGTGAA GGAACTGGTC GCCAAGGCGC GCCTCGAGCT CGACCCGAAG AAGCGCGAAG CGATGTATGT CGATCTGCAG AAGATGGCCA AGGACGACGT CAACTGGATC GACCTCTATT ACAGCCCCTA TATCAACGTC ACGCGCAAGA ATATCGAGAA CTTCTACCAG AACCCGCTCG GCCGCTTCTT CCTGGAAGAC ACGGTGAAGA ACTAA
|
Protein sequence | MTNGWKSIGL AALLAGLTLS VSYAEAAGVL TIGRREDSTT FDPIKTAQNI DNWVFSNVYD VLIRVDKTGT KLEPGLAESW AASDDGLTYT LKIRDAKFSD GSPLTAEDAA YSLLRIRDDA ASLWSDSYKV IDTAVATDAH TLTIKLKNPS APFLSTLALP NASVISKKGM ESLGSDAYGE KPIASGAFTV EEWRRGDRVI LKKNPNFWQA DRVKLDVVEW ISVPDDNTRM LNVQAGELDA AIFVPFSRVE ELKKDPNLNV DIDASTREDH LLINHAHGAL GKKEVRQALD LAIDKKAIVD TVTFGQGTVA NSYIPKGALY YYADNLQRPY DPAKAKEMLA AAGASDLTLN YLVRAGDEVD EQTAVLVQQQ LQKAGITANL QKVDPSQEWD MIVAGDYDVS VNYWTNDILD PDQKTTFVLG HDSNNNYATN YKNEAVKELV AKARLELDPK KREAMYVDLQ KMAKDDVNWI DLYYSPYINV TRKNIENFYQ NPLGRFFLED TVKN
|
| |