Gene Information |
Locus tag | Rleg_5173 |
Symbol | |
ID | 8007069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 578724 |
End bp | 580310 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644822083 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002973343 |
Protein GI | 241113508 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.12226 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.127243 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAGAC AAGTACTAAC AACGATCGCA ATTACCGGCG CCATGATGAC GGCGCAGCCC AGCTTCGCGG CGTCGCCGCC CAATATGCTG GTGATCGGCA CCAATCTCAC CGGCATAAGG ACGCTCGATC CGGCGCAGAA CAATGCCCGC ACGGTTTCCG AACTGATCTC GAATATTTAC GACAACCTCG TGCAGCTGTC GCCGGACGAC CTTAAAACGC TGAAGCCGAT GCTCGCGAAG CAATGGAGTG TCTCGGCGGA TGGCAAGATC ATCACACTGA CATTACGCGA CGACGCGGTC TTCCAGAGCG GCAACAAGGT CACCGCCGAG GATGCGGCCT GGTCGATCCA GCGCGTCATC AAGATGGGCC AGGTCGGCTC CACCGACATC GCGCTCTGGG GCTTCACGCC TGAAAACGTC GAAAAGCTCG TTCGCGCAAA AGACGAACAC ACGCTCGAGA TCGAGCTGCC GCAGGCGGTC AATACCGATC TGGTGCTCTA TTCGCTGGCG GGCTCGTCGA TCGGCATCGT CGACAAGAAG ACGGTACTGT CGCACGAGGC AAACAGCGAT TTTGGCGGCG CCTGGCTTTC CGCCAATTCC GCCGGCAGCG GCCCATTCAG CCTGGCGCAG TGGCGGCCGA ACGATGTCGC GATCTTCAAT GCCCAGCCGA AATATTGGGG CGGCAAGCCG GCCATGGCCC GTGTCGTTGC GCGTCACATC CCGGAATCCG GCAATCTCCG GCTTCAGCTC GAAGCCGGCG ACGTCGATGT CGGCCAATAT GTTTCAAGCG GCGACCTCGA TGCGCTCGCC ACCAAGAAGG ACATGGTCAT CGAGAATGTC CCGGGTCTCG GCTTCTACTA TATCGCCCTC AATCAGAAAG ACCCGGATCT GCAGAAGCCA AAGGTTCGCG AGGCCTTCCA GCATGCCTTC GACTGGAAAG CGATCTCCGG CAACATCATG CGCTATACGG GCTTTCCCTG GCAGTCGATG ATCCCGCGCG GCATGATCGG CGCTCCCGGT GAGGCGGCGG TCCGCTACGA TTACGATCCC GCCAAGGCCA AGCAGTTGCT GGCGGAAGCC GGATATCCCA ATGGCCTGAA GAAGGTGCTC AATCCGTCGG GAGCCGCGAC CCTGCCCTTC GCCGAAGCGC TGCAGGCGAG TGCGCGCGCC GCCGGCCTTG ATCTCGATCT CGTGCCCGGC GAGTTCACGC CCGCCTTCCG CGAGCGCAAA TTCGAAGTGC TGCTCGGCAA TTCCGGCGCC CGCCTGCCCG ATCCCTTCGC GGTCGCCACG CAATATGCCT TCAACCCCGA CAATAGCGAC GAGGCACGCC TCGGCAGCTA TTACCTCTGG CGCACGGGCA TGAAGGTGGA CGAGCTCAAC ACGCTCATCG ACCAATCGAT GAAGGAGCGC GACACGGAAA AGCGCACAGA CATCTTCAAG AAGATGGATG GCATCTATGC CGGCATGGCT TCGCCGCTGG TCATCTTCTT CCAGCGAACC GACCCCTATG TCATGCGCGC CAACGTCAAG GGCTATCACG GGCATACGAC ATGGTCGACG CGCTGGCATG ACGTGACCAA GGAGTAG
|
Protein sequence | MLRQVLTTIA ITGAMMTAQP SFAASPPNML VIGTNLTGIR TLDPAQNNAR TVSELISNIY DNLVQLSPDD LKTLKPMLAK QWSVSADGKI ITLTLRDDAV FQSGNKVTAE DAAWSIQRVI KMGQVGSTDI ALWGFTPENV EKLVRAKDEH TLEIELPQAV NTDLVLYSLA GSSIGIVDKK TVLSHEANSD FGGAWLSANS AGSGPFSLAQ WRPNDVAIFN AQPKYWGGKP AMARVVARHI PESGNLRLQL EAGDVDVGQY VSSGDLDALA TKKDMVIENV PGLGFYYIAL NQKDPDLQKP KVREAFQHAF DWKAISGNIM RYTGFPWQSM IPRGMIGAPG EAAVRYDYDP AKAKQLLAEA GYPNGLKKVL NPSGAATLPF AEALQASARA AGLDLDLVPG EFTPAFRERK FEVLLGNSGA RLPDPFAVAT QYAFNPDNSD EARLGSYYLW RTGMKVDELN TLIDQSMKER DTEKRTDIFK KMDGIYAGMA SPLVIFFQRT DPYVMRANVK GYHGHTTWST RWHDVTKE
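
The length fields in this record can be cross-checked against the coordinates with a small sketch like the one below. It assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon (the gene sequence above ends in TAG). The function names `span_length`, `expected_protein_length`, and `ungroup` are hypothetical helpers introduced for illustration; they are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def ungroup(sequence: str) -> str:
    """Remove the spaces that split the displayed sequence into 10-character blocks."""
    return "".join(sequence.split())


# Values taken from the record: Start bp 578724, End bp 580310.
gene_length = span_length(578724, 580310)
print(gene_length)                           # 1587, matches Gene Length
print(expected_protein_length(gene_length))  # 528, matches Protein Length
print(ungroup("ATGCTGAGAC AAGTACTAAC"))      # ATGCTGAGACAAGTACTAAC
```

Under these assumptions the coordinates, the gene length (1587 bp), and the protein length (528 aa) in the record agree with one another.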