Gene Information |
Locus tag | Rleg_5692 |
Symbol | |
ID | 8016655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012853 |
Strand | - |
Start bp | 275350 |
End bp | 277017 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644827845 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002979045 |
Protein GI | 241518417 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
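The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch: the variable names are illustrative, and it assumes the start and end coordinates are 1-based and inclusive, which the record does not state but which matches the reported gene length.

```python
# Values copied from the record above.
start_bp = 275350
end_bp = 277017

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is the stop codon,
# which is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1668
print(remainder)       # 0
print(protein_length)  # 555
```

The results agree with the "Gene Length" (1668 bp) and "Protein Length" (555 aa) fields.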
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.142163 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACAAGT CTGAGCATTC CTATATCCCG ACTCTGGTCG AACAGATGTC GAGCGGCTCG ATCGGCCGCC GAGAATTTCT GCGAAAAGCG ACGCTTCTTG GTATTTCGGC CGCAGCCGCA TATTCTCTGG CCGGCGTTCC GGTGCCAGGC GGCGCCCGGG CCGACGATAT GCCGAAGGGC GGAAACCTGC GCATCGGCAT GCGTTGCATG GAAATCAAGG ATCCGCATCT GGCCGATTTC GCCGAAAAAT CGAACGTCAT CCGCCAAGTC TGCGAGTATC TGACCCTCAC CGATCGCCAC AACATTACCC ATCCTTATCT GCTGGAAAAG TGGGAAGTCA GCGACGACCT GAAGACGTGG ACGCTCCATC TCCGCACGGA CGTCAAGTGG CGCAAGGGCA GGCCATTGAC CGCAGACGAC GTGATCTGGA ACCTGAAGCG CGTCTGCGAT CCAGCAATCG GTTCGTCGAT GCTCGGGCTC TTCACCGGTT ATCTCGTGCA GGAATACGAA ACCGGCGAAA AGGACGAGAA AGGCAATCCC AAGAAGTCGA GCAAGCTCTG GGCCGACAAC GCCATCGAGA AAGTCAACGA CCACACCGTC CGTCTCAACT GCTCCTCGGC GCAGATTGCC GTGCCCGAGC ACCTCTATCA CTACCCGATG TTCATCATCG ATCCCGAAGA AAATGGCGCT TTCGGTCCTG ACGCGAACGG CACGGGTCCC TTCGTTATCA CCGAATACGT CGTCGGCAAG GGCGCCAAGT ACAAAGCCCG CACCGACTAT TGGGGCACCG GGCCTTATCT CGATACGTTC GAATATGTCG ACCTCGGCGA CAATCCGGGG GCCGGTATCG CGGCCATAGC TTCCAAGCAG GTCGATGGCC TGTCGGAAGC CGACGCGGTC CAGATCAATG CGATGAAGAA TTTCCCGCAT GTGGCGGTCC ACCAGGTCGA GACGACCCAG ACGGTCGTTG CCCGCATGCA CCCCGATATC GAGCAGTTCA AGGACAAGCG CGTGCGCCAG GCGATGCGCT ATTCGATCGA CCGCGACAAG GTCATTCAGA CGGCACTCCT CGGCGCCGGC ATTCCCGCCG AAGACCATCA CGTCGCGCCC TCGCATCCCG AATACGCGGC ACTGCCGAAA TATCCGCGCG ACATCGAAAA AGCGAAGAAG CTTCTGGCAG ATGCCGGCTA TCCGGATGGT TTCGAATTCG ACATGGTCAC ACGTCCCGAT CCGATCTGGG AACTGAACAC GGCGCAGGTC CTTGCCGAGC AGTTCAAGGA CATCGGCGTA AAGATCAACA TCAAGTCCCT GCCCAGCGCC CAATACTGGG AAGTCTGGAC AACGGCGCCA TTCAGCCTGA CTGCCTGGGG TCACCGGCCG CTGGCGATCA TGACATTGTC GCTTGCCTAT CGTTCGAATG CCGCCTGGAA CGAGTCCAAT TATTCCAACG CGGATTTCGA CAAGCTGCTG ACCGAAGCCG AGGGCATCCT CGATCCGAAG CAACGCAGCA AGGTCATGGC GAAGATCGAG GCGATCATGC AGGATGACGG GCCCATCGTT CAGCCCTTCT GGCGCGTCTT CTCGACCGTC ATGGACAAGA AGGTCAAGGG CTTCGAGCTC CATCCTTCTC AATACATCTT CGCGCATCAA TACGCGATCT CGGCGTAA
|
Protein sequence | MDKSEHSYIP TLVEQMSSGS IGRREFLRKA TLLGISAAAA YSLAGVPVPG GARADDMPKG GNLRIGMRCM EIKDPHLADF AEKSNVIRQV CEYLTLTDRH NITHPYLLEK WEVSDDLKTW TLHLRTDVKW RKGRPLTADD VIWNLKRVCD PAIGSSMLGL FTGYLVQEYE TGEKDEKGNP KKSSKLWADN AIEKVNDHTV RLNCSSAQIA VPEHLYHYPM FIIDPEENGA FGPDANGTGP FVITEYVVGK GAKYKARTDY WGTGPYLDTF EYVDLGDNPG AGIAAIASKQ VDGLSEADAV QINAMKNFPH VAVHQVETTQ TVVARMHPDI EQFKDKRVRQ AMRYSIDRDK VIQTALLGAG IPAEDHHVAP SHPEYAALPK YPRDIEKAKK LLADAGYPDG FEFDMVTRPD PIWELNTAQV LAEQFKDIGV KINIKSLPSA QYWEVWTTAP FSLTAWGHRP LAIMTLSLAY RSNAAWNESN YSNADFDKLL TEAEGILDPK QRSKVMAKIE AIMQDDGPIV QPFWRVFSTV MDKKVKGFEL HPSQYIFAHQ YAISA
|
| |
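The gene and protein sequences above can be cross-checked by translating the nucleotide sequence with translation table 11 (the bacterial, archaeal and plant plastid code). The following is a minimal sketch rather than the database's own method: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the start-codon set is the commonly listed one for table 11, and only the first three 10-base blocks of the gene sequence are used to keep the example short. Note that the gene starts with GTG, which is read as methionine when it acts as a start codon.

```python
from itertools import product

# Codon table shared by the standard code and table 11, in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: alternative start codons commonly listed for table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # a start codon is translated as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
fragment = "GTGGACAAGT CTGAGCATTC CTATATCCCG"
print(translate(fragment))    # MDKSEHSYIP
print(gc_fraction(fragment))  # 0.5
```

The translated fragment matches the first block of the protein sequence (MDKSEHSYIP). Passing the full gene sequence to the same functions would allow the complete protein and the reported GC content to be checked in the same way.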