Gene Information |
Locus tag | Rleg_3373 |
Symbol | |
ID | 8014253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 3385184 |
End bp | 3387109 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644825932 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002977159 |
Protein GI | 241206063 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
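
The coordinate and length fields above are linked by simple arithmetic. A minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon (the variable names are illustrative and not part of the record):

```python
# Values copied from the Gene Information fields above.
start_bp = 3385184
end_bp = 3387109

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1926, matching "Gene Length | 1926 bp"
print(protein_length)  # 641, matching "Protein Length | 641 aa"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields, and the gene length is an exact multiple of three, as expected for an unspliced CDS.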
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0279191 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.223354 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAAGC TGAAACGACG TGCGTTTCTG GCCGGGACGT CGGCCGCCCT GATATTGCCG GCCCTGCCGG TTTTCGCCGC TGACTTCAAG GAAGCGGATA TTCTGAAAAC GAAGGTGTCG AGCGGCGCCC TGCCGGGTCT GAAGGACCGG CTTCCTGAAA ATCCGCTCGT CGTCAAACCA GTCGAAAGCG TCGGCAAATA CGGCGGCGAC TGGAACATGG CGCTGGTCGG CGGCGGGTCG CTGTCGATGC TGTTCCGCTA CCAGGCCTAC GAGCCGCTGC TGCGCTATAC GCCCGACTGG TCTGGCGTGA CGTTGAACGT CGCCGAGTCC TTCGAGGGCG ATGCCGACTC CAAGGTCTAT ACGATCCGCC TGCGCAAGGG CATGAAATGG TCGGACGGCC ATCCCTACAC CACCGCCGAC ATCAAGTTCT GGTACGACAC CGTTTTCACC GACAAGCGCG TCGCCTTCGT CGGTCAGGAT CATTGGAAGT CTGGCGGCAA GCCGGCCAAG CTGGAGATCG TCGACGAACA GACCTTCAAG GTCATCTTCG ACAAGCCGAA CGGCCTGTTC CCGCTGCAGG TCGCCTGGGC GAACAACGAC CAGACGACGC GCACGCCGAA GCATTATCTG GAGCAGTTCC ACATCGACCA CAATCCGAAG GCCGATGAAC TCGCCAAGCA ACGCGGCTTT GAAAGCTGGA TCGCCTCCTT CCAGGCGGCC GCCGGTTTCC AGGACGACAA CGCCTTCTTC CTCAACTCCT CGAAGAAGCC CTGCGTGCAT GCCTGGATGT TCACGATAGC GCCCGGCGAA AACACCGAAC GCGCCGTTGC CGAGCGCAAT CCCTACTATT GGAAAGTCGA CACCGAGGGC AACCAGCTGC CCTATATGGA CCGTATCGTC TACCAGATGG TCGCCGACCC GCAGGTTCTA TTGCTGAAGG CCATGCAGGG CGAAGTCGAC CTGATGGACC AGTATATCGC CACACCGAAC AACAAGTCGG TTCTCTACGA TGCGCGTGAG CAGGGCGGCT ATGATTTCTA CACACTGACC TCGACCGAAG CCAATGTGAT GAATTTCATC TTCAACCTGA ACCACAATGA CGAGACCAAG CGGAAGCTCT TCCGGAACAA GGATTTCCGT GCGGCACTCT CGACAGCACT CGACCGGCAG TCGCTGATCG ATGCCGTGCT CGTTGGTCAG GGGGCGCCCG CCCAGCCGTC GATCAAGAAG GAGGATCCGC TTTACAACGA GCAGCTCGCC ACGCAGTTCA CGGCCTATGA CGTCGACAAG GCGAATGCCA TGCTCGACCA GATCGTGTCG AAGCGCGACG ACCAGAACTT CCGCCTCGAC GAAAAGGGCC GCCGCCTGAC GATCATCTTC GAGATCGACC AGGCGCGCGC CGTCTTCCTC GATCTCTTCC AGCTGGTGAT CCCGATGTTC CAGGCGGTCG GCATCGATGC ACAGATGCGC TCGATGGACC GTTCGCTCTG GGAAACCCGC GTCCGTCAGG GCCGTGATTT CGATGCGACT GCCCACCAGT TCGGCGCAAA CGGCGGCGTC GCCGCCATGC TCGACCCGCG CTATTACGTG CCGACCGATG CCAACGCCAT GTATGCCCCC GCTTGGCAGC TCTGGTATCG CGACCGCGCC AATGCCAATG CTGAGGAACC GCCGGAAAGC ACGAAGAACC AGCTTGCGCT CTACGACAAG CTGAAGGCGA CCTCCGACGC ATCAGGCCAG CGCGAGGTCA TGAAGCAGAT CCTGCAAGGC GCCGCCGACA ATTTCTATGT CTTCGGCATC 
TCGCTGCCGC CGGACGGATA CGGCGTCGTC AAGAACAACA TGAAAAACGT CACGAAGATC ATGCCGAACT CTTTCGGCTG GCCGACGCCC GCTCCGACCA TGCCGGAGCA GTTCTACAAG GCCTGA
|
Protein sequence | MIKLKRRAFL AGTSAALILP ALPVFAADFK EADILKTKVS SGALPGLKDR LPENPLVVKP VESVGKYGGD WNMALVGGGS LSMLFRYQAY EPLLRYTPDW SGVTLNVAES FEGDADSKVY TIRLRKGMKW SDGHPYTTAD IKFWYDTVFT DKRVAFVGQD HWKSGGKPAK LEIVDEQTFK VIFDKPNGLF PLQVAWANND QTTRTPKHYL EQFHIDHNPK ADELAKQRGF ESWIASFQAA AGFQDDNAFF LNSSKKPCVH AWMFTIAPGE NTERAVAERN PYYWKVDTEG NQLPYMDRIV YQMVADPQVL LLKAMQGEVD LMDQYIATPN NKSVLYDARE QGGYDFYTLT STEANVMNFI FNLNHNDETK RKLFRNKDFR AALSTALDRQ SLIDAVLVGQ GAPAQPSIKK EDPLYNEQLA TQFTAYDVDK ANAMLDQIVS KRDDQNFRLD EKGRRLTIIF EIDQARAVFL DLFQLVIPMF QAVGIDAQMR SMDRSLWETR VRQGRDFDAT AHQFGANGGV AAMLDPRYYV PTDANAMYAP AWQLWYRDRA NANAEEPPES TKNQLALYDK LKATSDASGQ REVMKQILQG AADNFYVFGI SLPPDGYGVV KNNMKNVTKI MPNSFGWPTP APTMPEQFYK A
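
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a rough sketch, not the database's own pipeline: `translate` is a hypothetical helper, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ only in which codons may act as start codons). Only the first 30 and last 18 bases of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; "*" marks stop codons.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (hypothetical helper), stopping at the first stop codon.

    Whitespace is removed first, so the 10-base groups and line breaks
    used in the record above are accepted as-is.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence listed above.
head = translate("ATGATCAAGC TGAAACGACG TGCGTTTCTG")
tail = translate("CAGTTCTACAAG GCCTGA")

print(head)  # MIKLKRRAFL
print(tail)  # QFYKA
```

Both fragments match the start (`MIKLKRRAFL`) and end (`QFYK A`) of the listed protein sequence. Passing the full gene sequence to the same function should give the complete protein, since the start codon here is ATG and needs no special handling.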