Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1878 |
Symbol | |
ID | 8012930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 1862826 |
End bp | 1864421 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644824467 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002975699 |
Protein GI | 241204603 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.558624 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.335683 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCATT TTTCTAAGAG CTTGTTCGTC GGTGCCGTCA TCGGCGCCCT GACGATATCG GCAGCGCAGC TGCAGGCCGC CACGCCGCAG GACCAGCTGG TGATTGGCAC TTCGCTGGCG CAGGTTTTGT CGCTCGATCC GCAGCAGGCG ACCGAAGGCA AGGCGGTCGA AATCATGTCG AATCTGTACG ATCGGCTGGT TGCCAGCACG GCTGATGGCA AGATCCTTCC GCAACTGGCG GAAAGCTGGA AGATTGACGA CAAGGGCATC ACCTTTACGC TGCGCAAGGC CAATTTCGCC TCCGGTAACC CGGTCACCTC GAAGGATGTC GTCTATTCAC TGGCGCGGCT CCTGAAAATG GACCAGGCCG CCGCCGCTAA CCTCAAGCGC GTCGGCTACG ACAAGGACAA TGTCGAAAAG CTCGTCAAGG CCGTCGACGA TCAGACGGTG CGGATCGATC TCTCCGACCA GGTGACGGCA GAGCTTCTGC TCTATCGGCT GACAACGACG ACGACCAGCG TGGTCGACAG CGTCGAAGTC GAGAGCCACG CCGTCGATAA CGACTACGGA AACGCGTGGA TGCGAACGCA TTCTGCCGGC TCCGGTCCGT TTACCCTCAA TCGCTGGTCT CCGAACGAAC TGGTGATCCT CGACGCCAAC AAGGACTATA TGGCAGGCAC GCCGAAGATG CGTCGCGTCA TCGTCCGGCA TGTGCCTGAA AGCCAGGTCG AGCGGCTGAT GCTTGAACGC GGCGATATCG ATATTGCCAG CGCCTTGACC GCATCGGATC TCGCGACGTT CCAGACCAAG AAAGGCTTTG CCATCCAGCG TATTCCGACG GGCGGTTTCT ACGTGCTGTC GATGAATGCC GGCAACAAAT ACCTCGCCAA TCCGAAGGTT CGCGAAGCCA TCGCCTATGG CATCGACTAC AAGGGCATCG AAAAGACGAT CATGGGCCCT TACGGCCGGG CGAGAAACGT TCCCGTTCCG GAGAATTTCG AATATGCCAT CCCGAACCCC GATTGGCATC TCGACGTCGA AAAGTCGAAA CAGCTGCTGA GCGAGGCAGG CTTCAAGGAC GGCTTTTCGC TGACGCTGAA GACCATCGCG CAAACGCCGC GCATCGATCT TGCCACCGCC ATCCAGGCAT CGCTTGCTCA AGTTGGCATC AAGATCGACA TCCAGCAGGG CAACGGCTCG GAAATCATCG CCGCCCATCG CGCCAGGGAT TTCGATCTGC TGATCCCGCA GACCAGCGCC TATATGCCGA ACGTGCTCGG CTCGATGGAG CAGTTTTCCT CCAATCCCGA CAATTCGAAG GAAGCCAACA ATGCCGGCAA TTTCGTCTGG CGCTCGGCCT GGGATATTCC GGAACTCACG GCGCTGACGG CGAAAGCATC GATGGAGCCG GACGCCAAGA AGCGTGGCGA ACTCTATGTT CAGATGCAGA AGATGTTCGT CGAACAGAAG CCGGCGGTGC TTCCGCTCTT CGAGCGCTTT GAGCCGATCG TCCTCAATAG CAAGGTCGAG GGATATGTGG GGCATCCGTC TCAGCTGACG CGTCTCGAGA ACGTCACCAA GGTCGAAACC CAGTAA
|
Protein sequence | MKHFSKSLFV GAVIGALTIS AAQLQAATPQ DQLVIGTSLA QVLSLDPQQA TEGKAVEIMS NLYDRLVAST ADGKILPQLA ESWKIDDKGI TFTLRKANFA SGNPVTSKDV VYSLARLLKM DQAAAANLKR VGYDKDNVEK LVKAVDDQTV RIDLSDQVTA ELLLYRLTTT TTSVVDSVEV ESHAVDNDYG NAWMRTHSAG SGPFTLNRWS PNELVILDAN KDYMAGTPKM RRVIVRHVPE SQVERLMLER GDIDIASALT ASDLATFQTK KGFAIQRIPT GGFYVLSMNA GNKYLANPKV REAIAYGIDY KGIEKTIMGP YGRARNVPVP ENFEYAIPNP DWHLDVEKSK QLLSEAGFKD GFSLTLKTIA QTPRIDLATA IQASLAQVGI KIDIQQGNGS EIIAAHRARD FDLLIPQTSA YMPNVLGSME QFSSNPDNSK EANNAGNFVW RSAWDIPELT ALTAKASMEP DAKKRGELYV QMQKMFVEQK PAVLPLFERF EPIVLNSKVE GYVGHPSQLT RLENVTKVET Q
|
| |