Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1692 |
Symbol | |
ID | 6980429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1722237 |
End bp | 1723832 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643396416 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002281206 |
Protein GI | 209549289 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.122173 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCATT TTTCTAAAGG CTTGTTCGTC GGCGCCGTTT TCGGCGCCCT GACAATCGCC GCGCCGCAGC TGCAGGCCGC CACGCCGCAG GATCAATTGG TGATCGGGAC ATCGCTGGCC CAGGTTCTGT CCCTCGATCC GCAGCAGGCG ACCGAAGGCA AGGCCGTCGA GATCATGTCC AATCTTTACG ACCGGCTGGT CGCCAGCACG GCTGATGGTA AGATCCTTCC GCAGCTGGCG GAAAGCTGGA AGGTTGACGA CAAGGGCATC ACCTTCACGC TGCGCAAGGC CAATTTCGCC TCCGGCAATC CGGTGACTTC GAAGGACGTC GTCTATTCGC TGGCGCGGCT CTTGAAGATG GATCAGGCCG CCGCCGCCAA CCTCAAGCGC GTCGGCTACG ATAAGAACAA TGTCGATAAG CTCGTCAAGG CGGTCGACGA CCAGACGGTG CGCATCGATC TTTCCGACCA GGTGACGGCA GAGCTTCTGC TCTACCGGCT GACGACGACC ACCACCAGCG TGGTCGACAG CGTCGAGGTC GAAAGCCACG CCGTCGACAA TGACTACGGC AACGCCTGGA TGCGCACGCA TTCGGCCGGC TCCGGCCCGT TCACCCTCAA TCGCTGGTCT CCGAACGAAT TGGTCATTCT CGACGCCAAC AAGAATTATA TGACCGGCGC GCCGAAGATG AAGCGCGTCA TCGTTCGCCA TGTGCCGGAA AGCCAGGTCG AGCGGCTGAT GCTGGAGCGC GGCGATATCG ATATTGCCAG CGCCCTGACA GCCTCCGATC TCGCGACATT CCAGGCCAAG CAGGGCTTTG CCATCCAGCG CATTCCGACC GGCGGCTTCT ACGTGCTGTC GATGAATGCC GGCAACCAGT ACCTTTCCAA TCCCAAGGTT CGGGAAGCGA TTGCCTATGG CATCGATTAC AAGGGCATCG AAAAGACGAT CATGGGACCT TACGGACGGG CAAGAACCGT TCCCGTTCCG GAGAACTTCG AATATGCGAT CCCAAGCCCG GATTGGCAGC TCAACGTTGA AAAGTCCAAG CAGCTGCTGA GCGAGGCAGG CTTCAAGGAC GGCTTCTCGC TGACGCTGAA GACCATTGCG CAAACGCCGC GCATCGATCT TGCCACCGCC ATCCAGGCAT CGCTTGCCCA GGTCGGCATC AAGATCGACA TCCAGCAGGG CAACGGTTCG GAAATCATCG CCGCCCATCG CGCCCGGGAT TTCGATCTGC TGATCCCGCA GACCAGCGCC TATATGCCGA ATGTGCTCGG CTCGATGGAG CAGTTTTCCT CCAATCCGGA CAACTCGAAA GAGGCCAACA ATGCCGGCAA TTTCGTCTGG CGCTCGGCTT GGGACATTCC AGAGCTGACA GCCCTGACCG CAAAAGCATC GATGGAGCCG GACGCCAAGA AGCGCGGCGA ACTCTACGTT CAGATGCAGA AGATGTTCGT CGAACAGAAG CCGGCCGTGC TGCCGATGTT CGAGCGCTTT GAGCCGATCG TCCTCACCGG CAGGGTCCAG GGATATGTCG GACATCCGTC GCAAATGACG CGTCTCGAGA ACGTGACCAA GGTCGAAACC CAGTAA
|
Protein sequence | MKHFSKGLFV GAVFGALTIA APQLQAATPQ DQLVIGTSLA QVLSLDPQQA TEGKAVEIMS NLYDRLVAST ADGKILPQLA ESWKVDDKGI TFTLRKANFA SGNPVTSKDV VYSLARLLKM DQAAAANLKR VGYDKNNVDK LVKAVDDQTV RIDLSDQVTA ELLLYRLTTT TTSVVDSVEV ESHAVDNDYG NAWMRTHSAG SGPFTLNRWS PNELVILDAN KNYMTGAPKM KRVIVRHVPE SQVERLMLER GDIDIASALT ASDLATFQAK QGFAIQRIPT GGFYVLSMNA GNQYLSNPKV REAIAYGIDY KGIEKTIMGP YGRARTVPVP ENFEYAIPSP DWQLNVEKSK QLLSEAGFKD GFSLTLKTIA QTPRIDLATA IQASLAQVGI KIDIQQGNGS EIIAAHRARD FDLLIPQTSA YMPNVLGSME QFSSNPDNSK EANNAGNFVW RSAWDIPELT ALTAKASMEP DAKKRGELYV QMQKMFVEQK PAVLPMFERF EPIVLTGRVQ GYVGHPSQMT RLENVTKVET Q
|
| |