Gene Information |
Locus tag | Rleg2_4497 |
Symbol | |
ID | 6977591 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 131924 |
End bp | 133513 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643393675 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002278493 |
Protein GI | 209546575 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
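The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information table above.
START_BP = 131924
END_BP = 133513
STOP_CODON_LENGTH = 3  # assumption: the CDS ends with one stop codon


def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS that ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (cds_length_bp - STOP_CODON_LENGTH) // 3


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1590, matches Gene Length
print(protein_length(length_bp))  # 529, matches Protein Length
```

Because the gene is listed as not spliced, the whole span from start to end is coding sequence, which is why the protein length follows directly from the gene length.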
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.122706 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGACT ATAGCAAATA TCTCGCAGGC CGCGTCACCG CGGGCGGTCT CAGCCGCCGC GAATTCATGG GACGCGCCGT GGCGGCGGGC ATCACACTTG CCGTCGCCGA CAAGCTCTTC ACCGAAAGTG CGCAAGCCGC CGAACCGAAG CGTGGCGGTC ACTTGAAACT CGGCCTCGAA GGCGGTGCAG CCACCGATTC CAAGGACCCG GCAAAATTCC TGTCGCAGTT CATGTTCTGC GTCGGCCGCT GCTGGGGCGA CATGCTTGTT GAATCCGACC CGCTGACCGG TGCCGCCGTG CCGGCGCTCG CCGAGTCCTG GGAAGCCTCG AAAGACGCGG TCACCTGGAC CTTCAAGATC CGCAAGGGCG TCAAGTTCCA CGACGGCAAG GAAATGACGA TCGACGACGT TGTCGCGACG CTAAAGCGCC ACACCGATAA AAAGTCGGAG TCCGGCGCGC TCGGCGTTCT CGGCTCGATC AAGGAGATCA AGGCCGACGG CGGCAATCTC GTCCTGGCGC TCAGCGAAGG CAATGCCGAT ATGCCGCTGC TGCTGTCGGA CTACCATCTG GTCATCCAGC CGAATGGCGG CGTCGACGAC CCGCTCGCCT CGATCGGCAC CGGTCCCTAT AAGCTGACGA GCTTCGAGGC CGGCGTCCGC GCCACCTTCG AAAAGAACAA GGAGGACTGG CGCAGCGACC GGGGTTATGT CGATTCGATC GAGGTGATCG GCATGAACGA CGCCACCGCC CGTATCGCAG CACTTTCGTC CGGCCAGGTG CATTACATCA ACCGCGTCGA CCCGAAGACT GTCAACCTGT TGAAACGCGC GCCTAATGTC GAGATCCTCT CGACCGCCGG CCGCGGCCAT TACGTCTTCA TCATGCACTG CGACAAGGCG CCTTTCGACA ATAACGACCT GCGCCTGGCG CTCAAATACG CCATGGACCG CGAGACCATG GTGCAGAAGA TCCTCGGCGG TTACGGCAAG GTCGGCAACG ACTTCCCGAT CAACAGCACC TACGCGCTGT TTCCCGAGGG CATCGAGCAG CGTGTTTACG ATCCTGACAA GGCCGCCTTC CACTACAAGA AATCGGGCCA TAGCGGCTCG GTGCTGCTGC GCACCTCCGA AGTCGCCTTC CCCGGCGGCG TCGATGCGGC CGTGCTCTAT CAGGAAAGCT GCAAGAAGGC CGGGATCGAG ATCGAGGTCA AGCGCGAACC GGGCGACGGC TACTGGACCA ACGTCTGGAA CGTCCAGCCC TTCTCGACCT CCTATTGGGG CGGCCGGCCG ACGCAGGACC AGATGTATTC CACAGCCTAT CTCTCGACGG CGGACTGGAA CGACACCCGG TTCAAGCGTC CCGATTTCGA CAAGCTGCTG CTACAGGCCC GCTCGGAACT TGATGAAGCC AAGCGCAAGG ACATGTACCG CACCATGGCG ATGATGGTGC GCGACGAAGG CGGCGTGATC CTGCCGATGT TCAACGACTT CGTGAACGCC TCCACCAAGC AGGTGAAGGG TTACGTCCAC GACATCGGCA ACGACATGTC GAACGGCTAT GTCGCCACCC GCGTCTGGTT GGACGCCTGA
|
Protein sequence | MNDYSKYLAG RVTAGGLSRR EFMGRAVAAG ITLAVADKLF TESAQAAEPK RGGHLKLGLE GGAATDSKDP AKFLSQFMFC VGRCWGDMLV ESDPLTGAAV PALAESWEAS KDAVTWTFKI RKGVKFHDGK EMTIDDVVAT LKRHTDKKSE SGALGVLGSI KEIKADGGNL VLALSEGNAD MPLLLSDYHL VIQPNGGVDD PLASIGTGPY KLTSFEAGVR ATFEKNKEDW RSDRGYVDSI EVIGMNDATA RIAALSSGQV HYINRVDPKT VNLLKRAPNV EILSTAGRGH YVFIMHCDKA PFDNNDLRLA LKYAMDRETM VQKILGGYGK VGNDFPINST YALFPEGIEQ RVYDPDKAAF HYKKSGHSGS VLLRTSEVAF PGGVDAAVLY QESCKKAGIE IEVKREPGDG YWTNVWNVQP FSTSYWGGRP TQDQMYSTAY LSTADWNDTR FKRPDFDKLL LQARSELDEA KRKDMYRTMA MMVRDEGGVI LPMFNDFVNA STKQVKGYVH DIGNDMSNGY VATRVWLDA
|
| |
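The relationship between the gene sequence and the protein sequence above can be checked with a short translation sketch. This is an illustration under stated assumptions, not the method used to produce the record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which codons may act as start codons. Since this gene starts with ATG, that difference does not matter here. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order. Table 11 shares these assignments with the
# standard code; only the set of permitted start codons differs.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAACGACT ATAGCAAATA TCTCGCAGGC"
print(translate(first_blocks))  # MNDYSKYLAG, the first protein block
```

Passing the full gene sequence to `translate` should reproduce the listed protein, and `gc_fraction` on the same input can be compared with the GC content field. Both functions strip the block spacing first, so the sequences can be pasted in as they appear in the record.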