Gene Information
Locus tag | Rleg2_5763 |
Symbol | |
ID | 6977153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | - |
Start bp | 167872 |
End bp | 169398 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643393219 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002278037 |
Protein GI | 209546147 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
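The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the protein length excludes the terminal stop codon; the record does not state either convention explicitly. The function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 167872
end_bp = 169398


def gene_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length = gene_length(start_bp, end_bp)
print(length)                  # 1527
print(protein_length(length))  # 508
```

Under these assumptions the results match the reported Gene Length (1527 bp) and Protein Length (508 aa).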
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000000235373 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.926009 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACAC TTGTCGCATT CCTGCTCGGC ACCGCGCTCG TCGCCCTGCC TTCGACCTTG CTCGCCCAGG AAAAGGGCGG CGTCATCAAT GTCGCGACGA TCGGCGAGCC GCCGACGCTC GATCCGATGT CGTCGACGGC CGATCTCGTC GGCATCGTCA CGCAGCATAT TTTCGAAACC CTCTACACTT TCGACAAGAG CTGGAACGTC ACACCGCTGC TGGCCGAAAG CCTGCCTGAG ATCAGCGCCG ACGGCAAAAC CTATACGATC AAGCTCAGGA CCGGCATCAA GTTCCACGAC AATAGCGACA TGACCTCGGA CGATGTCGTC GCCTCGCTTG GCCGCTGGAT GAAGATCGCC TCGCGCGGCA AGCAGGTGGC CGGCTTCATC GACAAGATTA CCGCCGCTGA TGCCTCGACA GTGACCATCA CGCTGAAGCA GCCCTATGCG CCGCTGACCT CGCTGCTTGC CTTCAACAAT TCGGCGGCAA TCATCATCCC TGCCGAAAAG CAGGACGAGC CGATGAAGGA CTTCATCGGC ACCGGTCCCT ATATGCTGAA GGAGCGCAAG GCCGACCAAT ATATCCAGCT CGTCCGCTTC GACGGCTACA AGTCCCGCGA AGGCGACAGC AATGGGTATG GCGGCGCCCG CCATCAATAT CTCGATGAAA TCCGCTTCGT GCCGGTGCCG GATCCGAACA CCCGCGTCGA GGCCGCCATC TCAGGCCAGT ATGATTATGT CGACTCGATC GCGGTCGAAT CCTACGACAA GCTGAAAGCT TCCAACGCCT CGCAGCCGGT CATGTTGAAG CCCTTCGGCT ACCCGGTCTT CGTCTTCAAT ACCAAGGAAG GTGTGGCCGG GAATGTCGAG GTTCGCAAGG CGATCCGCCA GGCGCTCAGC ATGGAAGACA TGCTGGCCGC CGCCTTCGGC AGCAAGGATT TTTATGCGCT CGACGGCGCC ATCTATCCGA AGACCTTTTC CTGGTCGACG GATGCCGGCG TCGAGGGCGC CTATAACGTC GCCGATCCGG AAGGGGCTGC CGCTGCCGCC AAGAAGGCCG GCTACAACGG TGAGCCGATC CGGATTCTGA CCAGCCGCCA GTACGAATTC CACTACAAGA TGGCGCAGGT CGCCGCCGAA TATCTGAAGC TTGCCGGCTT CACCGTCGAT ATGCAGGTGG TGGACTGGGC GACGCTGACG CAGCGCCGCA CCGACCCGAA GCTCTGGGAT ATCTACATCA CCCACAGCCC CTTCCTGCCG GAGCCGGCGC TGATCGGCTC GCTCTCGACC AGCTCGCCCG GCTGGTGGGA TACGCCGGCC CGCAAGGCCG CCGTCGATGC CTTCACCTCC GAGGTCGACC CGAAGAAGCG CGTGGCACTC TGGGCCGATG TCCAGAAGGC TATCTATACA GACGCGCCCT TCATGAAGAT CGGCGATTTC AACGCCGTCG CGGCAAAGTC GGTCAAGCTT GAAGGCGTCG ATGCGGCCCC GTGGCCATAT TTCTGGAACG CTTCGATCAA GAAGTAA
|
Protein sequence | MKTLVAFLLG TALVALPSTL LAQEKGGVIN VATIGEPPTL DPMSSTADLV GIVTQHIFET LYTFDKSWNV TPLLAESLPE ISADGKTYTI KLRTGIKFHD NSDMTSDDVV ASLGRWMKIA SRGKQVAGFI DKITAADAST VTITLKQPYA PLTSLLAFNN SAAIIIPAEK QDEPMKDFIG TGPYMLKERK ADQYIQLVRF DGYKSREGDS NGYGGARHQY LDEIRFVPVP DPNTRVEAAI SGQYDYVDSI AVESYDKLKA SNASQPVMLK PFGYPVFVFN TKEGVAGNVE VRKAIRQALS MEDMLAAAFG SKDFYALDGA IYPKTFSWST DAGVEGAYNV ADPEGAAAAA KKAGYNGEPI RILTSRQYEF HYKMAQVAAE YLKLAGFTVD MQVVDWATLT QRRTDPKLWD IYITHSPFLP EPALIGSLST SSPGWWDTPA RKAAVDAFTS EVDPKKRVAL WADVQKAIYT DAPFMKIGDF NAVAAKSVKL EGVDAAPWPY FWNASIKK
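The relationship between the two sequences above can be illustrated with a small translation routine. This is a minimal sketch, not the database's own pipeline: it assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in the conventional T, C, A, G order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in the record.
prefix = "ATGAAAACAC TTGTCGCATT CCTGCTCGGC"
print(translate(prefix))  # MKTLVAFLLG
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` should, under the same assumptions, reproduce the listed protein and a GC fraction close to the reported 61%.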