Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6670 |
Symbol | |
ID | 8022580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | - |
Start bp | 100181 |
End bp | 101707 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644833537 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002984671 |
Protein GI | 241666587 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000010094 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0327722 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACAC TGGTCGCATT CCTTCTTGGC ACTGCGCTCG TCGCCCTGCC TTCGACGCTC CTTGCCCAGG AAAAGGGCGG CGTCATCAAT GTCGCGACGA TCGGCGAACC GCCGACGCTC GATCCGATGT CGTCGACGGC CGATCTGGTC GGCATCGTCA CGCAGCACAT TTTCGAAACG CTCTACACCT TCGACAAGAG CTGGAACGTC ACGCCGCTTC TGGCCGAAAG CCTGCCGGAG ATCAGCGCCG ACGGCAAAAC CTATACGATC AAGCTCAGGA CCGGCATCAA GTTCCACGAC AATACCGATA TGACCTCGGA AGATGTCGTC GCCTCGCTTG GCCGCTGGAT GAAGATCGCT TCGCGCGGCA AGCAGGTGGC CGGCTTCATC GATAAGGTCA CCGCCGTCGA TCCCTCAACC GTCACGATCA CGCTGAAGCA GCCCTATGCG CCGCTGACCT CGCTGCTCGC CTTCAACAAT TCGGCCGCCA TCATCATCCC ATCCGAGAAG CAGGACGAGC CGATGAAGGA CTTCATCGGC ACCGGTCCCT ACATGCTGAA GGAGCGCAAG GCCGACCAGT ATATCCAGCT TGTCCGCTTC GATGGCTACA AGTCACGTGA AGGCGACAGC GATGGCTATG GCGGCGCCCG CCACCAGTAT CTCGATGAGA TCCGCTTCGT GCCGGTGCCG GATCCGAACA CCCGCGTCGA GGCTGCCGTT TCCGGCCAGT ACGACTACGT CGACTCGATC CCGGTCGAAT CCTACGACAA GCTGAAGGCC TCCACCGCCT CGCAGCCGAT CATCCTGAAG CCCTTCGGTT ATCCCGTCTT CGTCTTCAAT ACGAAGGAAG GTATTGCTGG GAATGTCGAG GTTCGCAAGG CGATCCGCCA GGCGCTCAGC ATGGAAGACA TGCTGGCGGC GGCTTTCGGC AGCACGGATT TCTACGCGCT CGACGGCGCC ATCTATCCCA AGACCTTTGC CTGGTCGACA GATGCTGGCG TCGAGGGCGC CTATAACGTC GCCGATCCGG AAGGGGCGGC GGCTGCCGCC AAGAAGGCCG GCTACAACGG CGAACCGATC CGCATCCTGA CCAGCCGCCA GTATGAGTTC CACTACAAGA TGGCGCAGGT CGCCGCCGAA TATCTGAAGC TTGCCGGCTT CACCGTCGAT ATGCAGGTTG TGGACTGGGC GACGCTGACG CAGCGCCGTA CCGATCCAAA GCTCTGGGAT ATCTACATCA CCCATAGCCC CTTCCTGCCG GAGCCTGCCC TGATCGGCTC GCTCTCGACC AGCTCGCCCG GCTGGTGGGA TACCCCGGCC CGCAAGGCCG CCGTCGATGC CTTCACCTCG GAAGTCGATC CGAAGAAGCG CGTGGCGCTC TGGGCCGATG TCCAGAAGGC GATCTATGCC GACGCCCCCT TCATGAAGAT CGGCGACTTC AACGCCGTTT CGGCAGAATC GACCAAGCTT GAGGGCGTCG ATCCGGCTCC GTGGCCGTAT TTCTGGAATG CTTCGATCAA GAAGTAA
|
Protein sequence | MKTLVAFLLG TALVALPSTL LAQEKGGVIN VATIGEPPTL DPMSSTADLV GIVTQHIFET LYTFDKSWNV TPLLAESLPE ISADGKTYTI KLRTGIKFHD NTDMTSEDVV ASLGRWMKIA SRGKQVAGFI DKVTAVDPST VTITLKQPYA PLTSLLAFNN SAAIIIPSEK QDEPMKDFIG TGPYMLKERK ADQYIQLVRF DGYKSREGDS DGYGGARHQY LDEIRFVPVP DPNTRVEAAV SGQYDYVDSI PVESYDKLKA STASQPIILK PFGYPVFVFN TKEGIAGNVE VRKAIRQALS MEDMLAAAFG STDFYALDGA IYPKTFAWST DAGVEGAYNV ADPEGAAAAA KKAGYNGEPI RILTSRQYEF HYKMAQVAAE YLKLAGFTVD MQVVDWATLT QRRTDPKLWD IYITHSPFLP EPALIGSLST SSPGWWDTPA RKAAVDAFTS EVDPKKRVAL WADVQKAIYA DAPFMKIGDF NAVSAESTKL EGVDPAPWPY FWNASIKK
|
| |