Gene Information |
Locus tag | Rleg_5539 |
Symbol | |
ID | 8016430 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012853 |
Strand | - |
Start bp | 125745 |
End bp | 127025 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644827706 |
Product | ABC transporter substrate-binding protein |
Protein accession | YP_002978906 |
Protein GI | 241518278 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| |
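The coordinate and length fields above agree with each other, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. Neither convention is stated in the record, though both are consistent with its numbers. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(125745, 127025)
print(length, protein_length(length))  # 1281 426
```

Both values match the Gene Length and Protein Length fields in the record.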
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0803518 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00000350864 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTTACCC GACGCGATTT TCTGAAGACG ACGGCCGCCA CCGGTGCATT GGCGGCGACA TCCGGGCTCG CCGCCCCCGC GATCGCGCAG GACGCCGCGA TCAAGCTCGG CTATGTCAGC CCGCAGACGG GACCGCTTGC CGCCTTCGGT GAGGCCGACA AGTTCGTCAT CGACAGTTTT CTGGCAGTCA CCAAGTCGAA GGGTCTCAAC TACGAGGTTG TCGTCAAGGA CAGCCAATCC AATCCGAACC GGGCGGCGGA GGTCGCCAAG GAACTGATCG TCACCGACGA GGTGAACCTG ATCCTCGTCG CCTCGACGCC GGAGACCACC AATCCGGTGG CGACCACCTG CGAGGCTGAG GAAATGCCCT GTATTTCGAC GGTGGCTCCC TGGCAGCCGT GGTTCATCGG CCAGCAGGGC AATCCCGGCG ACCCGACCTC CTGGAAACCA TTGAACTACG CCTATCACTT CTTCTGGGGT CTCGAGGACG TCATCTCGGT CTTCACCAAC ATGTGGGCGC AGATCGAGAC CAACAAGAAG GTTGGCGGCC TCTTCCCAAA TGACGGCGAC GGCAATGCCT GGGGCGACAA GGTCGTCGGC TTCCCGCCGG TGCTGGAAAA GATGGGCTAC GGGCTGATCG ACCCCGGCCG CTATCAGAAC ATGACGGATG ATTTCTCGGC GCAGATCAAC GCCTTCAAAT CGGGCCAGTG CGAAATCATC ACCGGCGTGG TGATCCCGCC TGACTTCACC ACCTTCTGGA ACCAGGCCAA GCAGCAGGGT TTCGCCCCGA AGATCGCCTC GATCGGCAAG GCACTGCTGT TCCCGCAGAC GGTGGAGGCG CTCGGCAATG CCGGGCATAA TCTGTCGTCG GAAGTCTGGT GGACGCCGAG CCATCCGTTC AAATCGTCCT TGACGGGCGA AAGTACAGCA GAGGTGGCGG CCGCCTTTAC CAAGGCGACT AGCAGGCCGT GGACGCAGCC GATCGGTTTT GCCCATGCGC TGTTCGAGCT GGCGGTGGAT GCGATGAAGC GGGCCGGAGA TCCGACAGAC GGGGATGCCG TCGCGCAGGC GATTGCCGCC ACCAAGCTCG ATACGCTGGT CGGGCCGATT GCTTGGGACG GCAAGGGCCT GCCGCCTTTC GCGGCCAAGA ACATTGCCAA GACGCCGCTC GTCGGCGGCC AGTGGCGGTT GAAGGACGGC GGCGGCTACG ATCTCGTCAT CACCGACAAC AAGACGGCGC CGAACATTCC GGTCGGCGGC AAGATGGAAG CAATCGCCTG A
|
Protein sequence | MFTRRDFLKT TAATGALAAT SGLAAPAIAQ DAAIKLGYVS PQTGPLAAFG EADKFVIDSF LAVTKSKGLN YEVVVKDSQS NPNRAAEVAK ELIVTDEVNL ILVASTPETT NPVATTCEAE EMPCISTVAP WQPWFIGQQG NPGDPTSWKP LNYAYHFFWG LEDVISVFTN MWAQIETNKK VGGLFPNDGD GNAWGDKVVG FPPVLEKMGY GLIDPGRYQN MTDDFSAQIN AFKSGQCEII TGVVIPPDFT TFWNQAKQQG FAPKIASIGK ALLFPQTVEA LGNAGHNLSS EVWWTPSHPF KSSLTGESTA EVAAAFTKAT SRPWTQPIGF AHALFELAVD AMKRAGDPTD GDAVAQAIAA TKLDTLVGPI AWDGKGLPPF AAKNIAKTPL VGGQWRLKDG GGYDLVITDN KTAPNIPVGG KMEAIA
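The gene sequence is listed in coding orientation, starting with ATG and ending with the stop codon TGA, although the gene lies on the minus strand. It can therefore be translated directly. The following is a minimal sketch of that translation, not the database's own pipeline. It builds the codon assignments used by translation table 11, which are the same as those of the standard code, and ignores alternative start codons. `translate` is a hypothetical helper name.

```python
from itertools import product

# Codon assignments in NCBI order (first base varies slowest, bases ordered TCAG).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGTTTACCC GACGCGATTT TCTGAAGACG"))  # MFTRRDFLKT
```

The result matches the first ten residues of the listed protein sequence. Passing the full gene sequence should give the full 426-residue protein under the same assumptions.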
|
| |