Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5006 |
Symbol | |
ID | 8007597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 390696 |
End bp | 392306 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644821921 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002973181 |
Protein GI | 241113346 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.264167 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGACA AGATCACCAA TTGGACCAGA TCCGACGACT CCATGGTCGA AAGCGCCATC CGTCGTGGCG CCACCCGTCG CGAGTTGCTG CATATGATGC TCGCGGGCGG CGTGGCCCTG TCTGCCGGCG GGCTCGTGCT TGGCCGTGCC GGCAAGGCGC TCGCCGCCAC GCCCGTTTCC GGCGGCTCGC TCAAGGCGGC CGGCTGGTCG TCCTCGACGG CCGATACGCT CGACCCCGCC AAGGCGTCGC TCTCCACCGA CTATGTCCGG TGCTGCTCCT TCTATAACCG CCTTACCTTC CTCGACAAAT CAGGCACGCC GCAGATGGAG CTTGCCGACG CGATCGAGTC CAAGGATGCG AAGACCTGGA CGGTCAAGCT GAAGAACGGC GTTACTTTCC ATGACGGCAA GCCGCTGACC GCCGATGACG TAGTTTTCTC GCTGAAGCGC CATCTCGACC CATCCGTCGG CTCGAAGGTC GCCAAGATCG CCGCCCAGAT GACCGGCTTC AAGGCGGTCG ACAAACAAAC CGTCGAGATC ACGCTCGCCA GCCCGAATGC CGACCTGCCG ACCATTCTGT CGATGCATCA CTTCATGATC GTCGCCGACG GCACGACCGA TTTCACCAAG GCCAACGGCA CCGGCGCCTT CGTCAAGGAA GTCTTCGAGC CGGGCGTTCG CTCGGTCGGG ATCAAGAACA AGAATTACTG GAAATCCGGC CCGAACGTCG ATTCCTTCGA ATATTTCGCC ATCAGCGACG ACAATGCCCG CGTTAACGCG CTGCTTTCGG GCGACATCCA CCTCGCAGCC TCGATCAATC CGCGCTCGAT GCGCCTCGTC GAGACCCAGG GCGACGGCTT CACCTTGTCG AAGACCACCT CCGGCAACTA CACCAATCTC AACATGCGAC TGGATATGGA GCCCGGCAAC AAGCGGGATT TCGTCGAGGG CATGAAGTAT CTCGTCAACC GCGAACAGAT CGTCAAAGCG GCGCTGCGCG GTCTCGGCGA AGTTGGCAAC GACCAACCCG TTTCGCCTGC GAACTTCTAT CATGACGCAG AGCTGAAAGC GCGGGCCTTC GATCCCGACA AGGCGAAGTT CCACTTCGAC AAGGCCGGGG TTCTCGGCCA ATCCATTCCG ATCATCGCTT CCGATGCGGC GGCTTCCTCG ATCGACATGG CCATGATCAT ACAGGCGGCC GGCGCCGAAA TCGGCATGAA GCTCGACGTC CAGCGAGTGC CATCTGATGG CTATTGGGAC AATTACTGGC TCAAGGCGCC GATCCACTTC GGCAATATCA ACCCGCGTCC GACCCCGGAT ATCCTCTTCT CCCTGCTCTA CACCTCGGAC GCTCCGTGGA ACGAAAGCCA GTACAAGTCG GAGAAGTTCG ACAAGATGCT GATCGAGGCG CGCGGCTCTC TCGATCAAGA CAAGCGCAAG ACGATCTACA ACGAGATGCA GGGCATGGTC GCCCAGGAAG CTGGTACTAT CATTCCCGCC TATATCTCGA ACGTCGACGC CACGACCGCC AAGCTCAAGG GCCTGGAAGC CAACCCGCTC GGCGGCCAGA TGGGATATGC TTTCGCGGAA TATGTCTGGC TTGAAGCCTG A
|
Protein sequence | MNDKITNWTR SDDSMVESAI RRGATRRELL HMMLAGGVAL SAGGLVLGRA GKALAATPVS GGSLKAAGWS SSTADTLDPA KASLSTDYVR CCSFYNRLTF LDKSGTPQME LADAIESKDA KTWTVKLKNG VTFHDGKPLT ADDVVFSLKR HLDPSVGSKV AKIAAQMTGF KAVDKQTVEI TLASPNADLP TILSMHHFMI VADGTTDFTK ANGTGAFVKE VFEPGVRSVG IKNKNYWKSG PNVDSFEYFA ISDDNARVNA LLSGDIHLAA SINPRSMRLV ETQGDGFTLS KTTSGNYTNL NMRLDMEPGN KRDFVEGMKY LVNREQIVKA ALRGLGEVGN DQPVSPANFY HDAELKARAF DPDKAKFHFD KAGVLGQSIP IIASDAAASS IDMAMIIQAA GAEIGMKLDV QRVPSDGYWD NYWLKAPIHF GNINPRPTPD ILFSLLYTSD APWNESQYKS EKFDKMLIEA RGSLDQDKRK TIYNEMQGMV AQEAGTIIPA YISNVDATTA KLKGLEANPL GGQMGYAFAE YVWLEA
|
| |