Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5178 |
Symbol | |
ID | 8007074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 584225 |
End bp | 586162 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644822088 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002973348 |
Protein GI | 241113513 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.758105 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.361664 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAAAG AAATACTGTC GCGTGGCGTC ACCCGCCGCG CCGTGCTTGG TGGCATGGCT GGTGTCGCGG CGCTGTCTAT CGCCGGTCGC GTGTCTGCCG CAGGCGGTGA AGCGCCGGCA CTTGCCCAAC TCGCCAAGGA TGGAAAGCTG CCGCCGCTCG CCGAGCGCCT GCCGAAAAAG CCGATGGTCG TGACGCCGTT CGAGAAAGTC GGCACCTATG GCGGTTCGCT GCGCCGTGGC CTTCGCGGTT CATCCGACCA TAACGGCATC CTGCGTATGG TCGGCAACCA GAGCCTTGTG CGCTGGAATC TCGATTTTAC CGCCGTGCAG CCGAACCTTG CCGAGCGCTG GGAAGTCAGC GACGACGCAA CACAATTCAC GTTCCATCTC ATCGAAGGCG TGCGCTGGTC GGACGGTCAT CCCTTTACGG CCGATGACGT CGTTTTCGCG ATCGAAGACT GCGTCAAGAA CACCGAGCTC TACAGCTCGA CACCGGCGCA GCTTGCCGTC GCCGGCAAGC CAGTCACCGT CGAGAAGATC GACGATTACA CGGTGAAGTT CACCTTTGCC GCGGCCAATG CGCTTTATCT GGAAAACCTC GCCACGCCGC TTGGTCAGCA TCCGACGCTG TTCCCGAAGC ATTACTGCAG CCAGTTCCTG CCGAAATATA ATCCCAATAT CGAGGCCGAT GCGAAGAAGG CCGGCGTCAC CAGCTGGACG GAGCTGTTCC GCAGCCGTTG CGGCGACATC GAGATCCCGT CGCGATGGGG CAATGTCGAC AAGCCGACGC TCGATCCATG GGTGGTCAAG GAGCCCTATG CCGGCGGTGC GACGCGTGTC GTCATGACCC GCAATCCCTA TTTCTGGCAG GTCGATACCG AGGGCAACCA GCTTCCCTAC ATCGATGAAA TCAACTTCGG CATCTCACAG GACGTCGAAT CGCTGATGCT GAACGTCATC TCTGGAAAGA TCGACATCCA GGAACGCCAC ATCAGCGTTC TCGCCAACAA GCCGACGCTG TCCAAGAACA TGGAAAAGGG CGATTATCGG CTGCTGACGC TCGTGCCTTC GGCCTCGCAA CAGTGCCAGA TCTATTTCAA CATCACCCAC AAAGATCCTG CCATGCGCAA GATGTTTGCC GACAAGGCGT TCCGGCAAGC GCTTTCGATC GGCATCAATC GCCAGGAGCT CATCGACATC GTCTATTTCG GACAGAGCGA GCCCTACCAG GCAGGGCCGC GTCCGACCCA TCCGTGGTAT AACGAAAAAT ACGCGCGCCA ATTCACCGAA TTCGACGCCG ACAAGGCAGG CGCGATGCTC GATGAGGCCG GCTATAAGAA AGGCGGCGAC GGTTTCCGCC TCCGGCCCGA CGGCCAGAAG GTGTTCTTCT CGATCGACGT CATTCCGACG CTTTATCCCG ACCTCGTCGA TGCCCTGGAA CTGGTCAAGG CGCATTGGGC TCAGATCGGT GTCGACATGA AGGTCAACAC GATCGAGCGG GCGCTCTACT ACACCCGCGG CGACGACAAC GCCCATGACG CGGCGGTGTG GCCGGGTCCT GGCGGTCTCG ATCCAATGCT CGATCCGCGC GATTTCTTCG CCTTCCATCC GCAGGGTTCG CGTTACGCCA TTCCGTGGAC GCTTTGGTAC ACCTCCAACG GCGCACGCGG CGAAGAACCG CCAGAAAGCC AGAAGAAGCG CATGAAGCTC TTCGACGAAG CGCGTTCGAC GGCCGATCTC GACAAGCGCG GCGCAATCAT GAAGCAGATC TTCGACATCG CGGCCGAGGA GTTCGAGACC GTCGGCCTTT GCCTTGCCGT CGGCGGTTTC GGCATCATCC GCAACAATCT GCGCAATGTT CCCGAGAAGG AGCCGGATAG CTGGTCCTGG CCCAATCCCG GTCCGGCAAT GCCGCAGCAA TTCACCTTCA CGAGCTGA
|
Protein sequence | MAKEILSRGV TRRAVLGGMA GVAALSIAGR VSAAGGEAPA LAQLAKDGKL PPLAERLPKK PMVVTPFEKV GTYGGSLRRG LRGSSDHNGI LRMVGNQSLV RWNLDFTAVQ PNLAERWEVS DDATQFTFHL IEGVRWSDGH PFTADDVVFA IEDCVKNTEL YSSTPAQLAV AGKPVTVEKI DDYTVKFTFA AANALYLENL ATPLGQHPTL FPKHYCSQFL PKYNPNIEAD AKKAGVTSWT ELFRSRCGDI EIPSRWGNVD KPTLDPWVVK EPYAGGATRV VMTRNPYFWQ VDTEGNQLPY IDEINFGISQ DVESLMLNVI SGKIDIQERH ISVLANKPTL SKNMEKGDYR LLTLVPSASQ QCQIYFNITH KDPAMRKMFA DKAFRQALSI GINRQELIDI VYFGQSEPYQ AGPRPTHPWY NEKYARQFTE FDADKAGAML DEAGYKKGGD GFRLRPDGQK VFFSIDVIPT LYPDLVDALE LVKAHWAQIG VDMKVNTIER ALYYTRGDDN AHDAAVWPGP GGLDPMLDPR DFFAFHPQGS RYAIPWTLWY TSNGARGEEP PESQKKRMKL FDEARSTADL DKRGAIMKQI FDIAAEEFET VGLCLAVGGF GIIRNNLRNV PEKEPDSWSW PNPGPAMPQQ FTFTS
|
| |