Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5447 |
Symbol | |
ID | 6978541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 1090073 |
End bp | 1092010 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643394548 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002279366 |
Protein GI | 209547448 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00390622 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCTAATG AAATTCTGTC GCGCGGTGTT ACCCGCCGCG CCGTGCTTGG CGGCATGGCC GGTGCAGCGG CACTGTCCAT CGCGGGCCGC GCATTCGCCG CAGGCGGCGA AGCGCCCGCA CTTGCGCAGC TTGTGAAAGA TGGAAAACTG CCGCCGCTCG CCGAGCGCCT GCCGAAGAAG CCGATGGTCG TCAAGCCGTT CGAAAAAGTC GGCACCTATG GCGGCTCTCT GCGCCGCGGC CTTCGCGGTT CATCCGACCA TAACGGCATT CTGCGCATGG TCGGTAACCA GAGCCTCGTG CGCTGGAACC TCGAATTTAC CGCTGTGGAA CCGAACCTTG CCGAGCGCTG GGAAGTCAGC GACGACGCGA CGCAATTCAC CTTCCATCTG ATCGAAGGCG TGCGCTGGTC GGACGGTCAT CCCTTTACAT CAGATGACGT CGTTTTCGCG ATCGAAGACT GCGTCAAAAA CACCGAGCTC TACAGCGCGA CACCGGCGCA GCTTGCTGTC GCCGGCAAGC CGGTTGTCGT CGAGAAGATC GACGATCATA CGGTGAAGTT CACCTTTGCG GCGGCCAACG CGCTTTACCT GGAAAACCTC GCCACGCCGC TCGGCCAGCA TCCGACGCTC TTTCCGAAGC ACTACTGCAG CCAGTTCCTG CCGAAATATA ATCCCAACAT CGAAGCCGAT GCGAAGAAAG CCGGCGTCAC CAGCTGGACG GAGCTGTTCC GCAGCCGCTG CGGCGACATC GAGATCCCGA CACGATGGGG CAATGTCGAC AAGCCGACGC TCGATCCATG GGTGGTCAAG GAACCCTATG TCGGTGGCGC CACGCGTGTC GTCATGACCC GCAATCCTTA TTTCTGGCAG GTCGATACCG AGGGCAATCA GCTTCCCTAT ATCGATGAGA TCAACTTCGG CATCTCGCAG GACGTCGAAT CACTGATGCT GAACGTCATC TCGGGCAAGA TCGACATCCA GGAGCGCCAT ATCAGCGTGC TTGCCAACAA GCCGACGCTG TCCCAGAACA TGCAGAAGGG CGACTATCGT CTGCTGACGC TGGTGCCTTC GGCCTCGCAG CAGTGCCAGA TCTACTTCAA CATCACCCAC AAAGATCCTG CCATGCGCAA GATGTTTGCG GATAAGTCGT TCCGGCAGGC GTTGTCGATC GGCATTAATC GTCCGGAGCT CATCGATATC GTCTATTTCG GCCAGAGCGA GCCTTACCAG GCCGGACCGC GTCCGACGCA TCCCTGGTAT AACGAGAAAT ACGCGCGCCA ATTCACCGAA TTTGACGCCG ACAAGGCAGG CGCGATGCTC GACCAGGCCG GCTACAAGAA AGGCGGCGAC GGTTTCCGCC TCCGGCCGGA CGGCCAGAAG GTGTTTTTCT CGATCGACGT CATTCCGACG CTCTATCCCG ACCTCGTCGA CGCGCTCGAG CTGGTCAAGA CGCATTGGGC CCAGATCGGC ATCGACATGA AGGTCAACAC CATCGAACGG GCGCTCTATT ACACCCGCGG CGACGACAAT GCCCATGATG CGCAGGTATG GCCGGGCCCT GGCGGTCTCG ATCCGATGCT CGATCCGCGC GATTTCTTCG CCTTCCATCC GCAGGGCTCG CGTTACGCCA TTCCGTGGAC GCTGTGGTAC ACCTCCAACG GTGCACGCGG CGAAGAACCC CCGGAAAGCC AGAAGAAGCG CATGAAGCTC TTCGACGAGG CGCGCTCGAC GGCCGATCTC GACAAGCGCG GTGCGGTCAT GAAGCAGATC TTCGATATCG CCGCGGAGGA ATTCGAGACC GTCGGGCTCT GCCTTGCCGT CGGCGGCTTC GGCATCATCC GCAACAATCT GCGCAATGTT CCCGAAAAAG AGCCGGACAG CTGGTCCTGG CCCAATCCGG GGCCTGCGCT GCCGCAGCAG TTCACCTTCA CGAGCTGA
|
Protein sequence | MPNEILSRGV TRRAVLGGMA GAAALSIAGR AFAAGGEAPA LAQLVKDGKL PPLAERLPKK PMVVKPFEKV GTYGGSLRRG LRGSSDHNGI LRMVGNQSLV RWNLEFTAVE PNLAERWEVS DDATQFTFHL IEGVRWSDGH PFTSDDVVFA IEDCVKNTEL YSATPAQLAV AGKPVVVEKI DDHTVKFTFA AANALYLENL ATPLGQHPTL FPKHYCSQFL PKYNPNIEAD AKKAGVTSWT ELFRSRCGDI EIPTRWGNVD KPTLDPWVVK EPYVGGATRV VMTRNPYFWQ VDTEGNQLPY IDEINFGISQ DVESLMLNVI SGKIDIQERH ISVLANKPTL SQNMQKGDYR LLTLVPSASQ QCQIYFNITH KDPAMRKMFA DKSFRQALSI GINRPELIDI VYFGQSEPYQ AGPRPTHPWY NEKYARQFTE FDADKAGAML DQAGYKKGGD GFRLRPDGQK VFFSIDVIPT LYPDLVDALE LVKTHWAQIG IDMKVNTIER ALYYTRGDDN AHDAQVWPGP GGLDPMLDPR DFFAFHPQGS RYAIPWTLWY TSNGARGEEP PESQKKRMKL FDEARSTADL DKRGAVMKQI FDIAAEEFET VGLCLAVGGF GIIRNNLRNV PEKEPDSWSW PNPGPALPQQ FTFTS
|
| |