Gene Information |
Locus tag | Rleg_3951 |
Symbol | |
ID | 8014766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4025366 |
End bp | 4027267 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644826520 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002977731 |
Protein GI | 241206635 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
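The coordinate and length fields in the record above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, but both are consistent with the reported values. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 4025366
END_BP = 4027267
REPORTED_GENE_LENGTH = 1902      # bp
REPORTED_PROTEIN_LENGTH = 633    # aa


def gene_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Amino acid count of an unspliced CDS, assuming one stop codon is excluded."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)

print("gene length matches:", computed_gene == REPORTED_GENE_LENGTH)
print("protein length matches:", computed_protein == REPORTED_PROTEIN_LENGTH)
```

The strand field does not affect this check: a gene on the minus strand spans the same number of bases as one on the plus strand.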
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTCC GTAAATATGC AGCCTTTGCC TTATTGTCGC TCATGGCCCT TCCGGCATTT GCGCAGGATT TCATTGCCGG CATACCGCGC AACGAAACGC TGATCATCCA GGGCACGCCG CAGCAGAATG CCGACTGGTT CAACGTCTGG GCTCCGGGCG GCGGGGCGGC GGCAAACCTC AACGGCCTGC AGCAGCTGAC CACCGATACG CTGTGGTTCA TCAATCCCGA GGGCGGCAAG GATGCCTGGC AGAATGCGCT TGCCAGCGAA CCGCCGAGCT ACAACGCCGA TTTCACCGAG ATGAAGGTAA AGCTGCGCAA GGGGATCTTC TGGAGCGACG GTGTCGAATT TACCAGCGAC GACGTGGTCT ATACCGTGCA GACGCAGATC GACCATCCCG GCATGGCCTG GAGCGCGCCC TTCACCGTCA ATGTCGCGAG TATCGAAAAT CCTGATCCGC AGACCGTCGT CTTTCATCTG AAGAAGCCGA ACTCGCGCTT CCACACGCTG TTCACGGTGC GTTGGAATGC CGCCTGGATC ATGCCGAAGC ACGTCTTCGA GAAGGCCGCC GATCCGCTGT CCTTCAATAA TAATCCGCCG GTTTCGCTCG GGCCCTACCA GCTGCAGAGC TATGACAAGG GCGGCAACTG GACGATCTGG AAGCTGCGTG ATGACTGGCA GCGCACGTCG ATCGGCCTGG CTGCGGCACA ACCGCCTGAG GTCAAATACG TCGTCTATCG GGCCGCCGGC AATCCGGAAG CGCGCGTCAT CGAACAGCGC AACCACAATC TCGACGTCAT CAACGACATG GCGCCCGAGG GCATGTTCTC GATCATGCGC GACAGCAAGA GCACGGCCTC CTGGCTGAAG GGCTTTCCTT TCGCACATCC GGACCCGACG CTGCCTTCCG TTCTCTTCAA TACGAAGAAG GCGCCCTTCG ACAATAAAGA CGTGCGCTGG GCTCTCGCTC TACTGATCGA TATCCGCGAA GTGGCGCTCG GCTCCTACCG CGGCGCGGCC AATATCGCCG CCCTTGCCAC GCCGCCGACG GGTTCTGCCC CGGACGACTA TTACGCACCG ATGCAGGACT GGCTGACGAA TTTCGAGCTC GACACCGGAT CCCGCAAGAT CAAGCCCTAC GATCCCAACA TCTCGGCGCA GATCGCCAAT ATGGTGCGCA GCCAATGGGC CGATCAGATC CCGACCGATC CGGCCAAGCT GCAGCGTACA TTCGGCTTCG GCTGGTGGAA GAAGGACGTT CAGGCTGCAA CCGAGCTGCT GCAGAAGGCT GGTTTCAAGA AAAGCGGCCG CCAGTGGGTG AAGCCTGACG GCACGCCCTT TACGATCCGC CTGCAGGTGG AAGGCGATGC CATCCCGACG CTTGCCCGCG CCGGCACGGT GATCGCCCAG CAATGGTCGC AGGCCGGCAT CGCGACCAAG GTCGATGTCG CCGGCCCGAC CAATGGCCAG CGCCTCAGCA CCGGCGATTT CGAGACGGCG ATCTACTGGA GCATCGAGAC CTGGGGCGGT CATCCCGACC TCTCCTTCTT CCTCGACAGC TATCATTCGG AGTTCATCAA GCCGGTCGGG CAGATCCAGC CGCCGCGCAA TCTGCAGCGC TGGCAGGATC CGCGTCTCGA CCAGCTGATC GAGCGCAATC GGTCGATCGC CTTCGATTCG CCCGATGTCG CCAAGCTCGG CCAGGACTTC CTGAAGCTTG CCGTCGAGGA AATGCCGATG ATCCCGCTGA TGGCCTACAA CAAGTTCGCA CCGCTCGATA CGACCTACTG GACCAACTAT 
CCGAGCGCTG ACAATCCCTA TTCGGCCTCG GGTCCGAACT GGTCGAACAT TCGCTACATG GTGGTCGGGC TGAAGGCCAA TCCGGATGCG CCGAAGCCTT GA
|
Protein sequence | MKFRKYAAFA LLSLMALPAF AQDFIAGIPR NETLIIQGTP QQNADWFNVW APGGGAAANL NGLQQLTTDT LWFINPEGGK DAWQNALASE PPSYNADFTE MKVKLRKGIF WSDGVEFTSD DVVYTVQTQI DHPGMAWSAP FTVNVASIEN PDPQTVVFHL KKPNSRFHTL FTVRWNAAWI MPKHVFEKAA DPLSFNNNPP VSLGPYQLQS YDKGGNWTIW KLRDDWQRTS IGLAAAQPPE VKYVVYRAAG NPEARVIEQR NHNLDVINDM APEGMFSIMR DSKSTASWLK GFPFAHPDPT LPSVLFNTKK APFDNKDVRW ALALLIDIRE VALGSYRGAA NIAALATPPT GSAPDDYYAP MQDWLTNFEL DTGSRKIKPY DPNISAQIAN MVRSQWADQI PTDPAKLQRT FGFGWWKKDV QAATELLQKA GFKKSGRQWV KPDGTPFTIR LQVEGDAIPT LARAGTVIAQ QWSQAGIATK VDVAGPTNGQ RLSTGDFETA IYWSIETWGG HPDLSFFLDS YHSEFIKPVG QIQPPRNLQR WQDPRLDQLI ERNRSIAFDS PDVAKLGQDF LKLAVEEMPM IPLMAYNKFA PLDTTYWTNY PSADNPYSAS GPNWSNIRYM VVGLKANPDA PKP
|