Gene Information |
Locus tag | Rleg_4446 |
Symbol | |
ID | 8015212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4579098 |
End bp | 4580930 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644827021 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002978223 |
Protein GI | 241207127 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
| |
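The coordinate and length fields above are related by simple arithmetic. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon, the check below reproduces the reported gene and protein lengths. The function names are hypothetical and are not part of any database API.

```python
# Consistency check for the Gene Information fields.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the CDS includes exactly one terminal stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the table above (Rleg_4446).
length_bp = gene_length(4579098, 4580930)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

With the values from this record, the sketch gives 1833 bp and 610 aa, matching the Gene Length and Protein Length fields.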
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.215863 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGCTC TGTGGTCGAA GATCGGTTTG TTTCTATCGC TTGCGGGTGC TCTGGCGCCA ATGTCGGCAA CGGGTCAGGA CCAGCCCTTT CAGATCGGAA GCTCGGTCAT CAGCGAGATG AAGTACAAGC CGGGCTTTGC GCATTTCGAC TACGTCAATC CCGATGCACC GAAAGGCGGA GATCTGCGCC TCTCCGCAAG CGGCGCCTTC GACACCTTCA ACCCGCTGCT CGCCAAAGGC CAGGCGGCAG TGGGCCTGAC GCTCGTTTAC GACACGTTGA TGAAGCCCGC CGACGACGAG CTGCTCGTCT CCTACGGTCT GCTTGCCGAG GGATTGTCTT TTCCCGCTGA CGTCTCAAGC GCGACCTTCC GCCTGCGCAA GGAAGCGAAA TGGGCGGATG GTCAGCCGGT CACGCCGGAA GACGTCATCT TCAGTCTGGA CAAGACCAAG GAATTAAATC CCCTCGCCTC GAACTATTAC CACCACGTTG TCAAAGCGGA AAAGACCGGC GAGCGTGACG TCACCTTCAC CTTCGATGAG AAGAACAACC GCGAACTGCC GAATATTCTC GGCCAGTTGA TGGTCGTGCC GAAGCATTGG TGGGAAGCGC CCGGACCGGA TGGCAAGCCG CGCGACATTT CTAAAACGAC GCTGGAGCCC GTGATGGGTT CGGGGCCGTA CAAGATCGCT TCCTTCTCGC CTGGCGCAAC GATCCGTTAT GAATTGCGTG ACGACTATTG GGGCAAGGAC CTCAATGTGA ATGTCGGCCA GAACAATTTC CGCAACGTCA ACTACACCTA TTTCGGTGAT CGGGATGTCG AGTTCGAGGC CTTTCGCGCT GGCAACAGCG ACTACTGGCA GGAAACCACG GCTGCCCGCT GGGCGACGGG ATATGATTTT CCCGCGGTGA AGGAAGGGCG TGTCAAAAAA GAGGAGGTTG CAAACCCCCT GCGCGCCACC GGCATCATGC AAGCTCTCGT GCCCAACATG CGACGTGACC TCTTCAAGGA CATCAGGGTC CGCGAGGCGC TGAACTACGG TCTGGATTTT GAGGAACTGA ACCGCACCGT TGCCTTTAAT AGTTACAAAC GCATCGACAG CTATTTCTGG AATACCGAAC TCGCCTCCTC CGGCCTGCCG CAGGGTAAAG AACTGGAAAT TCTGCAGGGC ATGAAGGATA AGGTTCCGCC CGAAGTCTTC ACCACGCCCT ATACCAATCC CGTCGGCGGC GATCCGCAAA AGAGCCGCGA CAACCTCCGC AAGGCGATTG CGCTTTTCAA AGAAGCCGGC TGGGAGCTCA AGGGCAATCG CATGGTCAAT ACCAAGACTG GCCAGCCGAT GAGTTTCGAG ATCCTGTTGT CGAGCCCCAT GCTGGAGCGC TGGGCGGTGC CCTATGCCAA CAATCTCAGG AAAATCGGCA TCGATGCGCG GATCCGGACA GTCGATGCGT CGCAATCTGT CAATCGTGAA CGCAGCTTCG ACTACGATAT GATCTGGAAT GTCTGGGCGG AGACCATGAA TCCGGGCAAC GAACAAGCCG ACTATTGGGG ATCCGGTTCG GTCAATCAGC AGGGTTCCCG CAATTATGCC GGCATTGCCA ACCAAGCCGT TGATGAGCTC ATTCGCATGA TTATCTTCGC GCCGAACCGC GGCGAGCAGA TCGCAGCAAT CAAGGCCATG GATCGGGTCT TGCTTGCAAA TCACTACGTC ATCCCGCTGT TCTACCGCGA TACCTATAAC ATCGCCTATT GGAACACGGT CACGCATCCG GCCGAGTTTC CGGCCTACAG CCTTGGCTTC 
CCCGATGCCT GGTGGTCGAC CTCGGCAAAA TGA
|
Protein sequence | MAALWSKIGL FLSLAGALAP MSATGQDQPF QIGSSVISEM KYKPGFAHFD YVNPDAPKGG DLRLSASGAF DTFNPLLAKG QAAVGLTLVY DTLMKPADDE LLVSYGLLAE GLSFPADVSS ATFRLRKEAK WADGQPVTPE DVIFSLDKTK ELNPLASNYY HHVVKAEKTG ERDVTFTFDE KNNRELPNIL GQLMVVPKHW WEAPGPDGKP RDISKTTLEP VMGSGPYKIA SFSPGATIRY ELRDDYWGKD LNVNVGQNNF RNVNYTYFGD RDVEFEAFRA GNSDYWQETT AARWATGYDF PAVKEGRVKK EEVANPLRAT GIMQALVPNM RRDLFKDIRV REALNYGLDF EELNRTVAFN SYKRIDSYFW NTELASSGLP QGKELEILQG MKDKVPPEVF TTPYTNPVGG DPQKSRDNLR KAIALFKEAG WELKGNRMVN TKTGQPMSFE ILLSSPMLER WAVPYANNLR KIGIDARIRT VDASQSVNRE RSFDYDMIWN VWAETMNPGN EQADYWGSGS VNQQGSRNYA GIANQAVDEL IRMIIFAPNR GEQIAAIKAM DRVLLANHYV IPLFYRDTYN IAYWNTVTHP AEFPAYSLGF PDAWWSTSAK
|
| |
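The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a rough sketch, not the database's own method: it assumes the sequence is pasted as plain text with the space-separated 10-base blocks shown above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Build the codon table in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        residues.append(residue)
    return "".join(residues)


# The first 21 bases of the gene sequence, as formatted in the record.
print(translate("ATGGCGGCTC TGTGGTCGAA G"))
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after applying `clean` to the latter) checks that the two fields agree. `gc_percent` on the full gene sequence can be compared with the GC content field.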