Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_0390 |
Symbol | |
ID | 6979105 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 403328 |
End bp | 404923 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643395103 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002279915 |
Protein GI | 209547998 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.18086 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.296586 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGA TCTCTGTCAT GCTGGCAGCG ACGGCCCTGA TTTCGGTCAT GGCGACGTCG GCCTGGTCCA AGACCCTTGT TTATTGCTCC GAGGGTTCGC CTGAGGGTTT CGACCCGAGC CTCTATACGG CTGGCACGAC CTTCGACGCC TCGTCGCGTA CGGTCTATAG CCGCCTGGTC GAATTCAAGC ATGGCGGTAC CGAGATCGAA CCGGGCCTCG CCGACAGCTG GAGCGTTTCG GCCGACGGCA CGGAATACAC CTTCAAGCTT CATCCCGGCG TCAAGTATCA GACCACCGAC TTCTTCACGC CGACGCGCGA TTTCAACGCC GACGACGTCG TGTTCTCCTT CGAGCGCCAG CTGAAGGCCG ACAATCCGTG GAACAAGTAT GTCGAGGGCG GTTCTTATGA ATACGCCGCC GGCATGGGCT TCCCGGATCT GATCAAGTCG ATCGAAAAGG TCGACGACCT CACGGTCAAG TTCACGCTCA ACCATCCCGA AGCGCCGTTC CTTGCCGATC TGGCGATGGA CTTCGCCTCG ATCGTTTCCA AGGAATATGC CGACAAGCTC GCCGCCGACG GCAAGATGGC GCAGCTCAAC CAGCAGCCCC TCGGCACCGG CCCCTTCACC TTCGTCGCCT ACCAGCCGGA TGCCGTCATC CGCTACAAGG CCAACGAAAC CTATTTCAAG GGCAAGGAAA AGATTGACGA TCTGGTTTTC GCCATCACCT CTGACGCCGC CGTCCGCGCG CAGAAGCTGA AGGCCGGCGA ATGCCACCTG ATCCCCTATC CGAATGCGGC TGACGTTCCC GAGTTGAAGA AGGACGGCAA TCTGACGGTG ATGGAACAGG CCGGCCTGAA TGTCGGCTTC CTCGCCTACA ACACGCAGAT GGCCCCGTTC GACAAGCCGG AAGTTCGCCG TGCGCTCAAC ATGGCGATCA ACAAGCAGGC GATCATCGAC GCCGTCTTCC AGGGCGCAGC GGCTGTTGCC AAGAACCCGA TCCCGCCGAC GATGTGGTCC TATAACGACG CCGTTCAGGA CGACAAGTAC GATCCGGATG CCGCCAAGAA GGCTCTCGCC GATGCCGGCG TCAAGGATCT CAGCATGAAG GTCTGGGCGA TGCCGGTGTC GCGTCCCTAC ATGCTGAACG CGCGCCGCGC CGCCGAACTG ATCCAGGCCG ATTTCGCCAA GGCCGGCGTC AAGGTCGAGA TCGTTACCCA TGAATGGGCC GAATATCTGA AGCTCTCCTC CGACGTGAAG CGCGACGGCG CCGTCATCCT CGGCTGGACC GGCGACAACG GCGACCCGGA TAACTTCATG GATACGCTGC TTGGCTGCGA TGCCGTCGGC GGCAACAACC GTGCTCAGTG GTGCAACAAG GAATATGACG ACCTGATGAC CAAGGCCAAG CTGACCGCCG ATGTCGGCGA GCGCACCAAG GCCTATGAGC AGGCACAGCT GATCTTCAAG AAGGAAGCCC CCTGGGCGAC CCTCGATCAC TCGCTCGTCT TCGTTCCGAT GAGCAAGAAG GTCTCCGGCT TCTTCATGGA TCCGCTCGGC ATTCACCGCT TCGACGGCGT CGACGTATCC GAATAA
|
Protein sequence | MKKISVMLAA TALISVMATS AWSKTLVYCS EGSPEGFDPS LYTAGTTFDA SSRTVYSRLV EFKHGGTEIE PGLADSWSVS ADGTEYTFKL HPGVKYQTTD FFTPTRDFNA DDVVFSFERQ LKADNPWNKY VEGGSYEYAA GMGFPDLIKS IEKVDDLTVK FTLNHPEAPF LADLAMDFAS IVSKEYADKL AADGKMAQLN QQPLGTGPFT FVAYQPDAVI RYKANETYFK GKEKIDDLVF AITSDAAVRA QKLKAGECHL IPYPNAADVP ELKKDGNLTV MEQAGLNVGF LAYNTQMAPF DKPEVRRALN MAINKQAIID AVFQGAAAVA KNPIPPTMWS YNDAVQDDKY DPDAAKKALA DAGVKDLSMK VWAMPVSRPY MLNARRAAEL IQADFAKAGV KVEIVTHEWA EYLKLSSDVK RDGAVILGWT GDNGDPDNFM DTLLGCDAVG GNNRAQWCNK EYDDLMTKAK LTADVGERTK AYEQAQLIFK KEAPWATLDH SLVFVPMSKK VSGFFMDPLG IHRFDGVDVS E
|
| |