Gene Information |
Locus tag | Rleg_0422 |
Symbol | |
ID | 8011624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 438308 |
End bp | 439903 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644823017 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002974271 |
Protein GI | 241203175 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
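
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch that assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 438308
end_bp = 439903
reported_gene_length = 1596    # bp
reported_protein_length = 531  # aa

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# A CDS that is not spliced should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = codons - 1

print(gene_length, codons, protein_length)  # 1596 532 531
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.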
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00166095 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00221721 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAAGA TCTCTGTCCT GCTGGCAGCG ACGGCCTTGA TTTCCGTCAT GGCGACGTCG GCTTGGTCCA AGACCCTTGT TTATTGCTCC GAGGGCTCGC CGGAGGGCTT CGACCCGAGC CTCTATACGG CAGGCACGAC CTTCGACGCG TCGTCGCGCA CGGTCTATAG CCGCCTCGTC GAATTCAAGC ATGGCGGTAC CGAGATCGAA CCGGGCCTGG CCGACAGCTG GAGCGTTTCG GCCGACGGCA CGGAATACAC CTTCAAGCTT CATCCTGGCG TCAAGTACCA GACCACCGAC TTCTTCACGC CGACGCGCGA TTTCAACGCC GACGACGTCG TGTTCTCCTT CGAGCGCCAG CTGAAATCCG ACAATCCGTG GAACAAGTAT GTCGAGGGCG GCTCTTACGA ATACGCCGCC GGCATGGGCT TTCCCGAGCT GATCAAGTCT GTCGAGAAGG TCGATGACCT CACCGTCAAG TTCACGCTCA ACCACCCCGA AGCGCCGTTC CTCGCCGACC TCGCCATGGA CTTCGCCTCG ATCGTCTCCA AGGAATATGC CGACAAGCTT GCCGCCGACG GCAAGATGGC GCAGCTCAAC CAGCAGCCGC TCGGCACCGG CCCGTACACC TTCGTCGCCT ACCAGCCGGA TGCCGTCATC CGCTACAAGG CGAACGAAAC CTATTTCAAG GGCAAGGAAA AGATCGACGA TCTGGTTTTC GCCATTACCT CTGACGCCGC CGTGCGCGCC CAGAAGCTGA AGGCCGGCGA ATGCCACCTG ATCCCCTATC CGAATGCAGC CGACGTACCC GAACTGAAGA AGGATGAAAA TCTGACCGTT CTGGAACAGG CCGGCCTCAA TGTCGGCTTC CTCGCTTACA ACACCCAGAT GGCCCCGTTC GACAAGCCGG AAGTTCGCCG TGCGCTGAAC ATGGCGATCA ACAAGCAGGC GATCATCGAC GCCGTCTTCC AGGGTGCCGC GGCCGTTGCC AAGAACCCGA TCCCGCCGAC GATGTGGTCC TATAACGACG CCGTTCAGGA CGACAAGTAC GATCCGGACG CTGCCAAGAA GGCTCTTGCC GATGCTGGCG TCAAGGATCT CAGCATGAAG ATCTGGGCAA TGCCGGTGTC GCGTCCCTAC ATGCTGAACG CGCGCCGCGC CGCCGAACTG ATGCAGGCGG ATTTCGCCAA GATCGGTGTC AAGGTCGAGA TCGTCACCCA TGAATGGGCC GAATATCTGA AGCTCTCCTC CGACGTGAAG CGCGACGGCG CCGTCATCCT CGGCTGGACC GGCGACAACG GCGACCCGGA CAACTTCATG GATACGCTGC TTGGCTGCGA TGCCGTCGGC GGCAACAACC GTGCTCAGTG GTGCAACAAG GAATATGACG ACCTGATGAC CAAGGCCAAG CTGACGGCCG ATGTCGGCGA GCGCACCAAG GCCTATGAGC AGGCGCAGCT GATCTTCAAG AAGGAAGCTC CCTGGGCAAC CATCGACCAT TCGCTCGTCT TCGTTCCGAT GAGCAAGAAG GTCTCGGGCT TCCAGATGGA CCCGCTCGGC ATTCACCGTT TCGACGGCGT CGACGTATCC GAATAA
|
Protein sequence | MKKISVLLAA TALISVMATS AWSKTLVYCS EGSPEGFDPS LYTAGTTFDA SSRTVYSRLV EFKHGGTEIE PGLADSWSVS ADGTEYTFKL HPGVKYQTTD FFTPTRDFNA DDVVFSFERQ LKSDNPWNKY VEGGSYEYAA GMGFPELIKS VEKVDDLTVK FTLNHPEAPF LADLAMDFAS IVSKEYADKL AADGKMAQLN QQPLGTGPYT FVAYQPDAVI RYKANETYFK GKEKIDDLVF AITSDAAVRA QKLKAGECHL IPYPNAADVP ELKKDENLTV LEQAGLNVGF LAYNTQMAPF DKPEVRRALN MAINKQAIID AVFQGAAAVA KNPIPPTMWS YNDAVQDDKY DPDAAKKALA DAGVKDLSMK IWAMPVSRPY MLNARRAAEL MQADFAKIGV KVEIVTHEWA EYLKLSSDVK RDGAVILGWT GDNGDPDNFM DTLLGCDAVG GNNRAQWCNK EYDDLMTKAK LTADVGERTK AYEQAQLIFK KEAPWATIDH SLVFVPMSKK VSGFQMDPLG IHRFDGVDVS E
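
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates that relationship on short fragments taken from the two ends of the gene. It assumes that translation table 11 assigns amino acids to codons in the same way as the standard genetic code (the two tables differ in their permitted start codons), and the helper name `translate` is hypothetical rather than part of any database tool.

```python
from itertools import product

# Codon table in the conventional TCAG ordering.
# Assumption: table 11 uses the same codon-to-amino-acid assignments
# as the standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is removed first, because the record prints the sequence
    in space-separated blocks of ten bases.
    """
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
first_bases = "ATGAAAAAGA TCTCTGTCCT GCTGGCAGCG"
last_bases = "GACGTATCC GAATAA"

print(translate(first_bases))  # MKKISVLLAA
print(translate(last_bases))   # DVSE
```

Both outputs match the corresponding ends of the protein sequence in the record. Passing the complete gene sequence to the same function would be the natural way to check the full 531-residue product.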