Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4491 |
Symbol | |
ID | 8015253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4622177 |
End bp | 4623706 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644827067 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002978268 |
Protein GI | 241207172 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAAGT TTTCGCTTGC GCCTTCAGCG CGTTTCGCCC GGCGGCTTTC GCTCGGTGCC GCACTTTCGG CCGGCCTGGT GATGACGGCG ATGACGCCGG CCGAGGCGGC AAAGACTACC CTCAATCTCG GCATGAGCGT CGAGCCGGCC GGTCTCGACC CGACGATCGC AGCACCGGTC GCGATCGGCC AGGTGACCTG GCAGAACGTG TTCGAAGGGC TGGTGACGAT CGACCAGTCC GGCAAGATCC AGCCGCAGCT GGCAAAAAAC TGGGAGATCT CTCCCGATGG CCTGACCTAT ACGTTCAAGC TGCAGACCGG CGTCAAATTC CATGACGGCG AGGCCTTCGA TGCCACTGCC GCCAAGTTTT CGCTCGACCG TGCCCGTGGC GCCGATTCGG TCAATCCGCA GAAGCGCTTC TTCGCTTCGA TCGCCTCGAT CGATACGCCG GATGCCGAAA CGCTGGTGCT GCATCTCTCT GCGCCGACCG GCAGCCTGAT CTACTGGCTC GGCTGGCCAG CCTCTGTGAT GGTCGCACCG AAGACGGCTG CCGACGACAA GACGACGCCA GTGGGGACCG GCCCCTTCAG TTTCGCCAGC TGGGCGAAGG GCGACAAGGT CGAACTCACC AGGAATGCCG ATTATTGGAA CAAGGATGCG GCCGCCAAGC TCGACAAGGT GACCTTCCGC TTCATCGCCG ATCCGCAGGC GCAGGCGGCA GCGCTGAAAT CCGGCGATCT CGATGCCTTT CCGGAATTTG CCGCGCCTGA GCTGATGAGT TCTTTCGACG GCGATGCGAG GCTCGTCACC AAGATCGGCA ATACCGAGCT CAAGGTCGTT GCCGGCATGA ACACTGCCAA GAAGCCGTTC GACGACAAAC GCGTCCGCCA AGCGCTGATG ATGGCGATCG ACCGCAAGAC GGTGATCGAC GGCGCATGGT CGGGCCTCGG CACGCCGATC GGCAGCCACT ACACGCCGAA CGATCCGGGC TATCAGGACA TGACAGGCGT GCTGCCTTAC GACGTCGAGA AGGCGAAGGC GCTGCTTGCC GAAGCAGGCT ACCCCAACGG TTTCACCTTC ACGATCAAAT CGCCGCAGAT GGCTTATGCG CCGCGCAGCG CCCAGGTGAT GCAGGCGATG TTTGCCGAGA TCGGCGTGAC GATGAATATC GAGCCGACGG AATTTCCGGC GAAATGGGTC CAGGACATCA TGAAGGACCG CAACTTCGAC ATGACGATCG TCGCCCATGC CGAACCGCTC GACATCGACA TCTATGCGCG CGATCCCTAT TATTTCAATT ATAAGAACCC CGCTTTCAAC GCGCTGATGA AGAAGGTTCA GGAGACGACC GATCCCGCCG CGCAGAATGC GATCTATGGC GAAGCGCAGA AGATCCTCGC CGAGGACGTG CCGGCGCTCT ACCTCTTCGT CATGCCGAAA CTCGGCGTCT GGGACAAGAA GCTGAAGGGC CTGTGGGAGA ACGAGCCTAT CCCTTCCAAC GTGCTGACTG GTGTTTCCTG GGACGAGTGA
|
Protein sequence | MIKFSLAPSA RFARRLSLGA ALSAGLVMTA MTPAEAAKTT LNLGMSVEPA GLDPTIAAPV AIGQVTWQNV FEGLVTIDQS GKIQPQLAKN WEISPDGLTY TFKLQTGVKF HDGEAFDATA AKFSLDRARG ADSVNPQKRF FASIASIDTP DAETLVLHLS APTGSLIYWL GWPASVMVAP KTAADDKTTP VGTGPFSFAS WAKGDKVELT RNADYWNKDA AAKLDKVTFR FIADPQAQAA ALKSGDLDAF PEFAAPELMS SFDGDARLVT KIGNTELKVV AGMNTAKKPF DDKRVRQALM MAIDRKTVID GAWSGLGTPI GSHYTPNDPG YQDMTGVLPY DVEKAKALLA EAGYPNGFTF TIKSPQMAYA PRSAQVMQAM FAEIGVTMNI EPTEFPAKWV QDIMKDRNFD MTIVAHAEPL DIDIYARDPY YFNYKNPAFN ALMKKVQETT DPAAQNAIYG EAQKILAEDV PALYLFVMPK LGVWDKKLKG LWENEPIPSN VLTGVSWDE
|
| |