Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5398 |
Symbol | |
ID | 8007356 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 813501 |
End bp | 815021 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644822302 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002973562 |
Protein GI | 241113727 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAGA TGAAGGCAAT TGGTGCGCTG AGCATCGGAC TGGCATTTCT GCTGGGTCCG GGCAGCGCAA TTCGCGCCGA CGCTGCCTCG GACAAGCTGA CGGTCGTGGT GACCGACGAA CCGAAGTCGC TCGATCCCTG TGACACCGAC CTTTCGGGCA ATTCTCGCAT TCTGCACAAC AACATCACCG AAGCGCTGGT CAATCTGAGC CCGGCCGACG GATCGGTCGT CCCGAGCCTT GCCGCCAGCT GGCGGCAAGT GGACGAGCTG ACCTGGGAAT TCAAGCTCCG CGACGACGTC ACCTTCCACG ACGGCAAGGC GTTTGATGCC AGTGCAGTCG TTGCCGCTCT CAAGCGGGCG CAGGATCCGG CACTGGCCTG CGAAGTCGGA CTTGCGACGC TGAAGGGCGT CAAGTTCAAT GCAGAAGCGG TGAACCCCAC CACCCTCCTC ATCAAAACGG ATATCGTCGA GCCGATCCTG CCGAACAAGA TGTCCGCCGT GGACATCGGT TCGCCGGCGA CCCCGAACGA CGGCAAGTCG CGCTCCCCGG CCGGAACCGG ACCTTACAAG CTTGCAGCGT GGACGCCCGG GCAATCGGTC GACCTCGTGG CCTATGACGG CTATTGGGGC GACAAGCCGG CGATCAAGAA CGCGACCATC ATCTGGCGCG CCGAATCCGC TGTTCGCGCG GCAATGGTCG CCACCGGTGA GGCGCAGATC GCCTATGAAA TAGCGCCCCA GGACGGCACG TCGGAACAGG ATCATGCCTT CCCGAACGCC GAAACCTCGC TGCTGAGGAT CGACGCAGAA ATTGCGCCAC TCAACGACAA ACGCGTGCGC GAAGCCCTTA ATCTTGCGAT CGACCGGGAC GGACTTGTCG GCACCATCTT CCACCAGGAT GCCCAAAAGG CGATGCAGGC GGTGCCGCCG TCCGTCTTCG GCTTCAATCC CGACATCCCC GTCTGGACGT ATGATCCCGA AAAAGCGAAG TCCCTGCTCG CCGCGGCGAA AGCCGATGGC GTGCCGGTTG ACAAGGAAAT CGTCATCTAC GGCCGCATCG GCATCTATCC CAATTCGTCC GAAAGCCTGG AAGCCATTCA GGCGATGCTC GCGGATGCAG GTTTCAATGC CCGGCTCGAA ATGCTTGAAA CAAGCCCGTG GCTGAAGAAG CTTCTTAAGC CCTGGGACAA GGAACGTCAG CCGTCGATCC TGCAGACGCA GATCGACAAC ACCGAAGGCG ACGCCGTATT CACGCTGCCG AACCGTTTTA CTACCGACGG CAACCAGTCG ACCATCGCCG ATGCCAAACT CGACACATTG ATTACCGACG CGTCGAAGGC AACCGGCGAC GAACGCAGGA AACTGTTCGA GGAGGCCTTC AGCTACATCG CCGTCGATGC GGTCAATATC GTGCCGCTGT TCCACATGGT CACGATTGCC CGCGTCGCCG AGAACGTCAC CTACACGCCC GATGTGCAGG CCGGCAACGA GATCAAGCTT AAATCGATCA GCTACCGCTG A
|
Protein sequence | MKKMKAIGAL SIGLAFLLGP GSAIRADAAS DKLTVVVTDE PKSLDPCDTD LSGNSRILHN NITEALVNLS PADGSVVPSL AASWRQVDEL TWEFKLRDDV TFHDGKAFDA SAVVAALKRA QDPALACEVG LATLKGVKFN AEAVNPTTLL IKTDIVEPIL PNKMSAVDIG SPATPNDGKS RSPAGTGPYK LAAWTPGQSV DLVAYDGYWG DKPAIKNATI IWRAESAVRA AMVATGEAQI AYEIAPQDGT SEQDHAFPNA ETSLLRIDAE IAPLNDKRVR EALNLAIDRD GLVGTIFHQD AQKAMQAVPP SVFGFNPDIP VWTYDPEKAK SLLAAAKADG VPVDKEIVIY GRIGIYPNSS ESLEAIQAML ADAGFNARLE MLETSPWLKK LLKPWDKERQ PSILQTQIDN TEGDAVFTLP NRFTTDGNQS TIADAKLDTL ITDASKATGD ERRKLFEEAF SYIAVDAVNI VPLFHMVTIA RVAENVTYTP DVQAGNEIKL KSISYR
|
| |