Gene Information |
Locus tag | Rleg2_5668 |
Symbol | |
ID | 6977059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | + |
Start bp | 60142 |
End bp | 61815 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643393125 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002277943 |
Protein GI | 209546053 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
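The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below is an illustration, not part of the record. It assumes that Start bp and End bp are 1-based inclusive positions and that the 1674 bp gene length includes the stop codon; the variable names are made up for this example.

```python
# Values copied from the Gene Information table.
start_bp = 60142
end_bp = 61815
gene_length_bp = 1674
protein_length_aa = 557

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons; the final codon is the stop
# codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(remainder == 0)                                # True
print(expected_protein_length == protein_length_aa)  # True
```

All three checks hold for this record: 61815 - 60142 + 1 = 1674, and 1674 / 3 - 1 = 557.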
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTCACA ACATTTCCGT GCACGCCGGC TTAGCAGCGG CAACCGCGCT GACGGCGCTC GTCGCTTTCG GGGCGGGCAG TGCGGCGGCT GAATCGGTGC TGACGATGCA CATCGAAGAG CAGACCAGTT GGGTTCAGAA CTTCAATCCG TTCGATCTTG CCGGCCGTCG GCAGAGCACG ATGGATTTCA TCTACGAGCC GCTGGTCATC TTCAATGCCG AGGATGGCGG CAAGCCGGTT TTCCGCCTGG CGACCGCTTA CAAATTCTCC GAGGATATGA AATCGGTCAC CTATACGCTG CGACCCGGTG TGAAATGGTC CGACGGCCAG CCGCTGACCT CGGCCGATGT GAAATATACG ATCGACCTGA TGCTGAAGAA CGCAGCCCTC GACACGGTCG GCGTCGGTCA AACGGTTGCC TCGGTCGAGA CGCCGTCGGC GACCGAGGTG AAGGTCGATC TCAAGGCCGT CAACTCCGAT TTCCCGGAAA CGCTGGCGGA CCTCGCCATC GTTCCCGAGC ATATCTGGAA GGACGTGTCC GATCCCGTTG CCTTCAAGAA CGAGAAGCCG GTCGGTTCCG GCCCGATGAC CGAGCTGCGC CGCTTCACGC CGCAGGTCTA CGAGCAGTGC CGCAACCCGA ACTACTGGGA TGCCGCCTCG CTGCATGTCG ATTGCCTGAG ACTGCCGCAG ATCTCCGGCA ACGACCAGAT GCTTGCCATC CTGCCTGAGG GCAATATGGA CTGGATCGGC TCCTTCATTC CCCAGATCGA CAAGACTTTC GTCGGGCTCG ATGCCGACCA TAACGGCTAC TGGCAGCCGC CGGCCGAAAC CGTCGCTTTC CAGATGAATT TCAAGAGCGG CAATGACGGC AACCTCGAGG CCTATAAGGA CCTTAACTTC CGCCATGCCT TCAGCCTCGC GATGGATCGC AAGTCCATGG TCGATATTGC CGGCTTCGGC TATCCCGTCG TCAACGAACA TGCGACCGGC CTGCCGCCGC GTTTCGAAAG CTGGCGCAAC AAGGATGCCG AGGGCGGCAA GGACGCCTTC ATGGGCTTCG ACACCGAAAA GGCCAGCAAG ATTCTCGACG ATGCCGGCTA CAAGAAGGGC GCCGACGGCT TCCGCACGAC GCCGAGCGGC AAACCGATCG CCTTCCCGAT CATCGTTCCG AACGGCTGGA CGGACTGGAT CGATGCGGTC CAGATCGCGG TCGAAGGCCT GCGCGCCGCC GGCATCAATG CGTCGGTCGC CACGCCCGAA TATGAACAAT GGCGCAAAGA GATCATCGAC GGCAGCTTCG AGGTCGTCAT GAACTCCCGC GCCGACGGCG CAACGCCGTT CCGCGGCTAT TACCAGAGCC TTTCCACAGC CTATGGCGGG CGCATCACCG GCGCGCCCTC GCGTTATTCG AACCCGAAAC TGGACGCGCT TTTCGATCAA TATCTGCAGG CAACGTCGGA CGACGATCAC AAGAAGATCT TCAACGACAT TCAAATGCTG ATCGCCGACG ACTTCCCCGT CGTTCCCGTC TTCAACGGGC CGACCTGGTA TCAGTTCTCC AGCAAGCGCT TCACCGGCTG GGTCACCGAC AAGGATCCGG TGATGAATCC CGAGGATCAC GACAACAACC GCATGCGCCT GATGCATCTC TTGCGTCTCA AGCCGGTCAG CTAA
|
Protein sequence | MFHNISVHAG LAAATALTAL VAFGAGSAAA ESVLTMHIEE QTSWVQNFNP FDLAGRRQST MDFIYEPLVI FNAEDGGKPV FRLATAYKFS EDMKSVTYTL RPGVKWSDGQ PLTSADVKYT IDLMLKNAAL DTVGVGQTVA SVETPSATEV KVDLKAVNSD FPETLADLAI VPEHIWKDVS DPVAFKNEKP VGSGPMTELR RFTPQVYEQC RNPNYWDAAS LHVDCLRLPQ ISGNDQMLAI LPEGNMDWIG SFIPQIDKTF VGLDADHNGY WQPPAETVAF QMNFKSGNDG NLEAYKDLNF RHAFSLAMDR KSMVDIAGFG YPVVNEHATG LPPRFESWRN KDAEGGKDAF MGFDTEKASK ILDDAGYKKG ADGFRTTPSG KPIAFPIIVP NGWTDWIDAV QIAVEGLRAA GINASVATPE YEQWRKEIID GSFEVVMNSR ADGATPFRGY YQSLSTAYGG RITGAPSRYS NPKLDALFDQ YLQATSDDDH KKIFNDIQML IADDFPVVPV FNGPTWYQFS SKRFTGWVTD KDPVMNPEDH DNNRMRLMHL LRLKPVS
|
| |
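The sequences are printed in space-separated blocks of ten characters, so they need to be joined before use. The following is a minimal sketch of joining the blocks and translating them. It assumes the usual codon assignments of translation table 11, which are the same as the standard code for every codon (table 11 differs in which codons may serve as starts, and that does not matter here because the gene begins with ATG). The helper names `join_blocks` and `translate` are hypothetical, and only the first three and last three blocks of the gene sequence are used as input.

```python
# Codon table built from the conventional TCAG ordering used in NCBI
# genetic-code listings; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text):
    """Remove the whitespace between the ten-character display blocks."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate complete codons from position 0; trailing bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three and last three blocks of the gene sequence in this record.
head = join_blocks("ATGTTTCACA ACATTTCCGT GCACGCCGGC")
tail = join_blocks("TTGCGTCTCA AGCCGGTCAG CTAA")

print(translate(head))  # MFHNISVHAG
print(translate(tail))  # LRLKPVS*
```

The two outputs match the first block (MFHNISVHAG) and the last seven residues (LRLKPVS) of the listed protein sequence, followed by the TAA stop codon. The tail excerpt is 24 bases long and starts 1650 bases into the gene, so it is in frame.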