Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_6448 |
Symbol | |
ID | 6983519 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011371 |
Strand | - |
Start bp | 110113 |
End bp | 111750 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643399445 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002284201 |
Protein GI | 209552286 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00292129 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.318047 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTTTG ATCCATCTGA TAAAACCAAT CTATCGCGCC GCAACGCGCT GAAGCTCGGT CTGGCGGCCG GCGTCGGCCT CACCGTGTTC GGGATGAATG CCCGCATCGT GATGGCCGAT GAAGGCCAGG TCCTGAAGGT CGCACATCCG GCCTTCGACC AGGACTGGTC GCCGCTGCGC GGCGGCGGCA GGACGTTCCG CTGGAATTCG ATCTGGTGGG CTTCGCCGAT GTATTTCGAC AGCCAGGGCA ATATCAAGCC TTACGTCTTC GCCAGCTGGG AATCGGCCGA CAACACGGTG TGGACCTTCA AGATCGACCC GAAGGCCGTC TTCTCCGACG GCAGCAAGAT CACCTCGGCC GACGTCAAGG GATCGTGGGA AGTCGCCTCG ATGCCGAACA CCAAGAGCCA GCGCGCCGAC CAGGTGCTGA GCAAGGTCAA GGGTTACGCC GAAATCGCCG CCGGTTCCGG CAAGGAGCTG ACCGGTGTGG CGACTCCTGA TGAGGGAACA GTCGTGGTGA CGCTCGCCGC TGCCGATCCG ATCTTCTTCA TGCGTCTCGC AAACCACATC GCGCCGATCA CCAAAGCGTC GCAATCGCGC GGCAGCGACG GCGAGGAAAT CATCGACTGG TATAAGCCCG AAAACAAGCC GGTCTTCTCC GGCCCCTTCA AGCTGACGAG CATCGATATC GATGCCGGCA AGATCACATT CGAGCCGAAT GAAAACTTCT TTGGGTCGAA GCCGAAGCTT GCCCGCATCG ACATCACCTC GATCGAGGAC AATGTGACGG CGACCTCGCT GATCAAGTCC GGCGAGTTCA ACGCCCATAC CGAACTCGTT ACCTCGACGA TCATCCAGGA TCTCGGCCCA GAATTCTCGG CCGGCCCGCT GATCCCGACC AGCCAGCACT TCTGGTTCAA CATCTCCCGC GCGCCGATGG ACGATCCGAA GGTCCGCCAG GCGCTGATCA TGGCGGTCGA TCGCGACGGC CTGTTCAAGG CGTCCTATCC CGATGGGCCG CACAAGAAGG CCGATCAGAT CCTCAATTCG GTTCCCGGCG CCGACAATTC CGGCTTCGAG TCCTTTCCCT ATGATCCGGC AGCCGCCAAG AAGCTGCTTG CCGAATCGAG CTATGGCGGG CCCGAGCGCC TGCCGAAGAT CCTGTTCGTC GGCATTTCGG CGCCGGCCAT TCAGGCCGCC GCCCAGTTCA TCGCCGAGCA GTGGCGCCAG AATCTCGGCA TCACGGCCGT CGACATGAAA CCGCAACAGG ACGCCTATGC CGGCCCGGAC CAGAACTCGG TGCAAATCTT CCGCGACGAC GTCGGCACCC GTGTCCCCGA CGCCGTTTCG TATCTGGCGG GCAGCATCGC CTCGACCTCG TCGAACGCGC AGAACAAGCT CGGCGGATAC AAGAACGACA AGGTCGACAG CGCCCTTGCC GAAGCGGCGA CCAAGGCTGC GGACGATCCG CAGCGCATCT CTCTCGCCCA GGAGGCCCAG AAGGCGTTCC GCGACGATTG GGCCTTCATC CCGTGGTATT CTCAGGCGAT GTCGCGCTGG GCCACCAAGG AGGTCAAGGG CATGGAGAAG AACCTCGACT GGCAGATAGC CGAACCCTGG AACATTTCGA TCGGTTGA
|
Protein sequence | MSFDPSDKTN LSRRNALKLG LAAGVGLTVF GMNARIVMAD EGQVLKVAHP AFDQDWSPLR GGGRTFRWNS IWWASPMYFD SQGNIKPYVF ASWESADNTV WTFKIDPKAV FSDGSKITSA DVKGSWEVAS MPNTKSQRAD QVLSKVKGYA EIAAGSGKEL TGVATPDEGT VVVTLAAADP IFFMRLANHI APITKASQSR GSDGEEIIDW YKPENKPVFS GPFKLTSIDI DAGKITFEPN ENFFGSKPKL ARIDITSIED NVTATSLIKS GEFNAHTELV TSTIIQDLGP EFSAGPLIPT SQHFWFNISR APMDDPKVRQ ALIMAVDRDG LFKASYPDGP HKKADQILNS VPGADNSGFE SFPYDPAAAK KLLAESSYGG PERLPKILFV GISAPAIQAA AQFIAEQWRQ NLGITAVDMK PQQDAYAGPD QNSVQIFRDD VGTRVPDAVS YLAGSIASTS SNAQNKLGGY KNDKVDSALA EAATKAADDP QRISLAQEAQ KAFRDDWAFI PWYSQAMSRW ATKEVKGMEK NLDWQIAEPW NISIG
|
| |