Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4502 |
Symbol | |
ID | 6977596 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 138023 |
End bp | 139603 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643393680 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002278498 |
Protein GI | 209546580 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.146925 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACGTG AAAAGGATCA ATCTTCGCCT CTGGGCATCG GCCGCCGTCA CTTCGTTGGC GGTGCCGTCG CGCTGGGTGC TTTGCCAGTC CTCACCTCCG GCCTCCTGAT GCCGCGCGAT GCCCGCGCGC AGGAAGCGAA GCGCGGTGGT CATCTGAAAC TCGGTCTCAA GGGCGGCGCC ACCAGTGATG CGCTCGACCC CGCGACCTAC AGCGCCTCCG TGTTGTTCGT GATTGGCCGT CTCTGGGGCG ACACACTTGT CGAATCCGAC CCTAAGACCG GTACGCCCTT GCCGTCGCTG GCGACTTCCT GGACGCCATC GGCGGACGCA TCCGTCTGGA CTTTCAAAAT CAGGAACGAC GTGCAGTTTC ACGACGGCAG CAAGATGACC GTCGCGGACA TCGTCGCGAC ACTGAAGCGA CACGCGGACA AGAATTCGCA GTCGGGCGCT CTGGGGCTCA TGGCCTCGAT CACCGGTATC GAAGAAAAAG CGGGCGACCT TGTCCTCACA TTTTCGGAAG GCAATGCGGA CCTGCCTTTG CTTTTGACCG ACTATCACCT AATCATCCAA CCGAAGGGCG GCCTCGACAA GCCTGCGGCG GCCATCGGTA CAGGTCCTTA CATTCTGAAA AGCTTCGAAC CGGGCGTTCA CGCAACCTTC GAGAAAAACC CGAAGGATTG GCGCTCCGAC CGCGGCTTTG TCGACAGTGT CGAGATCCTC GTCATCAACG ACAACACCGC CCGCGTTGCC GCACTTGCCT CCGGCCAAGT CCATTTCGTC AACAATGTTG ACCCCAAGAC AGTCCCGATG CTGCAGCGAG CACCGACCGT CGAAATCCTC CGGAATGCAG GCAAGGGCTT CTATTGTTTC CTGATGCATT GCGACGCGGC TCCCTTCGAC AACACCGATC TTCGGCTTGC GCTGAAATAT GCCATCGATC GTCAGGCGAT CCTCGACAAG GTTCTGGGCG GCTACGGAGT CATCGGCAAC GACTATCCGG TCAATTCCAA CTACGCTCTC GCCCCGACCG ATATCGAGCA GCGCCCTTAT GATCCAGACA AGGCCGCCTT CCACTTCAAG AAGGCGGGTC TCGACCGCTC CATTCAGCTG CTCACGTCAG ACGCAGCCTT TCCGGGCGCT GTGGATGCGG CGATCCTGTT CCAGCAAAGC GCGCGCAAGG CCGGCATCAC GATCGACGTC AAGCGCGAAC CGGAAGACGG CTACTGGACC AATGTCTGGA ACAAGCAGCC CTTCTGCGCC TCGTTCTGGG GCGGTCGTCC GACCCAGGAT TCACGCTATT CGACCTCTTA CCTGTCGACC GCAGAATGGA ACGACACGCG TTTCAAGCGC CCCGACTTCG ACAAATTGGT TTTGCAAGCA AGGTCAGAAC TCGATGAGGC CAAGCGCAAG GTGCTTTATC GGCAATTGGC CCTGATGGTG CGAGACGATG GCGGTCTGAT CCTGCCCGTC TTCAACGACT ACATCATGGC CTCTTCGAAA ATGCTGAAGG GATATGTCGA CGATATTGGC AACGATATGT CCAACGGCTA CATCGGCAGC CGCGTGTGGC TTAATGCCTA A
|
Protein sequence | MTREKDQSSP LGIGRRHFVG GAVALGALPV LTSGLLMPRD ARAQEAKRGG HLKLGLKGGA TSDALDPATY SASVLFVIGR LWGDTLVESD PKTGTPLPSL ATSWTPSADA SVWTFKIRND VQFHDGSKMT VADIVATLKR HADKNSQSGA LGLMASITGI EEKAGDLVLT FSEGNADLPL LLTDYHLIIQ PKGGLDKPAA AIGTGPYILK SFEPGVHATF EKNPKDWRSD RGFVDSVEIL VINDNTARVA ALASGQVHFV NNVDPKTVPM LQRAPTVEIL RNAGKGFYCF LMHCDAAPFD NTDLRLALKY AIDRQAILDK VLGGYGVIGN DYPVNSNYAL APTDIEQRPY DPDKAAFHFK KAGLDRSIQL LTSDAAFPGA VDAAILFQQS ARKAGITIDV KREPEDGYWT NVWNKQPFCA SFWGGRPTQD SRYSTSYLST AEWNDTRFKR PDFDKLVLQA RSELDEAKRK VLYRQLALMV RDDGGLILPV FNDYIMASSK MLKGYVDDIG NDMSNGYIGS RVWLNA
|
| |