Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5891 |
Symbol | |
ID | 6977280 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | + |
Start bp | 301634 |
End bp | 303538 |
Gene Length | 1905 bp |
Protein Length | 634 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643393346 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002278164 |
Protein GI | 209546274 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACAGT TGAAGAGGCT CGCATTTGGC GTCGCGCTGG CCGCACTCGG CCTGACGGCG GCGGCCAAAG CCGATGACTA TACCTCATTG CCGCGCAAGG AGACGCTCAT AGTCGAAAAT CCGGAAGGGA CGATCAAAAA TCCCGGCTGG TTCAACATCT GGGTCAATGC CGGCGCCGGT GTCTCCACCG GTCTGCAGCA GCTGACCATG GATACGCTCT GGTATATCGA CCCCGAACAA GGGCTCGGCG GCGCGACCTG GGATAATTCG CTGGCCGCCG ACAAGCCGCA ATATAATGCC GACTTTACCG AAATGACCGT GAAACTGCGC AAGGGGCTCT TCTGGAGCGA CGGCGTCGAG TTCACGGCCG ACGACGTGGT CTATACCGTC AAGACGCAGA TGGATCATCC CGGCATGGTC TGGAGTGCAG CCTTCTCGGT GCAGGTGGCA AGCGTCGAGG CGACCGATCC TCAGACCGTG GTGTTCAAGC TGAAGAAGCC CAATTCGCGC TTCCACGCCC TTTTCACCGT TCGCTGGAAC GGCGCATGGA TCATGCCCAA GCATGTGTTC GAGAAGGTCG CCGATCCGCT TCGCTACGAT TTCGCCAATC CGGTTTCGCT CGGCGCCTAC AAGCTCAAGG CCTACGATCC TCAGGGCAAG TGGTACACCT GGGAGAAACG CGACGACTGG CAGAAGACAT CGCTTGCCCG CTTCGGCGAG CCGGCCCCGA AATATGTAAC TTACGTCGAC CCCGGCCCGC CGGATAAACG CACCATCGCC CAGCTCGAGC ACAATCTCGA TATCATCCAC GACAATACGC CTGAGGGCAT GTTCACCCTC AAGGAGAAAT CCAAGTCGGT CGAGACCTGG TTCCCGGGCT TCCCCTTCGC CCATCCGGAT CCGACGCTGC CGGCTGTCAT TTTCAACACC CAGGATCCGA CCTTCAACAA TCCTGACGTG CGCTGGGCGC TGGCCCTGCT GATCGACATC AAGGCCGTCG ACATGGCGAG CTATCGCGGC ACCGCCACGC TCTCGGCACT CGGTGTGCCG CCGACGGCGA TCGCCATGAA AGACTATCAG GCGCCGATGC AGGATTGGCT GAAGGATTTC GAGATCGACA CCGGCAAGCA GAAGATCAAG CCTTATGACC CGACGATCGG GCAGCAGGTC GCCGATCTCC TGCGCAAGCA GCCGAAGTTC AAGGATCAGA TCCCGACCGA TCCTAAGGCG ATCAGCACGG CCTTCGGCTA TGGCTGGTGG AAGCCCAACC CGCAGGCAGC CGCCGAGCTG CTTCAAAAGG CGGGCTTCAA GAAGAGCGGC GGCAAATGGA TGACCCCTGA TGGCCAGCCG TTCAGGATCC GGATGACCGT CGAGGGCGAC ACCCGCTCCG TCTTCACCCG GGCAGGCACG CTGATCGCCC AGCAATGGGC CGCCTTCGGC ATCGATGCCA AAGCCGTACC GACGACCAAC CTGTGGCAGG TTGCACTGCA GCCTGGCGAT TTCCAGGTGG CGATTGCCTG GAGCGTCGAA ACCTGGGGCG GCGATCCCGA CCTGTCCTTC TTCCTCGACA GCTGGCACTC GCAGTTCGTG GCCAAGAAGG GTGAGAACCA GCCGCCGCGC AACTGGCAGC GCTGGAGCAA TCCGGAGCTC GACAAGATCA TCGAGACCAT CCGCGGCATC AGCGCCGACG ATCCGAAGGG CATCGAACTC GGCAAGGATT ATCTGAAGCT GGTTGCCCGC GAAATGCCGA CGATCCCGCT GATGTCCTAT AACGTCTTCA CCTCGATGGA TACGACCTAT TGGACCGGTT ATCCAACGAT CAAGGACCCC TATACCGACC CGGTGCCGAA CTGGGCCAAC TCCAGGCTGA TGATGGTCAA GCTGAAGCCG GCCCAACCGA AATAA
|
Protein sequence | MQQLKRLAFG VALAALGLTA AAKADDYTSL PRKETLIVEN PEGTIKNPGW FNIWVNAGAG VSTGLQQLTM DTLWYIDPEQ GLGGATWDNS LAADKPQYNA DFTEMTVKLR KGLFWSDGVE FTADDVVYTV KTQMDHPGMV WSAAFSVQVA SVEATDPQTV VFKLKKPNSR FHALFTVRWN GAWIMPKHVF EKVADPLRYD FANPVSLGAY KLKAYDPQGK WYTWEKRDDW QKTSLARFGE PAPKYVTYVD PGPPDKRTIA QLEHNLDIIH DNTPEGMFTL KEKSKSVETW FPGFPFAHPD PTLPAVIFNT QDPTFNNPDV RWALALLIDI KAVDMASYRG TATLSALGVP PTAIAMKDYQ APMQDWLKDF EIDTGKQKIK PYDPTIGQQV ADLLRKQPKF KDQIPTDPKA ISTAFGYGWW KPNPQAAAEL LQKAGFKKSG GKWMTPDGQP FRIRMTVEGD TRSVFTRAGT LIAQQWAAFG IDAKAVPTTN LWQVALQPGD FQVAIAWSVE TWGGDPDLSF FLDSWHSQFV AKKGENQPPR NWQRWSNPEL DKIIETIRGI SADDPKGIEL GKDYLKLVAR EMPTIPLMSY NVFTSMDTTY WTGYPTIKDP YTDPVPNWAN SRLMMVKLKP AQPK
|
| |