Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_6274 |
Symbol | |
ID | 6983347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011370 |
Strand | - |
Start bp | 220856 |
End bp | 222391 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643399283 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002284039 |
Protein GI | 209552123 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0245039 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGTACC TGAATCGCAG ACGTTTCATG CAACTCTCCG CTGCCGTGGC AGCGTCGGGA TTTGCGCCTG GATTTGCAAG GGCCGAAGGT AAGCGCGGCG GCCATCTCCG CGTGGGTTTG GCCGGAGGCT CGTCACAGGA TACCCTTGAT CAGCTCACCT ACGTGTCCGA TGCGACTTGG ATATTTTCCA GCAACGTAAG AAACAACCTC GTTGAAATTG ATGAACTGAA CCAACAGGTG CCTGGTCTCG CGGAGCGCTG GGAGGTTTCA CCGGACGCGA CCCGGTTCTC CTTTTTTATA CGAAAAGGCG TGACCTTCCA CAGCGGAAAA ACTCTCACCG CCGACGATGT CGTTGCCTCT CTCAATATTC ATCGGGGAGA GACATCCCAG TCGCCGGCAA AGGAGGAAAT GCGCGATGTT GTCGACATCA AGGCTGACGG ATCCCGCGTC GATGTCACGT TCTCTACCCC CAACATCGAT TTTCTTAGTC TCGTCACCAC GTTCAATTTT GGTATCCTGC CCGTTGCCGA TGGCAAGATC GACAGGCTGA CGAAGGACGG CACCGGCCCT TACATGCTTG AAAGCTTCGA ACCCGGTCAA AGCATAATCC TGAAACGAAA TCCCAATTAC TGGAAACCGG AGGCGGGCTT TTTCGATACC GCTGAAGTGA CGTTCATCGA AGATGATGCC GCTCGCATGA ACGCTATTAG GACAGGCTTG GTCGACGTCG TGAACAAGGT CGACTTAAAG ACCGCATCGG TGCTGAAGCG CGTCAAAGGT ATCCGGGTGG AGGATATCAA GACTGAGCAA TTCAACTCCT TCGCCATGAT GATCGACACC GCCCCGTTCA ATGACAACAA TATCCGGCTG GCATTGAAAT ACGGGGTCAA CCGCGAGGAA CTGGTCAAAA AAATTCTCCT AGGTTACGGC TCGATCGGCA ATGACCATCC GGTCGGCGTC ACCAACAAGT TTTTCAATTC ACAAATACAG CAGACCGAGT TCGACGCCGA CAAGGCGAAA TACTACTTGA AGCAAGCTGG CCTCACGAGG CTCGACGTGT CGCTCAGCGC CTCAGATGCA GGCTTTCCCG GCGCCGTCGG ATCCTCCTCC CTTTACCAGT CGTCCGCTGC GGCGGCGGGC ATCAACATCA ACGTCGTCCG GGAACCCAAT GACGGCTTCT ATGAGAATGT CTGGTTGAAG AAGCCGTTTG CGACTGTCTT CTGGGGGAAG CTCGCATCGG TCGGGCTGCA GTTTTCGCAA GCGTATCTGC CCGGAGCCAC GTGGAACGAA ACCCATTGCA ATCTACCGCA GGTCACGGAG TTGATCCGCA CCGCCCGCGG AATCGTGGAC GAAACGAAGC GAGGCGAGAT CTATCACGAA CTCCAGTCGG TCATTCACGA ACAAGGGGGA TCGATCATCC CGATGTTCAC CAATTTCGTC TGGGCCGTGC GAGACAACGT GCAGCACGGA CCGAACTTGC AGAACGATCT GACACTGGAC GGTCTGAAAT GCTTTCAGCG TTGGTGGTTC GCCTAG
|
Protein sequence | MQYLNRRRFM QLSAAVAASG FAPGFARAEG KRGGHLRVGL AGGSSQDTLD QLTYVSDATW IFSSNVRNNL VEIDELNQQV PGLAERWEVS PDATRFSFFI RKGVTFHSGK TLTADDVVAS LNIHRGETSQ SPAKEEMRDV VDIKADGSRV DVTFSTPNID FLSLVTTFNF GILPVADGKI DRLTKDGTGP YMLESFEPGQ SIILKRNPNY WKPEAGFFDT AEVTFIEDDA ARMNAIRTGL VDVVNKVDLK TASVLKRVKG IRVEDIKTEQ FNSFAMMIDT APFNDNNIRL ALKYGVNREE LVKKILLGYG SIGNDHPVGV TNKFFNSQIQ QTEFDADKAK YYLKQAGLTR LDVSLSASDA GFPGAVGSSS LYQSSAAAAG ININVVREPN DGFYENVWLK KPFATVFWGK LASVGLQFSQ AYLPGATWNE THCNLPQVTE LIRTARGIVD ETKRGEIYHE LQSVIHEQGG SIIPMFTNFV WAVRDNVQHG PNLQNDLTLD GLKCFQRWWF A
|
| |