Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5907 |
Symbol | |
ID | 6977294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | - |
Start bp | 324247 |
End bp | 326337 |
Gene Length | 2091 bp |
Protein Length | 696 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643393360 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002278178 |
Protein GI | 209546288 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAAAT TCCGCAAACT TGGCATTCTG GCCGGACTGG CGCTTGGCGT CTCGGTCCTG GCGCTGAATG CATATGCCTC CGAGGCCACC GTTCCGCCGG CGCCGGCGGA CTTCCCCGCC GAAGGTAAGA TCAAATATGT GGCGCGCGAC TCCATTCTGG AGTTCAAGGC GCTGCCCGAA TATCACGAGC CCGACTGGGT CACGAAGAAT TTCGTCGCCA CCGGCAAGCT GCCGGCGGTC AAAGACCGCC TGCCGAAGGA GCCGATGGTC TTCAAGACCG GCAACATGCC CGACGGCATC GGTGTCTATG GCGATACGAT GCGCCACGTC ATCGGCGGCC GGCCGGAAGG CTGGAACTAT GGCGCCGGCC AGACGCAGGG CTGGGGCGGC ATCGATATCG GCCTGTCCGA ATGCCTGACG CGCACCGCGC CGCTGTTTCA GGTGGAGGCC AAGGACACCG AGCCGCTGCC GAACCTTGCC AAGAGCTGGG ATTGGTCCGA GGACGGCCAC AAGCTGACCA TGCACCTGGT CGAAGGCGCC AAATGGTCCG ACGGTGCGCC GTTCAATGCC GACGACATCA TGTTCTACTG GGATGACGAA GTCGTCGATC CGAACGTCTC GCCGCTCGGC GGCGGCGCCT CGCCTGAGGC TTTCGGCGTC GGCACGACGC TGAAGAAGAT CGACGATTAC ACCGTCGAAT GGACCTTCAA GGATGCCTTC CCGAAGCAAT ATCTCTACAC GATGGCCTAC CCGAACTTCT GCCCGGGTCC GTCGCATATC CTGAAACCGC AGCATCCGAA ATATTCGAAG AACACCTACG ACCAGTTCAA GAACGCCTTC CCGCCGGAGT TCATGAACAT GCCGGTGATG GGCGCCTGGG TGCCGGTGGA ATATCGCTCC GACGACATCA TCGTCATGCG CCGCAACCCC TATTACTGGA AGGTCGACGA GAAGGGCAAT CAACTGCCCT ATCTGAACGA GCTGCACTAT AAGCTCTCGA CCTGGGCCGA CCGCGACGTT CAGGCAGTGG CGGGCTCCGC CGACATCTCC AACCTCGAGC AGCCGGAAAA CTTCGTCGCC TCGCTGAAGC GCGCCGCCGA GAAGACCGCG CCGGCCCGTC TGGCTTTCGG CCCGCGCCTC ATCGGCTATA ACCTGCGCAT GAACTTCTCC GCCAACGGCT GGGGCAACCC GGACGAACGC GGCCAGGCGA TCCGCGAGCT GAACCGCAAC GAGGACTTCC GCAAGGCCGT CACCATGGCG CTCGACCGCA AGGCGATCGG CGACTCGCTG GTCAAGGGGC CGTTCACCGC GATCTATCCG GGCGGGCTCT CTTCCGGCAC CAGCTTCTAC GACCGCAACT CCACCGTCTA CTATCCCTTC GATCTCAAGG GCGCCAAGGA GGAACTGGCC AAGGCCGGTC TCAAGGATAC GGATGGCAAC GGCATCGTGA ACTTCCCGGC CGGCACCGCC GGCGGCAAGG ATGTCGAAAT CGTCATGCTG ATCAACAATC AGTACACCAC CGACAAGAGC CTTGCCGAAG GTGTCGTCGC CCAGATGGAG AAGCTCGGCC TCAAGATCGT CATCAATGCG CTCGACGGCC CGAAGCGGGA TGACGCTCAT TATGCCGGCC GCTTCGACTG GCTGATCCAG CGCAACACGA CGGAACTGTC GTCGGTGGTG CAGAATACCG AGCAGCTTGC GCCGATCGGT CCCCGCACCA GCTGGCACCA CCGCGCCGGC AAGGATGACC AGCTCGATCT GATGCCGTTC GAAAAGGAAC TGAACGATAT TGTCGCCAAG TTCACGACCA GCCCGGACAA TGATGCCCGT GTCGCCCTGA TGAAACAGTA CCAGAAGATC TCGACGGAGC ACGTCAACAC CGTGGGTCTG ACCGAATATC CTGGCGCCCT GATCATCAAC AAGCGGTTCT CCAACGTGCC GCAGGGCACC CCGATCTTCA TGTTCAACTG GGCTGAAGAT TCGGTGATCC GCGAACGCCT GTGGGTGGCC GCCGACAAGC AGGGCAAATA TGAGCTGTTC CCCGAACAGC TGCCCGGCAA GCCCGGCGAC AAGGGCCCTG TGAACAACTA G
|
Protein sequence | MMKFRKLGIL AGLALGVSVL ALNAYASEAT VPPAPADFPA EGKIKYVARD SILEFKALPE YHEPDWVTKN FVATGKLPAV KDRLPKEPMV FKTGNMPDGI GVYGDTMRHV IGGRPEGWNY GAGQTQGWGG IDIGLSECLT RTAPLFQVEA KDTEPLPNLA KSWDWSEDGH KLTMHLVEGA KWSDGAPFNA DDIMFYWDDE VVDPNVSPLG GGASPEAFGV GTTLKKIDDY TVEWTFKDAF PKQYLYTMAY PNFCPGPSHI LKPQHPKYSK NTYDQFKNAF PPEFMNMPVM GAWVPVEYRS DDIIVMRRNP YYWKVDEKGN QLPYLNELHY KLSTWADRDV QAVAGSADIS NLEQPENFVA SLKRAAEKTA PARLAFGPRL IGYNLRMNFS ANGWGNPDER GQAIRELNRN EDFRKAVTMA LDRKAIGDSL VKGPFTAIYP GGLSSGTSFY DRNSTVYYPF DLKGAKEELA KAGLKDTDGN GIVNFPAGTA GGKDVEIVML INNQYTTDKS LAEGVVAQME KLGLKIVINA LDGPKRDDAH YAGRFDWLIQ RNTTELSSVV QNTEQLAPIG PRTSWHHRAG KDDQLDLMPF EKELNDIVAK FTTSPDNDAR VALMKQYQKI STEHVNTVGL TEYPGALIIN KRFSNVPQGT PIFMFNWAED SVIRERLWVA ADKQGKYELF PEQLPGKPGD KGPVNN
|
| |