Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5442 |
Symbol | |
ID | 6978536 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 1084725 |
End bp | 1086311 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643394543 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002279361 |
Protein GI | 209547443 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0169825 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAAC AGCTACTGAC AACGATCGCA ATTACCGGCG CCCTGATGAC GACGCAGCCT ACCTGGGCTG CCTCGCCGCC CAATATGCTG GTCATCGGCA CCAATCTCAC CGGCATCCGC ACGCTCGATC CGGCCCAGAA CAATGCCCGC ACCGTTTCCG AGCTGATCTC GAACCTCTAC GACAACCTGG TGCAGCTGTC GCCGGACGAT CTCAAGACAC TGAAGCCGAT GCTTGCGACG CAATGGAGCG TCTCGGACGA CGGCAAGATC ATCACGCTCA CCCTGCGCGA CGACGCCGTC TTCCAGAGCG GCAACAAGGT CACCGCCGAG GATGCGGCCT GGTCGATCCA GCGCGTCATC AAGATGGGCC AGGTCGGCTC CACCGACGTG GCGCTTTGGG GCTTCAAGCC CGATAACGTC GAAAAACTCG TTCGGGCAAA GGACGAGCAT ACGCTCGAAA TCGAGCTGCC GCAGTCGGTG AATACCGATC TGGTGCTTTA TTCGCTGGCA GGCTCGTCGA TCGGCATCGT CGACAAGAAG ACGGTGCTGT CGCATGAGGC GAACGGCGAT TTCGGCGGCG CCTGGCTTTC GGCCAATTCC GCCGGCAGCG GACCATTCAG CCTGGCGCAA TGGCGGCCGA ACGACGTGGC GATCTTCAAT GCGCAGCCGA AATACTGGGG TGGCAAGCCC GCCATGGCCC GCGTCGTCGC CCGCCACATC CCTGAATCCG GCAATCTGCG ACTGCAGCTC GAAGCCGGCG ACGTCGATGT CGGCCAGTAC GTGGCAAGCG GTGATCTCGA TGCGCTGTCC ACCAACAAGG ATATGGTCAT CGACAATGTG CCGGGTCTCG GCTTCTATTA TATCGCCCTC AATCAGAAAG ACCCGGATCT GCAGAAGCCG AAGGTTCGCG AGGCCTTCCA GCACGCCTTC GACTGGAAGG CGATCTCCGG CAACATCATG CGCTATACGG GCTTTCCCTG GCAGTCGATG ATTCCGCGCG GCATGATCGG CGCACCCGAC GAGGCCGCCG CCCGCTACGA CTACGATCCC GCCAAAGCCA AGCAGTTGCT GGCGGAGGCC GGATATCCGA ACGGCTTGAA GAAGGTGCTC AATCCGTCGG GGGCAGCGAC CCTGCCCTTC GCCGAAGCGC TGCAGGCGAG CGCGCGGGCC GCCGGCCTCG ATCTCGATCT GGTGCCAGGC GAGTTCACGC CCGCCTTCCG CGAACGCAAA TTCGAAGTGC TGCTCGGCAA TTCCGGCGCC CGCCTGCCGG ATCCCTTTGC TGTCGCCACG CAATATGCCT TCAACCCCGA CAATAGCGAC GAGGCGCGCC TCGGCAGCTA TTATCTCTGG CGCACGGGCA TGAAGGTGGA CGACCTCAAC ACTCTCATCG ATCAATCGAT GAAAGAGCGC GATACGGCCA AGCGCACGGA TATCTTCAAG AAGATGGACG GCATCTATGC CGGCATGGCC GCCCCGCTCG TCATCTTCTT CCAGCGAACC GACCCCTATG TCATGCGCGC CAACGTCAAG AATTATCACG GGCACACGAC CTGGTCGACG CGCTGGCACG ACGTGACCAA GGAGTAG
|
Protein sequence | MLKQLLTTIA ITGALMTTQP TWAASPPNML VIGTNLTGIR TLDPAQNNAR TVSELISNLY DNLVQLSPDD LKTLKPMLAT QWSVSDDGKI ITLTLRDDAV FQSGNKVTAE DAAWSIQRVI KMGQVGSTDV ALWGFKPDNV EKLVRAKDEH TLEIELPQSV NTDLVLYSLA GSSIGIVDKK TVLSHEANGD FGGAWLSANS AGSGPFSLAQ WRPNDVAIFN AQPKYWGGKP AMARVVARHI PESGNLRLQL EAGDVDVGQY VASGDLDALS TNKDMVIDNV PGLGFYYIAL NQKDPDLQKP KVREAFQHAF DWKAISGNIM RYTGFPWQSM IPRGMIGAPD EAAARYDYDP AKAKQLLAEA GYPNGLKKVL NPSGAATLPF AEALQASARA AGLDLDLVPG EFTPAFRERK FEVLLGNSGA RLPDPFAVAT QYAFNPDNSD EARLGSYYLW RTGMKVDDLN TLIDQSMKER DTAKRTDIFK KMDGIYAGMA APLVIFFQRT DPYVMRANVK NYHGHTTWST RWHDVTKE
|
| |