Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4103 |
Symbol | |
ID | 8014901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4177324 |
End bp | 4178832 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644826673 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002977883 |
Protein GI | 241206787 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCG CCAAATTGCT GACCGCCACC GTTGCCGGCG CGCTGTTTGC GCTTCCTGCC TTCGCCGTCG ACCTGAAGAT CGGATTGCAG GACGATGCCG ACGTGCTCGA TCCCGCGCAG TCACGCACCT TCGTCGGCCG CATCGTCTAT ACTGCAATGT GCGACAAGCT GGTCGACGTC TCGCCGGATC TGAAGATCGT GCCGCAGCTT GCCACCGAAT GGAGCTGGTC GGCCGACGGC AAAGAGCTCA CCATGAAGCT GCGCCAAGGC GTCAAGTTCC ATGACGAGAC CCCGTTCAAC GCCGAAGCCG TCGTCGCCAC CATCGAGCGC AACATCACCT TGCCGGAATC ACGCCGCAAG AGCGAGCTGA CCTCGGTCGC AAAGGTCGAG GCGACAAGCG AATACGAGGT CAAGTTCACC CTCAAGGCGC CTGACGTCAC CCTTCTTGCC CAGCTTTCCG ACCGCGCCGG CATGATCGTC TCGCCGAAGG CCGCCAAGGA GCTCGGCGCC AAGTTCGGCG ATCACCCGGT CTGCGCCGGT CCGTTCAAGT TCGTCGAACG CATCCAGCAG GATCGCATCG TGCTCGAAAA GTTCCAGGAC TACTGGAACA AGGACAAGAT CTTCATCGAC AAGCTCACCT ATCTGCCGAT CCCGGATACG ACGGTGCGCC TCGCCAATCT GCGTTCCGGC GATCTCGACA TGATCGAGCG CCTGGCCGCG ACCGATGCTG AAGCCGTGAA GGCGGATTCG AGCCTGGTCT ATGCCGACGC CGTCGGCACC GGCTACATGG CGCTCTATAC CAATATCGGC AACGGCGCGC GTGCCGACAA TCCTTTCGGC AAGGACAAAC GCCTGCGCCA GGCTTTCTCG CTGGCGATCG ACCGCGATGC CGTCAACCAG ATCGTCTATG AAGGCACGGC CGTCTCCGGC AACCAGCCCT TCCCGCCGAG CAGCCCGTGG TTCGACAAGG ATATTCCCGT CCCCGCCCGC GATCTCGACA AGGCCAAGGC GCTGATCAAG GAAGCCGGCT TCGACCGCGT GCCGATCGAG CTGCAGATCC CGAACAATCC CGTTGCCATG CAGATGATGC AGATCATCCA GTCGATGGTC GGCGAAGCCG GCTTCGACGT CAGCCTGAAG TCGACGGAAT TCGCCACGCT GCTGAGCGAG CAGACCGCCG GCAACTATCA GCTCAGCCGT TCCGACTGGT CCGGCCGCGT CGATCCCGAC GGCAATATCC ATCAGTTCAT CACCTGCAAG GGCGGCATCA ACGACACGAA ATACTGCAAC GCCGAGGTGG ACAAGCTGCT GAACGAAGCT CGTGCTTCGA CCGACGATGC CGTGCGCAAG CAGAAATACG ATGCCGCCGC CGTCATCCTC AACGACGATC TGCCGATCAT CTATCTCGGC CATCAGTCCT GGATCTGGGC GCTGCACAAG AACATCACCG GCTTCGTTCC GTCGCCGGAT GGCATGATCC GCCTCGTCGG CGTCAAGAAA GCCGGCTGA
|
Protein sequence | MKIAKLLTAT VAGALFALPA FAVDLKIGLQ DDADVLDPAQ SRTFVGRIVY TAMCDKLVDV SPDLKIVPQL ATEWSWSADG KELTMKLRQG VKFHDETPFN AEAVVATIER NITLPESRRK SELTSVAKVE ATSEYEVKFT LKAPDVTLLA QLSDRAGMIV SPKAAKELGA KFGDHPVCAG PFKFVERIQQ DRIVLEKFQD YWNKDKIFID KLTYLPIPDT TVRLANLRSG DLDMIERLAA TDAEAVKADS SLVYADAVGT GYMALYTNIG NGARADNPFG KDKRLRQAFS LAIDRDAVNQ IVYEGTAVSG NQPFPPSSPW FDKDIPVPAR DLDKAKALIK EAGFDRVPIE LQIPNNPVAM QMMQIIQSMV GEAGFDVSLK STEFATLLSE QTAGNYQLSR SDWSGRVDPD GNIHQFITCK GGINDTKYCN AEVDKLLNEA RASTDDAVRK QKYDAAAVIL NDDLPIIYLG HQSWIWALHK NITGFVPSPD GMIRLVGVKK AG
|
| |