Gene Information |
Locus tag | Rleg2_6105 |
Symbol | |
ID | 6983178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011370 |
Strand | - |
Start bp | 33381 |
End bp | 34937 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643399130 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002283886 |
Protein GI | 209551970 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
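The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes (this is not stated on the page) that Start bp and End bp are 1-based, inclusive positions and that the CDS length includes the stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 33381
END_BP = 34937
GENE_LENGTH_BP = 1557
PROTEIN_LENGTH_AA = 518


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1557
    print(expected_protein_length(GENE_LENGTH_BP))  # 518
```

Under these assumptions, 34937 - 33381 + 1 gives the listed 1557 bp, and 1557 / 3 = 519 codons, one of which is the stop codon, gives the listed 518 aa.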
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000312198 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGATT TGAACCGTAG AACGCTACTA AAAGGTGCAG CCGCCGCTGC GGCATACACG TTTACTTCTT TGGGCCCGGC AAGGGCTACT TCAAGCTCGC CGCGCCGAGG AGGGCATCTT CGCATCGGTC TTTGGGGAGG ATCCTCACAG GACACGCTAG ATCCGGCTAG CATCACTACC GATGCGGGGT TCCTCACGGC GGCCACCGCA CGGAACAAGT TGTTAGAGGT CGAACCGAAT GGTGAGCTTA CCCCAGCCCT CGCATTAAAG TGGGAGCCAT CGGATGACCT CATGCGTTGG ACCTTTGAAA TTCGGCCTGG GGTGACTTTT CACAGCGGCA AGTCGCTTGA AATGTCGGAC ATCGTCGCTT CGCTCAACCT GCATCGCGGA AAGGACTCAA CCTCTCCCGC AAAATCCTTC CTTGACGCGG TCACTGATAT CAAGGCCGAG GGCAGTAACA GAGTTGTCGT CTCGCTCAAT GCTCCGAACG TCGACTTTCC AAGTGCCCTC GCGGACCTCT CGCTGTCCAT CGTGCCCGCG AAGGATGGGG TAGCAGATCG GAACACGATG GACGGTACTG GCCCGTACGC AATCGAGAGC TTTGAGCCTG GCCAGCGCAT AAGGTTCAAG CGAAACCCTA ATTACTGGAA TCTGGATAAG GCTGCATTCT TCGACTCGGC CGAGGTCCTG ATCCTCGCTG ATGCCGCTAC AAGAATGAAC GCTTTGCGCT CGGGTCAGGT TGATTTGATA AACCAAGCCG ACCTTAAGAC ACTTAGCATG CTTCAGCGGG TCCCGGGAAT AACCGTTGAA GACGTGCCCA GCGGCCGGTT TTATATTTTT GGCATGATGT CGGACGTTGC TCCTTTCAAT GACAAGGATG TACGACAGGC TCTGAAATTT GCGATCAACC GAAAGGAGAT GACCCAGAAG ATTCTCCTTG GGCATGGGAG CATTGGGAAT GACCAGCCTA TCAAGCCCAG CCACAAGTAC TTCAATACGA ACCTTCCGCA ACGTGAGTAT GATCCCGAAA AAGCGAAATT TCATCTTAAA CAAGCTGGCG TGACGTCACT TCAAGTACCT TTGAGTGTGG CCGAGGCCGC ATTCGCCGGG GCTGTAAATG CGGGGCAGCT TTTTGTCGCC TCGGCTGCTG AAGCAGGCAT CAACATCGTC GCAACGCGGG AGCCCGATGA TGGTTACTTC GACAACGTTT GGCTGAAAAA GCCGTTTACC GCCGACTATT GGACTGAACT GCCGTCCGCT GATGCGCAGT TCACGCAAGG CTATGCTAAG GGAGCAGCTT GGAACGAGAC GCACTTCGAC AACCCTCGCT TCAACGAACT CTTGCTGAAG GCCCGAGCTA CGCTGGATGA GCAGCAGCGC GCGGGGATGT ACCACGAAAT GCAACAACTC ATCCATGATG AAAGCGGTGC GATCATTCCA ATGTTCGCGA ACAATACCTG GGCTTCCAAA TCAACGTTGA AGCATCAGGA CGGATTGTCC AGCCATCGCG ATCTCGATGA TTTCCGTTGC ATTGAGAGGT GGTGGTTCGA ATCCTAA
|
Protein sequence | MLDLNRRTLL KGAAAAAAYT FTSLGPARAT SSSPRRGGHL RIGLWGGSSQ DTLDPASITT DAGFLTAATA RNKLLEVEPN GELTPALALK WEPSDDLMRW TFEIRPGVTF HSGKSLEMSD IVASLNLHRG KDSTSPAKSF LDAVTDIKAE GSNRVVVSLN APNVDFPSAL ADLSLSIVPA KDGVADRNTM DGTGPYAIES FEPGQRIRFK RNPNYWNLDK AAFFDSAEVL ILADAATRMN ALRSGQVDLI NQADLKTLSM LQRVPGITVE DVPSGRFYIF GMMSDVAPFN DKDVRQALKF AINRKEMTQK ILLGHGSIGN DQPIKPSHKY FNTNLPQREY DPEKAKFHLK QAGVTSLQVP LSVAEAAFAG AVNAGQLFVA SAAEAGINIV ATREPDDGYF DNVWLKKPFT ADYWTELPSA DAQFTQGYAK GAAWNETHFD NPRFNELLLK ARATLDEQQR AGMYHEMQQL IHDESGAIIP MFANNTWASK STLKHQDGLS SHRDLDDFRC IERWWFES