Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6676 |
Symbol | |
ID | 8022586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | + |
Start bp | 106243 |
End bp | 107862 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644833543 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002984677 |
Protein GI | 241666593 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.479975 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.420932 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCAAAC GCTGGCTGCA ACAGACGACC ATGGCAACGA TGGTGGCGCT CGCTCCATTG TCCGTCATGG CTGATGAAAC GCCCAAGCAG GGCGGCGATA TCGTCGTCAC CTACAAGGAC GACATCACCA CGCTCGACCC GGCGATCGGC TACGACTGGG TCAACTGGTC GATGATCAAG AGCCTCTATT CCCGCCTGAT GGACTATACG CCCGGCACGC CGAACCCAGT TCCCTCGCTT GCCGAAAGCT TCACCGTTTC GCCCGACGGC TTGACCTATA CCTTCAAGCT GCACAAGGGC GTGAAGTTCT CGAACGGCCG CGAGGTGGTC GCCTCCGACG TGAAATATTC GATCGAACGC GCCGTCGACC CGAAGACGCA AGGCCCCGGC GCCGGCTTCT TCGGCGCCAT CAAGGGCTTC GAGGATGAAA CCGGCGGCAA GACGACGACG CTCTCCGGCA TCGATACGCC TGACGATAGC ACCGTCATCT TCAACCTCTC TCGCCCAGAC GCCACCTTCC TGCACGTGCT TGCCATCAAC TTCGCCTCGG TCGTGCCGAA GGAAGCCGTC GAGGCTGCCG CCGGCGACTT CGGCAAGAAG CCGGTCGGCT CCGGCACCTT CATCCTGAAG GACTGGACGA TCGGCCAGCA GCTCGTTTTC GAGCGCAACA AGGATTATTT CGTCAAGGGC GTTCCCTATA TCGACAGCTT CAAGGTCGAG GTCGGCCAGG AGCCGCTGGT GGCGCTCTTG CGCCTGCAGA AGGGCGAGGT CGATATTGCC GGCGACGGCA TTCCGCCGGC AAAGTTCCTC GAAATCAAGA ATTCGGCCGA TGGCGCACAG ATGATCGTCG ACGGCGAACA GCTGCACACC GGCTACATCA CGCTGAACAC CAAGGTAAAG CCCTTCGACA ACGTCAAGGT TCGCCAGGCG CTGAACATGG CGATCAACAA GGACCGCATC ACCCGCATCC TCAACGGCCG CGCAACGCCT GCCAACCAGC CGCTGCCGCC GCTGATGCCG GGTTACGACA AGGCCTTCAC CGGCTATACC TATGACGTGG CGAAAGCCAA GGCGCTGCTT GCCGAAGCCG GTTATCCCGA TGGCTTCGAA ACCGTGCTCT ACTCCACCAA CACCGATCCG CAGCCGCGTA TCGCCCAGGC AATCCAGCAG GATCTGGCCG CCGTTGGCGT CAAGGCCGAA GTCCGGGCGC TGGCCCAGGC AAACGTCATC TCGGCCGGCG GCACGGAAGG CGAAGCGCCG ATGATCTGGT CGGGCGGCAT GGCCTGGATC GCCGACTTCC CGGATCCGTC CAACTTCTAT GGCCCGATCC TCGGTTGCGC CGGCGCGGTC CCGGGCGGCT GGAACTGGTC GTGGTACTGC AACGCCGATC TCGACAAGCG CGCCGTTGCC GCCGACTCCA TGTCCGATCC GGCAAAGGCA ACCGAACGCA CCGCCGCCTG GGGCAAGATC TTCACCGACA TCATGGCAGA TGCGCCGTGG ATCCCTGTCA TCAACGAACG CCGCGTCGTC GCCAAGTCGC TGCGCATGGG CGGCGCTGAC AACATCTACA TCGATCCGAC CCGCGTCATC AATTACGACG CGATCTACGT CAAGCAGTAA
|
Protein sequence | MFKRWLQQTT MATMVALAPL SVMADETPKQ GGDIVVTYKD DITTLDPAIG YDWVNWSMIK SLYSRLMDYT PGTPNPVPSL AESFTVSPDG LTYTFKLHKG VKFSNGREVV ASDVKYSIER AVDPKTQGPG AGFFGAIKGF EDETGGKTTT LSGIDTPDDS TVIFNLSRPD ATFLHVLAIN FASVVPKEAV EAAAGDFGKK PVGSGTFILK DWTIGQQLVF ERNKDYFVKG VPYIDSFKVE VGQEPLVALL RLQKGEVDIA GDGIPPAKFL EIKNSADGAQ MIVDGEQLHT GYITLNTKVK PFDNVKVRQA LNMAINKDRI TRILNGRATP ANQPLPPLMP GYDKAFTGYT YDVAKAKALL AEAGYPDGFE TVLYSTNTDP QPRIAQAIQQ DLAAVGVKAE VRALAQANVI SAGGTEGEAP MIWSGGMAWI ADFPDPSNFY GPILGCAGAV PGGWNWSWYC NADLDKRAVA ADSMSDPAKA TERTAAWGKI FTDIMADAPW IPVINERRVV AKSLRMGGAD NIYIDPTRVI NYDAIYVKQ
|
| |