Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5344 |
Symbol | |
ID | 8007302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 752674 |
End bp | 754194 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644822248 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002973508 |
Protein GI | 241113673 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.297716 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.516905 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTGC ATTTGTTAGC TGCCTGCTTT TCAACAACGG TGCTCGCGTT GAGCGGCGGT ACGGCACACG CTCAGGATGC CAAGAGCAAC GTCACTGTTG TGCTTGCCGA AACCGTCGAT GTCGTCGAAC CCTGTATGGC AGCGCGCCAG GACGTCGGCC GGGTCATTTC TGAAAACGTC AACGAGATGC TGGTGGAGTT CGACTACGTC AATGGCGGCC TCAAACCTCG CCTGGCGACG GAATGGTCGA AGATCGATGA CGACACCTGG GAGTTCAAGC TGCGCCCGAA TGTCAAATGG CACGATGGCA AGCCGTTCAC CGCCAAGGAT GTCCAATTCA CCATCGAGCG CAACAAGAAT AAGAAGCTCA GCTGTGAGAC CGGCGGCAAA TATTTCGGCG GCACGGAGTT CAGCTTCGAA ACGCCCGATG CCAACACGAT CCGCATTACA ACAAAACCGG CGCAGCCGAT TCTTCCGCTT CTGATGACGG TGATGGCTGT GGAATCGGCC GAGGCGACAC CAGCCGACGA ATTCACCCGC AAGCCGATTG GCACTGGCCC TTATACGTTC GACAAATGGG AAATCGGCCA GTCAATCGTG CTGAAACGCA ATCCGGAATA TTGGGGGGAG AAACCCCAGG TGGAACAGGC GACATATCTG TTCCGCTCAG ACAGCGCCGT TGCAGCCGCC ATGGTCGATG CTGGCGAAGC CGATATCGTT CCGGCCGTAT CCGTACAGGA TGCCACCAAC AAGGAAACCG ATTTCGCCTA CCCGAATTCG GAAACGACAT CGCTGCGCAT CGATACGCGC GCAGCACCCC TTAACGACCG GCGCATCCGC GAAGCGATGA ACCTCGCCAT CGATCGTCAG GCGATGCTCG GAACGCTGTT CCCCGAACAG GCAAAGATCG CGACACAGCT CGTTGTGCCG ACCACGATCG GTTACAATGC CGATATCCCC GCTTGGCCCT ATGATCCCGA AAAAGCAAAG GAACTGGTCG CAGCAGCGAA AGCGGACGGC GTCCCGGTCG ATCGTGAGAT CCGTATCATC GGTCGCAATG GACAATATCC AAACGCAACC GAAGCGATGG AAGCGATGAT GGCGATGCTT CAGGAAGTCG GCTTGAACGT AAAGCTCGAC ATGTATGACG TGTCCGTGTG GAACGGCTAC TTCGTTGCAC CCTTTGTCGC CGATTCCGGT CCGACGCTGA CCCAGTCGCA GCACGACAAT GCGACCGGCG ACCCCGTCTT CACCGCATTC GTGAAATACG CGACCGACGG TTCCCACTCC ATGGTTCGCG ATCCGGCCGT TGACGCGCTG ATCGCCAAGG CGACGTCTGC CACCGGCGAC GAGCGCACAA AACTCTGGAA GGAGCTTTTC GCCAAGGTGA ACACCGAAAT CATCGCCGAC ATCCCGATGT TTCACATGGT CGGTTTCACC CGCGTCTCGC CGCGTCTCGA CTTCAAGCCG ACGATCGCGA CGAATTCCGA ACTGCAGCTG TCGCAGATCC GCTTCAAGTA A
|
Protein sequence | MKLHLLAACF STTVLALSGG TAHAQDAKSN VTVVLAETVD VVEPCMAARQ DVGRVISENV NEMLVEFDYV NGGLKPRLAT EWSKIDDDTW EFKLRPNVKW HDGKPFTAKD VQFTIERNKN KKLSCETGGK YFGGTEFSFE TPDANTIRIT TKPAQPILPL LMTVMAVESA EATPADEFTR KPIGTGPYTF DKWEIGQSIV LKRNPEYWGE KPQVEQATYL FRSDSAVAAA MVDAGEADIV PAVSVQDATN KETDFAYPNS ETTSLRIDTR AAPLNDRRIR EAMNLAIDRQ AMLGTLFPEQ AKIATQLVVP TTIGYNADIP AWPYDPEKAK ELVAAAKADG VPVDREIRII GRNGQYPNAT EAMEAMMAML QEVGLNVKLD MYDVSVWNGY FVAPFVADSG PTLTQSQHDN ATGDPVFTAF VKYATDGSHS MVRDPAVDAL IAKATSATGD ERTKLWKELF AKVNTEIIAD IPMFHMVGFT RVSPRLDFKP TIATNSELQL SQIRFK
|
| |