Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4978 |
Symbol | |
ID | 6978072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 620839 |
End bp | 622152 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643394124 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002278942 |
Protein GI | 209547024 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.346425 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACCT TTGCTGCATT GCTGCTGGGA GCGACGGCGC TCGTCGGTAC TTCGGCCTTG GCCCAAACCA CGCTGACGAT TGCGACAGTC AATAACAACG ACATGATTGT GATGCAGAAG CTGTCGAAGG ACTTTGAAGA GAAGAATCCT GACATCAAGC TCAACTGGGT CACCCTGGAA GAGAACGTCC TTCGTCAGAA GATCACGACC GACATCGCCA CCCAGGGCGG TCAGTACGAC ATCATGACGA TTGGCATGTT CGAGACCCCG CTGTTCGGCG AAAAGGGTTG GCTGTCCGAA TTCAAGGACG TTCCGGCCGA CTACAAGCTC GACGACGTTC TGAAGTCGGT CCGTGACGGC TTGTCCTTCG ATGGAAAGCT CTATGCTCTG CCTTTCTATG CTGAAAGCCA GATGACCTTC TACCGCAAGG ACCTGTTCGA CAAGGCTGGG ATAACGATGC CTGATCAGCC GACCTGGGAA CAGATCGGCC AGTTCGCAGA GAAGATCACC GACAAGGACA AGGAAATCTA TGGTGTGTGC CTGCGTGGCA AGCCGGGCTG GGGAGAAAAT ATGGGTCAGA TCGGCCCAGT CGTAAACAGC TACGGTGGCC GCTGGTTCGA TATGGACTGG AAGCCGCAGC TGACGACCGA GCCTTGGAAG GAAGGCGTCA CTACCTACGT CGATCTCCTA AAGAAGTATG GCCCTCCCGG CGCATCGTCC AACGGCTTCA ACGAGACCCT GTCGCTTTTC GCCAGCGGCA AATGCGGCAT GTGGGTTGAC GCAACCGTCG CCGCGGGCTT CCTGACCGAC AAGAAGCAGA GCCAGGTTGC TGACAAGATG GGTTACGCCC ATCCGCCGAT TGGCAAGTTC GATAAGGGAA ACCATTATCT GTGGTCCTGG GCACTGGCAG TTCCGGTTTC GTCGAACGAA CCTGACGCAG CGAAAAAGTT CATCTACTGG GCGACCTCGC AGGACTATAT CAAGCTGGTA GCCAAGGAAA ATGGATGGGC GGCGGTACCT CCCGGCACCC GCACCTCCAC ATATGACACA CCGGAATACA TCAGCGCTGC TCCGTTCGCA AAGTTGACCC TGGAGACGAT CCAGACGGCA AACCCGACCG ACGCTACGCA GGAGAAGGTT CCGTACCGTG GCATTTCCTA CGTTGGTATT CCGGAGTTCC AGAGCTTCGG TACTGCCGTC GGTCAGAAGA TGTCCGCTGT TATTGCTGGC CAGAGCACCG TCGATGAAGC GCTGAACGAA TCTCAGAAGC TTGTCGAGCG CACGATGAAG CAGGCCGGTT ACCCGAAGAA ATAA
|
Protein sequence | MKTFAALLLG ATALVGTSAL AQTTLTIATV NNNDMIVMQK LSKDFEEKNP DIKLNWVTLE ENVLRQKITT DIATQGGQYD IMTIGMFETP LFGEKGWLSE FKDVPADYKL DDVLKSVRDG LSFDGKLYAL PFYAESQMTF YRKDLFDKAG ITMPDQPTWE QIGQFAEKIT DKDKEIYGVC LRGKPGWGEN MGQIGPVVNS YGGRWFDMDW KPQLTTEPWK EGVTTYVDLL KKYGPPGASS NGFNETLSLF ASGKCGMWVD ATVAAGFLTD KKQSQVADKM GYAHPPIGKF DKGNHYLWSW ALAVPVSSNE PDAAKKFIYW ATSQDYIKLV AKENGWAAVP PGTRTSTYDT PEYISAAPFA KLTLETIQTA NPTDATQEKV PYRGISYVGI PEFQSFGTAV GQKMSAVIAG QSTVDEALNE SQKLVERTMK QAGYPKK
|
| |