Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6409 |
Symbol | |
ID | 8016908 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012854 |
Strand | - |
Start bp | 121590 |
End bp | 122687 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644828204 |
Product | putative sugar ABC transporter, substrate-binding protein |
Protein accession | YP_002979404 |
Protein GI | 241554191 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000000237151 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.403501 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGCA GAACATTATT GCAGGCAACG GTCGCAACCG TCGCCGTTAT GCTCAGCATG CCGGCATGGG CCGATTCAAT GGCCGATGCC AAGGCCGTGG TTGAAAAGTA TGCTTCCAAG GTGAGCGCCT GGGACGGCCC GACGAGCGGC CCCAAGGGCG CTGCCGGAAA GAACATCGTC ATTCTTGCCG GCGATATGAA AAACGGTGGC ATCCTCGGCG TGGTCAATGG CGTTCAGGAA GCGGCAGGCG CGCTCGGCTG GACGGTAAAG GCGCTTGACG GCGCCGGCTC GATCGGCGGG CGGACCGCGG CTTTCGGCCA GGCCATGGCG CTGAAGCCGG ATGGCATCAT CATCAACGGC TTTGACGCGG TCGAGCAGAA GCCGGCGATG GAAGCAGCCA AGGCTGCCGG CATTCCGATG GTTTCCTGGC ACGCTGCCTC GGCGGTCGGT CCGGTTCCCG AAGTCGGTGT CTTTGCAAAC GTGACTACCG ATGCGATGGA AGTATCGAAG TCCGCGGCGG ATTGGGCCTT TGCCGATGCC GCCGGCAAGC CGGGCGTCAT CATCTTTACC GACTCCACCT ATGCGATCGC GATTGCCAAG GCCGATCGCA TGAAGAAGGA AATCGAAGAT CTCGGCGGCA CGGTGCTCGA ATATGTCGAC ACGCCGATCG CCGAAACCTC GCAGCGCATG CCGCAGCTGA CCACGTCGCT TCTACAGAAG TATGGCGCGA AGTGGACGCA TTCGCTGGCG ATCAACGACC TCTATTACGA CTTCATGGGA CCTTCGCTCG CCTCGGCCGG CATTGCAGGC GATGGAAAAC CGGTGAACGT TGCAGCCGGC GACGGGTCGG AAAGCGCCTA TCAGCGTATC CGCGCCAAGC AATTCCAAGC CGTCACGGTC GCCGAGCCGC TCAACCTTCA GGGCTGGCAG CTCGTCGACG AATTGAACCG CGCCTTTGCG AAAGCGCCAT GGTCCGGCTA CGTGTCGCCA CTCCATGTGG TGACCAGCCA GAACGTCGAG TTTGACGGTG GGCCGAAGAA CAGCTTCGAT CCCGACAACG GCTATCGCGA TCAATACAAG AAAATCTGGG GCAAATAA
|
Protein sequence | MKRRTLLQAT VATVAVMLSM PAWADSMADA KAVVEKYASK VSAWDGPTSG PKGAAGKNIV ILAGDMKNGG ILGVVNGVQE AAGALGWTVK ALDGAGSIGG RTAAFGQAMA LKPDGIIING FDAVEQKPAM EAAKAAGIPM VSWHAASAVG PVPEVGVFAN VTTDAMEVSK SAADWAFADA AGKPGVIIFT DSTYAIAIAK ADRMKKEIED LGGTVLEYVD TPIAETSQRM PQLTTSLLQK YGAKWTHSLA INDLYYDFMG PSLASAGIAG DGKPVNVAAG DGSESAYQRI RAKQFQAVTV AEPLNLQGWQ LVDELNRAFA KAPWSGYVSP LHVVTSQNVE FDGGPKNSFD PDNGYRDQYK KIWGK
|
| |