Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5113 |
Symbol | |
ID | 8006974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 514247 |
End bp | 516106 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644822027 |
Product | 4-alpha-glucanotransferase |
Protein accession | YP_002973287 |
Protein GI | 241113452 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1640] 4-alpha-glucanotransferase |
TIGRFAM ID | [TIGR00217] 4-alpha-glucanotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0943987 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCTG CTGAGTTCGA CAAGCTCGCC CGTCGGCACG GCATCAGCCC GACCAGGCCC AGTCCCGACA ATCGGGAGGT GGCGATATCA GCCGAGACCA AGCGCAAGAT CCTCTCGGCA CTCAAGATCG ACGTGCCGGG AAGCGCCGAT CCCGCGACCT GCGCGTTACG GCCCAAGCCG GCGGCTAAGA AAATCGCCAG GTCATTTCTG CCCGATTTCC TTTCCGGCAC GCGCGTCTGG GGCGTGAGCC TGCAGCTCTA CGAGCTCCGT TCGGCGCGCA ACTGGGGCAT AGGAGATTTC GAGGATCTGT CGGATATGGC GGATCTCGCA GGATCGTTGG GCGCCGATTT CATCGGCCTC ACCCCGCTTC ATGCGCCGTT TCTCGCCGAT CCAGACCGTT GCAGTCCCTA CGAGCCGTCG AGCCGCCAGC ATCTCAACCC GCTCTATATC GCGGTCGAGC GATTGCCGGG TTTTGCCTGC GGTCCCGAAC TCGAACGGCA TTTGGAAAGC CTTCGCCAAA CCGATCTCGT CGACTACGTC GGCGTCGCAC AGATCAAGCT GAGGGTCCTG CGCGATCTTT GGCCGGCCTG GCGACAGAGC AGCGTGATCG ACGATGCCTA TGATCCGGCC GATTTCGACG CCTTCATCAC GCAAGGTGGT AACAACCTGC GGCTGCATGC GCTTTTCGAA TGCCTCTCGT TTTCCATGGT CGAGCGCGGG ACAGGCGCCG GCTGGCAACG TTGGCCGGCC GATTTCCAGC GCTTCGAAAG CGCCGCCGTT GCCGAATTCG AGCGCGAACA CGCGGACGAC GTCCTTTTCC ACATGTGGCT GCAATGGCTC GCCCACCGGC AACTGATGCA GGCGGCGGAC CGGGCGCGGA AGGCAGGCCT GCGGATCGGG CTCTATCTCG ATCTTGCCGT CGGGGAGGCG GTCGACGGTT CGGCGACATG GAGCGAGCCG GATACCTATG TCTCGAAGGC CACCATCGGC AGCCCACCCG ATCCATTCGC CGTGGACGGG CAGGACTGGC ACCTTGCCGG ATATCTGCCT TCCGCGATCG CGGCAGGAGA AATGTCGCCT TTCCGGCGCA TGCTGAGCGC TGCCATGCGA TATGCGGGCG CCATCCGCAT CGACCACGCC GCAGCACTTC GCCGCCTCTT CCTGGTGCCA TTGGGCAGCA GGCCGGACGG CGGCGCCTAT GTCTGTTATC CCGCGGATCG GCTGCTGCAG ATCCTCGCCG AAGTGTCCGC AGAGCATCGG TGCCTCGTTA TCGGAGAAGA CCTGGGCCTG ATCCCCAAGG GATTGCAGGA CGATCTTGCG ACGGCCCACA TTCTCTCCTA CCGCATCCTT TCCTATGAAC AGGACGAGAA GGGCTTCAAG CCGGCGGACG TCTATCCCGC CCTGGCTTTG GCCTGCATTT CGACGCATGA CCACCAAACC CTTGCCGGCT GGTGGCGCGG CGCCGACATT CAGGCCCGGT GCGACCACGG CATCGTGCCG CCCGATCTCA CCAAGAAGCA CCTCGAAGAC CGCAAGCGCG AGCGGAGGAA CCTGAAAGCG GCGCTGAAGG CTGCCGGCCT CGAACTGCCG GCCAGGCTTT TCCAAGCGCG GGCAAGCGAG GAGACGCTGA GGGAATTGAC CGTCAGCGCT TATCGCTTCA TCGCCAGGAC GCCCTCGCTT CTCGCCGCGG TCCGTCTCGC CGATCTCACC GACGAAAAAA GGCCAACCAA TGTGCCAGGC ACCAGCGACA GCTATCCGAA CTGGAAGCCG AAGCTATCGG TTTTGCTCGC CGATCTGATG TCGAGCCCGC TGCTGAAGAG CGTGACGGCG GCAATGCGGG AGGAACGTCC GCGGGAGTGA
|
Protein sequence | MKPAEFDKLA RRHGISPTRP SPDNREVAIS AETKRKILSA LKIDVPGSAD PATCALRPKP AAKKIARSFL PDFLSGTRVW GVSLQLYELR SARNWGIGDF EDLSDMADLA GSLGADFIGL TPLHAPFLAD PDRCSPYEPS SRQHLNPLYI AVERLPGFAC GPELERHLES LRQTDLVDYV GVAQIKLRVL RDLWPAWRQS SVIDDAYDPA DFDAFITQGG NNLRLHALFE CLSFSMVERG TGAGWQRWPA DFQRFESAAV AEFEREHADD VLFHMWLQWL AHRQLMQAAD RARKAGLRIG LYLDLAVGEA VDGSATWSEP DTYVSKATIG SPPDPFAVDG QDWHLAGYLP SAIAAGEMSP FRRMLSAAMR YAGAIRIDHA AALRRLFLVP LGSRPDGGAY VCYPADRLLQ ILAEVSAEHR CLVIGEDLGL IPKGLQDDLA TAHILSYRIL SYEQDEKGFK PADVYPALAL ACISTHDHQT LAGWWRGADI QARCDHGIVP PDLTKKHLED RKRERRNLKA ALKAAGLELP ARLFQARASE ETLRELTVSA YRFIARTPSL LAAVRLADLT DEKRPTNVPG TSDSYPNWKP KLSVLLADLM SSPLLKSVTA AMREERPRE
|
| |