Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5097 |
Symbol | |
ID | 8007689 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 488783 |
End bp | 492007 |
Gene Length | 3225 bp |
Protein Length | 1074 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644822011 |
Product | alpha amylase catalytic region |
Protein accession | YP_002973271 |
Protein GI | 241113436 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.511822 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0191149 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATTCT CTCCAAACCA GGGTGTGTTG AGCACCCCAC CCCGGATCTA TTATGTCAAT CCACTCCTTC TTCAGGGTAT CGATGCATGG CGCGAGGTTT TCGACCATGC GGCGGATACC GGTTTCGACC GCGTGCTGAC GGCACCTCTC TTCGACCGGG GCGGGGAGCG CAGCATCTTT GCCTCCCAGG ATCTGAAGCG CCTTGATCCG CAACTGTCGC TCGGAAGCGC GGTCGAAAAT GGGGTGGGAC GCCTGGCGGA GGCCGCCCGT AAGAGCGGCG TCGCCTTGAT GATGGACCTG ATGCTCGACG GCAAGGCCCG GGATCCCAAG GTGGGCTTTC ACCCTGTCGA TCCGAGACGC TCGCCATTGG ATCCGGCAGA GCCGATGACT GCGATGGAGG CGGAGTCTCA ATCCCGGCTG TTGGAAGAAT GGACAGAACG GCTGCGGGGG CTGGCCGGTC TTGGCCTCGC CGGCTATCGG GCCCTGGGGA TCGATCGCAT TGGGCCGGCG GTTTTCAAAT CGCTGATTTC GGCGGTGCGC GAGAAGACGG ACGCGCAATT CCTTGCCTGG ACGCCCGGCA CTGATTTCGG CGTGAGGGAC GCGGTCAAGA ATACAGGCTT CGATGGATGT TTTTCATCGA TGGCCTGGTG GGATTTTGAC GAGAGATGGT TCATCGAAGA ACACCGGGTC CAGAAGCCGC TCGGCTGGCA GATCGCTTTT CCGGAACCGC CCTTCGGCAG GCGTATCGCC CACGGAACCG AGAGCCGCGA AATCCTCGAA CGGCGCGCCG TTAGGGCGCT ACGCCTTGCC GCCTCGCTCG GCGGCGGCCT GATGGTTCCG ATGGGCTTCG AATATGGCGC AGCCACGCCG CTCGATCCGA CCCATGGGAA TGGAACGGGT TTGCGGGGAC TGCGCCATGA TCTGGCTTTC GATATCTCCT CCGAAATTCG CCTTGCGAAC ACCGAGATCG GCAAGGAACC CCATGCACTT GCCTCATCGC TTCGGCTCAT CCGCAACGCC AACGGACCGG TCTCGGCGCT GCTGCAATCG GCGGCGGAAG ACCCTCGCAG CGCCGAAAAT GTGCGGTTCA TCCTGCTCAA CAGGGACCTT CGCAAAAGTG CGCCCGCTCC GGTAACAGCA CTGCGCGAAG CGGGCTCCGG TTTCCTTCCC GTCGCAGCCG ACGGCACGGT TCTGCGGCTG CGGGCAGGGG AGACCCTTGT CGTCGAAGGC AAGGCGCCGG CGCCGATCAC ATCGCGGCCA ATCCTCGACG TCGCGCAGGC GATAGCTTCA CCCCGGCTCG CCATCGAAAA TATCCTGCCG CGGGTCGACG ACGGGCGCTT TCCTGTCAAA CGGGTGGTCG GCGACATCCT CACCGTCGAG GCCGATATAT TCGCCGACGG CCACGACCCG ATCGCCGTGG TCCTGCTCTG GCGGCCGCTC GATGCGGCGG ACTGGAACGA GACGGAGATG CAGCTGGTCG AAAACGACCG CTGGCGCGCC GAATTCCTGC TGGAGCGCAT CGGACGCTAT GAGTTTGCCG TCGAGGCATG GAAGAACCCG TTTGCGATCT TTCGCTACGA GTTGACAAAG AAGAACGATG CCAGGCTGGA CCTGAAGCTC GAACTCCAGG AGGGATTGAA CCTCATCCGC TCGGCTGAGG TGCATGCCGG CGCAACGCTC AATGCCGAAT TGAAGGCCCT TGGTGATAGT CTCGAAGGGG CGTCCGATAC CGAGCGCACG GCGATACTGC TCGATGCCGG AACATCCGAA TTGATGAACA AGGCGGACAA CAGGCCGTTT CGGCTTCGCT CGACGGCAAG CGCCGTCGAT GCCGAACGCA AGGAGGCCGC CTTTGCCAGC TGGTACCAGA TCTTCCCGCG CTCGCAGAGC GGCGATCCGG ACCGGCACGG CACCTTCGAC GACGTCATCC CGCGACTGCC CGCCATCCGC GACATGGGCT TCGACGTGCT TTATTTCCCA CCGATTCATC CGATCGGCTC GACCAACCGG AAAGGTCGCA ACAACACCCT AAAAGCTGCG CCGGGCGACC CCGGAAGTCC CTATGCCATC GGCTCCGAGG ACGGCGGCCA TGACGCCATC CATCCCGAGC TCGGCGAGTT CGAGGATTTC CGCCGGCTGG TCGATGCGGC AGGCCGGCAT GGCCTGGAGA TTGCTCTCGA CCTCGCGATC CAGGCGTCGC CGGATCACCC CTGGCTGAAG GAGCATCCCG GCTGGTTCGA CTGGCGCCCC GACGGCACGA TCAAATATGC TGAAAACCCG CCCAAGAAAT ACGAGGACAT CGTCAACGTC GATTTCTACA CGAAAGACGC GCTGCCTTCA TTATGGGTGG AGCTCAGGGA TGTCGTCCAG CTTTGGGTGG ACCAGGGCGT CAAGCTGTTT CGCGTCGACA ATCCGCACAC AAAGCCATTT CCGTTCTGGG AATGGCTGAT CGGCGATATC AGGGGCCGTC ATCCCGATGT CGTCTTCCTG TCGGAAGCCT TCACCAAGCC GAAGGTGATG TACCGGCTGG CAAAGATCGG CTTCTCCCAA TCCTATACCT ACTTCACCTG GCGCAATGCC AAGTGGGAGC TCGAGCAATA TATGCGGGAG CTGACCGAGA CGGCGCCGAA GGAATTCTTC CGACCGCATT TCTTCGTGAA CACGCATGAT ATCAATCCGG ATTTCCTGCA GAACGCGCCG CGCCCGGCCT TTCTGATCCG CGCAGCACTT GCCGCTACCC TGTCGGGATT GTGGGGCGTT TATAACGGTT TCGAACTTTG CGAGGGGCGT CCCGATGCCA AGCGCAAGGA GTATGCCGAC AGCGAGAAAT ACGAAATCCG CGCCTGGGAC TACGATCGGC CGGGCAATAT CATCGCCGAA ATCAGGACGC TCAATCGCAT CCGCAACGAA AACACCGCGC TGCATTCGCA TCTCGGGCTG ACGCTGCTGA ATGCGCGAAA TGACAATATC CTGTTTTTCG AGAAGGCGAG CCGTGCCCGC GACAATGTCC TGCTGATCGC CATCAGCCTC GATCCCCACA ATTTCCAGCA GAGCGACGTC GAGCTGCCGC TCTGGCAGTG GTCGCTGGGC GACGGCGGCA CGCTGGATGT CGAAGATCTG ATCGGCGGGC ATCGTTTCAA GTGGACCGGC AAATGGCAGA GCATCAGTCT CAATCCTGAG GTCCTGCCCT ATGCGATCTG GCGTATTCGC TCAACGGAGG CATGA
|
Protein sequence | MSFSPNQGVL STPPRIYYVN PLLLQGIDAW REVFDHAADT GFDRVLTAPL FDRGGERSIF ASQDLKRLDP QLSLGSAVEN GVGRLAEAAR KSGVALMMDL MLDGKARDPK VGFHPVDPRR SPLDPAEPMT AMEAESQSRL LEEWTERLRG LAGLGLAGYR ALGIDRIGPA VFKSLISAVR EKTDAQFLAW TPGTDFGVRD AVKNTGFDGC FSSMAWWDFD ERWFIEEHRV QKPLGWQIAF PEPPFGRRIA HGTESREILE RRAVRALRLA ASLGGGLMVP MGFEYGAATP LDPTHGNGTG LRGLRHDLAF DISSEIRLAN TEIGKEPHAL ASSLRLIRNA NGPVSALLQS AAEDPRSAEN VRFILLNRDL RKSAPAPVTA LREAGSGFLP VAADGTVLRL RAGETLVVEG KAPAPITSRP ILDVAQAIAS PRLAIENILP RVDDGRFPVK RVVGDILTVE ADIFADGHDP IAVVLLWRPL DAADWNETEM QLVENDRWRA EFLLERIGRY EFAVEAWKNP FAIFRYELTK KNDARLDLKL ELQEGLNLIR SAEVHAGATL NAELKALGDS LEGASDTERT AILLDAGTSE LMNKADNRPF RLRSTASAVD AERKEAAFAS WYQIFPRSQS GDPDRHGTFD DVIPRLPAIR DMGFDVLYFP PIHPIGSTNR KGRNNTLKAA PGDPGSPYAI GSEDGGHDAI HPELGEFEDF RRLVDAAGRH GLEIALDLAI QASPDHPWLK EHPGWFDWRP DGTIKYAENP PKKYEDIVNV DFYTKDALPS LWVELRDVVQ LWVDQGVKLF RVDNPHTKPF PFWEWLIGDI RGRHPDVVFL SEAFTKPKVM YRLAKIGFSQ SYTYFTWRNA KWELEQYMRE LTETAPKEFF RPHFFVNTHD INPDFLQNAP RPAFLIRAAL AATLSGLWGV YNGFELCEGR PDAKRKEYAD SEKYEIRAWD YDRPGNIIAE IRTLNRIRNE NTALHSHLGL TLLNARNDNI LFFEKASRAR DNVLLIAISL DPHNFQQSDV ELPLWQWSLG DGGTLDVEDL IGGHRFKWTG KWQSISLNPE VLPYAIWRIR STEA
|
| |