Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4705 |
Symbol | |
ID | 8007180 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 70820 |
End bp | 73846 |
Gene Length | 3027 bp |
Protein Length | 1008 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644821638 |
Product | glycoside hydrolase family 38 |
Protein accession | YP_002972898 |
Protein GI | 241113063 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0383] Alpha-mannosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCTGA CCATTGCCCA GCGCCTCGAT CGTCTTAAAG TTCGCATTGC CGAATTGGCT CATTGGCGGG ATCGACAAAG TGCTGCGATC GACGGCTGGA CTTTCGAAGG CGAGCCGATC GAACATCACC AGGATTGGCC GCACCGCCAG GGGGTGGTGC ATTTTGCCGT GATCGCTGAA GCGCCGGAGG CGTGGCCTCT CGAGGATATT CGCCTGCAAC TCGATCTTGG TGGCGAGAGC CTGATTACGC TGAGCTACCC CGACGGCGAG ACTGAAACAT TCGGTCTCGA TCCTTATCAT CAGGAATTCC CGGTAAAGGG CCGTCGCTTT TCGATCGCAA CCGAAACCGT CGCCCGGTTT CCATTCGGCG AGCCCAATCG CGCTCCACGG CTCAACAAGG CCCGATTTAT CTGGCTCGAT GGTCCCGCTC ATCGCATGCA TCTTCTGCTG AAGCAGGTTG CCGAGGCAAT CGAGGTGCTC GGCGAGCATG AAGTCGTGCC GCATCTGATG GATGCCGCCG AGCATGCGTT GCGCAGCCTC GACTGGCCGT CGGATACGGC GGCCTATATA TCCCGCACAT CCGGCGCCGT CATGCAGCAG AAGATCTGGG AACTGCCGGA GCTTGAGGTT AATCCGGCAG GCCTCACGGA CGAGCAGAGC GGCTCGGCAG CCACGGCCTT CGAGGCTCTG ACGGCGCGGT TGAAGGAGTT GCAGAAGCGT TTCCCGCCGA ATGGAGAACT GTTGCTGACG GGCCATGCGC ATATCGATCT CGCCTGGCTT TGGCCTTATC GCGAGACGCG GCGGAAAATG CGGCGCACCT TCAATACGGC GCTGTCGCTG ATGGAGCGCT CCGACGATTT CCGCTTCAAT CAATCGACCG CCCATTATTA CGCGCAGATG GAAGAGGAAG ACCCGGAGCT CCTCGATCGC ATCAAGCAGA AGGTCGCGGA AGGAAAGTGG GAAACCGTCG GCGGCATGTG GGTCGAGCCC GACACGAACA TGCCGACCGG CGAAAGCCTC GCCCGTCAGG TTCTCTATGG CCAGCGTTAT TTCGAAAAGA CCTTCGGCAC GCGCCATACG GTCTGCTGGC TGCCAGATTG CTTCGGCTTT TCAGGCGCGC TGCCGCAAAT CTTGAGGCAG GGCGGCATCG ACAGCTTCTT CACGATCAAG GTCAATTGGA GCGAGACCAA CCATATTCCA TCCGATCTCT TCTGGTGGAA GGGTCTCGAC GGCAGCCAGG TGCTGACCCA TACGTTCGAC AATCCCATGC AGGGTTATAA CGGCTTCGTG CAAGCCGATT GCTACGTGCC GACATGGAAG AATTTCCGCG GCAAGACACA GCACGATACC TCGCTTCTGG CCGTCGGTTA TGGCGACGGC GGCGGCGGCG TCACGCCCGA GATGGTCGAG CGCGAGGTGC AGCTACGCGA TTTCCCGGCT ATCCCGCAGG CACGCTGGGG CACCGTCAAA AGCTATTACG AACAGGCGCA CCGCACCGCA GGCGAAAAGA ACCTTCCGGT ATGGGATGGC GAAATCTATC TCGAACTGCA CCGCGCCACG CTGACATCGC AAAGCGGCGT CAAGCGCAAG CATCGCCAGG CCGAACGGGC GCTGATCACC GCCGAAACCA TCTCTTCGCT GGCGCATATG CTCGGAGCAG ACAGGCCGAA AAGCCTCGAA GCGGATTGGC GCGTGGTGCT GAAGAACGAA TTTCACGACA TTCTGCCGGG CTCGAGCATC CGTGAAGTTT ATCAGGATGC CGAGCAGGAA CTGGGCGGCG TCATCGAACG TGCTGGAACC GAACAGGCAA ATGCCTTGCA AGCGCTGTCG GCCAAGCTGC CGAAAGGCGG GGTTGGCGAT GCCCTCGTCG TCGTCAATCC GTCTCTTGCG GCCAGACCGC TCAGCGCAAC GCTCTCCGAC GGCACGGTCG TCGCGGCCGC CGATCTCGTC GCCCCCTTGT CCGTCGCGGT CTTTGACAAG GGCTCGCTCA AACCGGCGGG TGGGCTGAAG GCAGGCCCCG ACCGTCTCGA AAACGACTAC CTCGTCGTTG GCATCGGCAA GGACGGGGCG GTTTCGAGTC TCATTCACAA GGCCACGGGC CGGGAAGCGG TCGACGGCTC GGCAAATCAG CTCTGGGTCT ATCCGGCCGA CAAGCCGCGC AACTGGGACG CCTGGGACAT CGATGCGGAT TATGCCGAAA AGGCCGTCCG CCTCGAGGCG CCTGACAGTG TCACCCTCGT GGAAGACGGG CCGCACCGCG CGGCGATCCG CGTCGTTTAC CGCTACCGGA ATTCGAGCGT CACGCAGACC TATGTGCTGA CGGCCAACGC CAGGCGGCTC GACATCGAAA CGACGATCGA CTGGCATGAC CGCCGCACCC TGCTGCGAAC CCTGAACCCG GTTGCCGCGC AGGCCCGCAA GGCGACCTTC GAATGCGCCT TCGGCATTGT CGAGCGAGCA ACGCACACGA ATACGTCCTG GGAGCAGGCG ATGTTCGAGG CCGTCGCGCA CCGCTTCGTC GATATCAGCG AGCCGGACTT CGGGGTCGCG CTGATCAACA ATGCCAAATA CGGCCATAGC GCCCGCGGCA ACGTGATCGG CATGAGCCTT GTGCGCGGGC CGATCTATCC GGATCCGCTG GCCGACGAAG GTGAGCAGAG CTTCACCTAT GCGCTGATGC CGCATGAAGG CGCCTGGCAT GAAGGCGGCG TCCTCGACGA GGCGATCGAT CTCAACCAAC CGCTCGTCTC GGCTGAGGCC AGCGGCCTCT CCGCCGGCAC TTTCGCGCCT CTTGCGATCA CCGGCATCCC CGTCGCGTTC TCAGGCCTGA AGCCGGCGGA AGAGGGCGAC GGCCTCATCC TGCGCCTCTA TGAGCCAGCC GGCCGGCGCG GCAGGCTCGC CCTCGGGCTA CCTTCCGGAT GGGCAGCATC GCAGCCGCTG AACATTCTCG AAGAGCCGAT GGAGCGGAAG GGACCTGCCG ACATCATGCC GTTCGAAGTC AGGACCTGGA AGCTGCAGAA TGGCTGA
|
Protein sequence | MPLTIAQRLD RLKVRIAELA HWRDRQSAAI DGWTFEGEPI EHHQDWPHRQ GVVHFAVIAE APEAWPLEDI RLQLDLGGES LITLSYPDGE TETFGLDPYH QEFPVKGRRF SIATETVARF PFGEPNRAPR LNKARFIWLD GPAHRMHLLL KQVAEAIEVL GEHEVVPHLM DAAEHALRSL DWPSDTAAYI SRTSGAVMQQ KIWELPELEV NPAGLTDEQS GSAATAFEAL TARLKELQKR FPPNGELLLT GHAHIDLAWL WPYRETRRKM RRTFNTALSL MERSDDFRFN QSTAHYYAQM EEEDPELLDR IKQKVAEGKW ETVGGMWVEP DTNMPTGESL ARQVLYGQRY FEKTFGTRHT VCWLPDCFGF SGALPQILRQ GGIDSFFTIK VNWSETNHIP SDLFWWKGLD GSQVLTHTFD NPMQGYNGFV QADCYVPTWK NFRGKTQHDT SLLAVGYGDG GGGVTPEMVE REVQLRDFPA IPQARWGTVK SYYEQAHRTA GEKNLPVWDG EIYLELHRAT LTSQSGVKRK HRQAERALIT AETISSLAHM LGADRPKSLE ADWRVVLKNE FHDILPGSSI REVYQDAEQE LGGVIERAGT EQANALQALS AKLPKGGVGD ALVVVNPSLA ARPLSATLSD GTVVAAADLV APLSVAVFDK GSLKPAGGLK AGPDRLENDY LVVGIGKDGA VSSLIHKATG REAVDGSANQ LWVYPADKPR NWDAWDIDAD YAEKAVRLEA PDSVTLVEDG PHRAAIRVVY RYRNSSVTQT YVLTANARRL DIETTIDWHD RRTLLRTLNP VAAQARKATF ECAFGIVERA THTNTSWEQA MFEAVAHRFV DISEPDFGVA LINNAKYGHS ARGNVIGMSL VRGPIYPDPL ADEGEQSFTY ALMPHEGAWH EGGVLDEAID LNQPLVSAEA SGLSAGTFAP LAITGIPVAF SGLKPAEEGD GLILRLYEPA GRRGRLALGL PSGWAASQPL NILEEPMERK GPADIMPFEV RTWKLQNG
|
| |