Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_0358 |
Symbol | |
ID | 6979072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 361945 |
End bp | 363597 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643395070 |
Product | alpha amylase catalytic region |
Protein accession | YP_002279883 |
Protein GI | 209547966 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.404275 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGTGG CTTCCCAATC GATCTCGACC CCCGATAAGG ACTGGTGGCG CGGCGCGGTG ATCTATCAGA TCTACCCGCG CTCCTACCAG GATTCGAACG GCGACGGCAT CGGCGACCTG AAGGGCATCA CCGCCCGCCT GCCGCATGTG GCAAGCCTCG GCGTCGATGC GATCTGGATC TCGCCCTTCT TCACCTCGCC GATGCGCGAT TTCGGTTACG ACGTTTCCGA CTACGAAAAT GTCGATTCGA TCTTCGGCAC GCTGGTGGAT TTCGACACGA TGATCGCTGA GGCCCATCGC CTCGGCATCC GCGTGATGAT CGACCTTGTC ATCTCGCACA GCTCGGATCA GCACCCCTGG TTCGTGCAAA GCCGCTCCAG CAAGACCAAC GCCAAGGCCG ATTGGTATGT CTGGGCCGAT GCCAAGCCGG ACGGTACGCC GCCGAACAAC TGGCTGTCGA TCTTCGGCGG CTCGGCATGG GCGTGGGATC CGACGCGCAT GCAATATTAC CTGCACAACT TCCTGACCTC GCAGCCGGAT ATGAACCTGC ATAATCCCGA GGTGCAGGAC CGTCTGCTCG ATGTCGTGCG CTTCTGGCTC AACCGCGGCG TCGACGGCTT CCGCCTCGAC ACCATCAATT TCTATTTCCA CGACCCGCTG TTGCGCGACA ATCCGGCACT TGCGCCTGAG CGCCGCAACG CCTCGACGGC GCCGGCGGTC AATCCCTATA ATTTCCAGGA GCATGTCTAC GACAAGAACC GGCCGGAGAA CCTTGCCTTC CTGAAGCGCT TCCGCGCCGT TCTGGAAGAA TTCCCGGCGA TTGCCGCCGT CGGCGAAGTC GGCGACAGCC AGCGCGGCCT CGAAATCGTC GGCGAATATA CCTCCGGCAA CGACAAGATG CATATGTGCT ATGCCTTCGA ATTCCTGGCG CCCGATCCGC TGACGGCCGA GCGCGTCGAA GAGGTGATGC AGGATTTCGA AGCCGCAGCA CCGGATGGCT GGGCCTGCTG GGCCTTCTCC AATCACGACG TCATGCGCCA TGTCAGCCGC TGGGGCGGGC TGGTCGCCGA TCATGACGCC TTCGCCAAGC TCTATGCCTC GCTGCTCCTG ACGCTGCGCG GCTCGGTCTG CCTCTATCAG GGCGAGGAGC TGGCGCTGAC CGAAGCCGAT CTCGCCTATC AGGATCTGCA GGATCCCTAC GGCATCCAGT TCTGGCCGGA GTTCAAGGGC CGCGACGGCT GCCGCACGCC GATGGTCTGG GACAGCCAGG TCGCCCAGGG CGGCTTTTCC ACGGTCAAGC CCTGGCTGCC GGTGCCGGTC GAGCATATTC TGCGCGCCGT CAGCGTCCAG CAGGGCGACG AGGCTTCGGT GCTGGAGCAC TATCGCCGTT TCATCGCTTT CCGCAAATTG CACCCGGCCT TTGCCAAGGG CGAGATCGAA TTCGAGGAGC CGCAGGGCGA CGCCCTGGTC TTCACCCGTG AATACGGCAA CGAGAAGCTG CTCTGCATCT TCAACATGAG CCCGGCTGAA ACCGGCGTCA CTTTGCCCGG CGGCGAATGG CAGGCGTTGA CGGGGCATGG CTTTATCAGC AACAACTATG GCGACAAGAT CGATATTCCG GCCTGGGGGG CGTATTTCGC CCGTCTCGCT TAA
|
Protein sequence | MNVASQSIST PDKDWWRGAV IYQIYPRSYQ DSNGDGIGDL KGITARLPHV ASLGVDAIWI SPFFTSPMRD FGYDVSDYEN VDSIFGTLVD FDTMIAEAHR LGIRVMIDLV ISHSSDQHPW FVQSRSSKTN AKADWYVWAD AKPDGTPPNN WLSIFGGSAW AWDPTRMQYY LHNFLTSQPD MNLHNPEVQD RLLDVVRFWL NRGVDGFRLD TINFYFHDPL LRDNPALAPE RRNASTAPAV NPYNFQEHVY DKNRPENLAF LKRFRAVLEE FPAIAAVGEV GDSQRGLEIV GEYTSGNDKM HMCYAFEFLA PDPLTAERVE EVMQDFEAAA PDGWACWAFS NHDVMRHVSR WGGLVADHDA FAKLYASLLL TLRGSVCLYQ GEELALTEAD LAYQDLQDPY GIQFWPEFKG RDGCRTPMVW DSQVAQGGFS TVKPWLPVPV EHILRAVSVQ QGDEASVLEH YRRFIAFRKL HPAFAKGEIE FEEPQGDALV FTREYGNEKL LCIFNMSPAE TGVTLPGGEW QALTGHGFIS NNYGDKIDIP AWGAYFARLA
|
| |