Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4038 |
Symbol | |
ID | 5318338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 499771 |
End bp | 501405 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640775846 |
Product | alpha amylase catalytic region |
Protein accession | YP_001312779 |
Protein GI | 150376183 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | [TIGR02456] trehalose synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0583407 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGAGG CGCCGTGGTT CACAAGCTCG GTCATCTATG GAATCGACGT GCGAAGGTTC GCTGACGGAA ACGGCGACGG GATCGGCGAC TTCATCGGAC TGAGGGAGCG GGTAGTCTAT CTGAGCCACC TCGGCATCGA CTGCGTCTGG CTGTCGCCCT TCTTCAGGTC CCCCTTCGCC GACAATGGTT ACGACGTCAG CGACTATTAC TCTGTCGATC CTGCGCTCGG GACGCTGGAC GACTTCCTGA ACTTTCTGCA CGCTGCCGGC GAACACGGTA TTCGCGTCAT CATCGATCTC GTCGCCAACC ACACATCGAG CGAGCATCCG TGGTTTCAGG CCGCACGGCG GGATGCGAGG TGCCGCTTCC GCGATTACTA TGTCTGGTCC GCCAGCCCGC CGCCGGTCGC TCCGGACAAC AAGACGGCAT TTCCCGGTGA GGAAAGCAGC GTCTGGACGT ATGACGAGCT CGCCCAGGCC TATTATTTTC ACAAGTTTCG CCACTTCCAG CCGGACCTGA ACATCGCCAA TCCGGCGGTG CGCGACGAAT TGCTCCGCGT GGTCGATTAC TGGCTGACAT TGGGCGTCGA CGGCTTCCGG GTTGATGCCG CGCCCTTCGT CATCGGCGAG ACGGGCATAG AACACGCTGA TCCCAGGGAT CCTCATGGCT TCCTGCGGGA GATGCGTGAG CTGGTCGAAG GCAGACGCCG GGATGGTCTT CTGCTCGGCG AGGCGGACCT CTCGCCTGAA AAGCTGCGCC CCTATTTCGG TGAAGGGAAA CTCGATCTTC TGTTCAACTT CGTTCTGAGC GCGTCTTTCG CGGCAAGCCT CGCGCGGCAG AAGGCCGATC TCATAGGTCA GGCGCTTTCG ATAATGCCGG AGCCGCCTCC CCATCGAGGC TGGGTCAATT TCCTTCGCAA TCTCGACGAG CTCAACCTCG ACCGCCTGCC GGAAGACATC CAGCAGGAGA CCTTTGCCGC CTTCGCTCCG GACGAGGAGA TGCGGATCTA CGGACGCGGC ATCAGGCGTC GGCTCGCACC GATGCTCGAG GGAAACCAGA CGAGATTGGA ACTGGCGTTC AGCCTGCTTC TTTCTTCTCC GGGCGTGCCC CTTGTCCTCT ACGGCGACGA AATAGGCATG GGCGAAGACC CTTCCCGCCC GGGCCGTGAG CCCGTCCGCG TCCCCATGCA GTGGAACGCT GGCGCCAATG CCGGCTTTTC CACGGCCCAG CGCGCCAGGC TCATACAGCC AATCGTGACC GACGGACCCT TCGCCTTCAA GCGGATCAAT GTCGAAGCAC AGCGAGAGGA CCCCCGGTCG CTCCTCAACC GCGTCCGGGC GATGATCCTG ATGCGGCGCA GTCACAAGCT TTTTCAAAGG GGCCGGCCGA TCGTGCTGCA TACACGGGAT CCCGCGCTGT TTGCGCTCGC CTATTCCGAC GGCACCGAGC TGTTCGTCGT GCTGCATAAT CTAACGGAGG CCAAGCGGCG GGCGGAAGTC GAACTGCCCG GCGCCATCGA CGCCAGGCTC AAGGATGTTT TCGGCGAAGG CGAGGTCGAG CTCTCCGGCC AGCATCTGAC GATGGGTCTT GGCCCATTCG GCTATGCCTG GCTCCATTCG GGAAGGAAGG ACTGA
|
Protein sequence | MNEAPWFTSS VIYGIDVRRF ADGNGDGIGD FIGLRERVVY LSHLGIDCVW LSPFFRSPFA DNGYDVSDYY SVDPALGTLD DFLNFLHAAG EHGIRVIIDL VANHTSSEHP WFQAARRDAR CRFRDYYVWS ASPPPVAPDN KTAFPGEESS VWTYDELAQA YYFHKFRHFQ PDLNIANPAV RDELLRVVDY WLTLGVDGFR VDAAPFVIGE TGIEHADPRD PHGFLREMRE LVEGRRRDGL LLGEADLSPE KLRPYFGEGK LDLLFNFVLS ASFAASLARQ KADLIGQALS IMPEPPPHRG WVNFLRNLDE LNLDRLPEDI QQETFAAFAP DEEMRIYGRG IRRRLAPMLE GNQTRLELAF SLLLSSPGVP LVLYGDEIGM GEDPSRPGRE PVRVPMQWNA GANAGFSTAQ RARLIQPIVT DGPFAFKRIN VEAQREDPRS LLNRVRAMIL MRRSHKLFQR GRPIVLHTRD PALFALAYSD GTELFVVLHN LTEAKRRAEV ELPGAIDARL KDVFGEGEVE LSGQHLTMGL GPFGYAWLHS GRKD
|
| |