Gene Smed_4038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4038 
Symbol 
ID5318338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp499771 
End bp501405 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content62% 
IMG OID640775846 
Productalpha amylase catalytic region 
Protein accessionYP_001312779 
Protein GI150376183 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID[TIGR02456] trehalose synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0583407 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGAGG CGCCGTGGTT CACAAGCTCG GTCATCTATG GAATCGACGT GCGAAGGTTC 
GCTGACGGAA ACGGCGACGG GATCGGCGAC TTCATCGGAC TGAGGGAGCG GGTAGTCTAT
CTGAGCCACC TCGGCATCGA CTGCGTCTGG CTGTCGCCCT TCTTCAGGTC CCCCTTCGCC
GACAATGGTT ACGACGTCAG CGACTATTAC TCTGTCGATC CTGCGCTCGG GACGCTGGAC
GACTTCCTGA ACTTTCTGCA CGCTGCCGGC GAACACGGTA TTCGCGTCAT CATCGATCTC
GTCGCCAACC ACACATCGAG CGAGCATCCG TGGTTTCAGG CCGCACGGCG GGATGCGAGG
TGCCGCTTCC GCGATTACTA TGTCTGGTCC GCCAGCCCGC CGCCGGTCGC TCCGGACAAC
AAGACGGCAT TTCCCGGTGA GGAAAGCAGC GTCTGGACGT ATGACGAGCT CGCCCAGGCC
TATTATTTTC ACAAGTTTCG CCACTTCCAG CCGGACCTGA ACATCGCCAA TCCGGCGGTG
CGCGACGAAT TGCTCCGCGT GGTCGATTAC TGGCTGACAT TGGGCGTCGA CGGCTTCCGG
GTTGATGCCG CGCCCTTCGT CATCGGCGAG ACGGGCATAG AACACGCTGA TCCCAGGGAT
CCTCATGGCT TCCTGCGGGA GATGCGTGAG CTGGTCGAAG GCAGACGCCG GGATGGTCTT
CTGCTCGGCG AGGCGGACCT CTCGCCTGAA AAGCTGCGCC CCTATTTCGG TGAAGGGAAA
CTCGATCTTC TGTTCAACTT CGTTCTGAGC GCGTCTTTCG CGGCAAGCCT CGCGCGGCAG
AAGGCCGATC TCATAGGTCA GGCGCTTTCG ATAATGCCGG AGCCGCCTCC CCATCGAGGC
TGGGTCAATT TCCTTCGCAA TCTCGACGAG CTCAACCTCG ACCGCCTGCC GGAAGACATC
CAGCAGGAGA CCTTTGCCGC CTTCGCTCCG GACGAGGAGA TGCGGATCTA CGGACGCGGC
ATCAGGCGTC GGCTCGCACC GATGCTCGAG GGAAACCAGA CGAGATTGGA ACTGGCGTTC
AGCCTGCTTC TTTCTTCTCC GGGCGTGCCC CTTGTCCTCT ACGGCGACGA AATAGGCATG
GGCGAAGACC CTTCCCGCCC GGGCCGTGAG CCCGTCCGCG TCCCCATGCA GTGGAACGCT
GGCGCCAATG CCGGCTTTTC CACGGCCCAG CGCGCCAGGC TCATACAGCC AATCGTGACC
GACGGACCCT TCGCCTTCAA GCGGATCAAT GTCGAAGCAC AGCGAGAGGA CCCCCGGTCG
CTCCTCAACC GCGTCCGGGC GATGATCCTG ATGCGGCGCA GTCACAAGCT TTTTCAAAGG
GGCCGGCCGA TCGTGCTGCA TACACGGGAT CCCGCGCTGT TTGCGCTCGC CTATTCCGAC
GGCACCGAGC TGTTCGTCGT GCTGCATAAT CTAACGGAGG CCAAGCGGCG GGCGGAAGTC
GAACTGCCCG GCGCCATCGA CGCCAGGCTC AAGGATGTTT TCGGCGAAGG CGAGGTCGAG
CTCTCCGGCC AGCATCTGAC GATGGGTCTT GGCCCATTCG GCTATGCCTG GCTCCATTCG
GGAAGGAAGG ACTGA
 
Protein sequence
MNEAPWFTSS VIYGIDVRRF ADGNGDGIGD FIGLRERVVY LSHLGIDCVW LSPFFRSPFA 
DNGYDVSDYY SVDPALGTLD DFLNFLHAAG EHGIRVIIDL VANHTSSEHP WFQAARRDAR
CRFRDYYVWS ASPPPVAPDN KTAFPGEESS VWTYDELAQA YYFHKFRHFQ PDLNIANPAV
RDELLRVVDY WLTLGVDGFR VDAAPFVIGE TGIEHADPRD PHGFLREMRE LVEGRRRDGL
LLGEADLSPE KLRPYFGEGK LDLLFNFVLS ASFAASLARQ KADLIGQALS IMPEPPPHRG
WVNFLRNLDE LNLDRLPEDI QQETFAAFAP DEEMRIYGRG IRRRLAPMLE GNQTRLELAF
SLLLSSPGVP LVLYGDEIGM GEDPSRPGRE PVRVPMQWNA GANAGFSTAQ RARLIQPIVT
DGPFAFKRIN VEAQREDPRS LLNRVRAMIL MRRSHKLFQR GRPIVLHTRD PALFALAYSD
GTELFVVLHN LTEAKRRAEV ELPGAIDARL KDVFGEGEVE LSGQHLTMGL GPFGYAWLHS
GRKD