Gene Smed_5985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5985 
Symbol 
ID5320287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp941255 
End bp942697 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content59% 
IMG OID640777661 
Productglucan 1,4-alpha-glucosidase 
Protein accessionYP_001314593 
Protein GI150377998 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.377487 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGTTG CATTTCGCAG TAAGCCAGTG ACAGTTGAGG ACCCGGCATA TAGAGGTCTG 
GCTATCGGCA CTTATGCGCC TGCAGTCACA CCGGCGCCTG CCTTCGCGCA GACCGACCTG
GCTGCATTGT CGCGCTACTA TTCTCTTCTG ATGATGCGCA ACATCACCAG CGACGGCTAC
GTCATCGAGG ATCCAGCATC GCCCGGCGTC TTTTCGGTAC CGGGCTGTGT CATCGCCGCG
CCTTCCTATC CAGCGAACAC GCCGGGTGTC GACCAGGACT ATGTTTTCAA CTGGGTCCGC
GACGGAGCTA TGACGGCCAT CGAGATCGCG CTTGCCGACT TGCCGCGCGT TTCGGGCGGG
GGTGTGCCGA GCCTGATCGA CTACGTGAAC TTCGCCGCGC TGTGTCAGGC GAATGCGAAG
AATTCCGCGA CCGCCACACT TGGCCATGCC TGCTTCACCA TCACCGGCAA GGTTCGTCCG
TGGTCGGAGC AAAATGACGG GCCGGCCATT CAGTCGATCG CCATACTGAC CTTGTTCGGT
CAGTTGGATG GCGCCACGCA GAAAATCGCT AAACGACTGG TTGAGACTAA CCTCTCTTAT
CTTCTCGAAG TTTACCAGAA CAAGACCACA AATCTCTGGG AGGAGTATGA GGGCTATTCC
TTTTTCGCAA GAGCCGTACA GCTGCGCTTT TTCCGGGAGA TTTCCAGAAA CACGATCGCT
ATTGCCGTGC CTGCCGGGGT GGCCGATGCC ATCTCCTGGC TGCAAACCCA GCTGGCCAAC
CACTGGAATG GGCAGCTCTA TGTGAGCGTT CTGGATGTCG CGGCGCAAGC CGGTTATGAC
GCGAACATCG ATATCGTCTC TTCGGTCTGC TATGGCGGGA TCCATCCGGC TGACACCAAG
CTTCTGGCAA CGGCGGCAAT CCTGCGGCGC CAGTGGGCTG ATATTTCGTC TTCGAGCTAT
TTCCCAATCA ATGGCGCCGA CGCGGCCAAA GGGCTCGGAC CCGCCTTCGG GCGCTATCCG
GGCGACCATT ACGATGGCGA TGTGGCGGCT CCGGTGGTCG GCGGGCATCC CTGGGCTCTG
TGCACCGCCA ACTTCGCCGA GTTTCAATAT CGGCTTGCCA ATGCCATAGA CGCCAGCGGC
GCCATTCCTC TCGATCAGTT CTCCGAACCC TTCTTCGCTG AATTGGGGCT TGGCGCATCC
AGCAGCGCCG CCGACGCGTC AACTGCCTTA CGCGCTTCGT CTGACGCCAT GCTGCGCGCC
ATCGTCTACC ACAGCGATCA CTACGAACTG AGCGAGCAGT TCGATGGAAC CACTGGCTAT
GAGAAAAGCG TCCGGAACCT GACCTGGAGC TACGCCTCTT TTCTCTCGGC AGTCAGAGCC
CGCTCCGGCG GTATCCCAGC CGGTAAGAAC AAACCCCGAA ACTCCCGCAG CCGAAGTTCG
TGA
 
Protein sequence
MTVAFRSKPV TVEDPAYRGL AIGTYAPAVT PAPAFAQTDL AALSRYYSLL MMRNITSDGY 
VIEDPASPGV FSVPGCVIAA PSYPANTPGV DQDYVFNWVR DGAMTAIEIA LADLPRVSGG
GVPSLIDYVN FAALCQANAK NSATATLGHA CFTITGKVRP WSEQNDGPAI QSIAILTLFG
QLDGATQKIA KRLVETNLSY LLEVYQNKTT NLWEEYEGYS FFARAVQLRF FREISRNTIA
IAVPAGVADA ISWLQTQLAN HWNGQLYVSV LDVAAQAGYD ANIDIVSSVC YGGIHPADTK
LLATAAILRR QWADISSSSY FPINGADAAK GLGPAFGRYP GDHYDGDVAA PVVGGHPWAL
CTANFAEFQY RLANAIDASG AIPLDQFSEP FFAELGLGAS SSAADASTAL RASSDAMLRA
IVYHSDHYEL SEQFDGTTGY EKSVRNLTWS YASFLSAVRA RSGGIPAGKN KPRNSRSRSS