Gene Smed_4021 details


Gene Information

Locus tag: Smed_4021
Symbol:
ID: 5318830
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 477697
End bp: 479901
Gene Length: 2205 bp
Protein Length: 734 aa
Translation table: 11
GC content: 64%
IMG OID: 640775829
Product: Amylo-alpha-1,6-glucosidase
Protein accession: YP_001312762
Protein GI: 150376166
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3408] Glycogen debranching enzyme
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
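
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(477697, 479901)
print(length)                  # 2205
print(protein_length(length))  # 734
```

Both values agree with the Gene Length and Protein Length fields of this record.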


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAACCA ATCTCTCCGG AGCGGCGGCC GAAGCCGCTC CCGATGTGAC ATCTGCGGAC 
GAACTGGGGC CGGCAGCGGT CTCGCGTTAC GAGCGAAGCT CCAGATCCCT GAAACACGGC
GATACCTTCG CCGTGTTCGA CCATAACGGC GACGCGTTTT CCTCTCCTTC CAATCCGGAG
GGCATCTTCC ATCGCGATAC CCGCCATCTC TCTCAATTCG TGCTGAGGCT GAACGGCGCC
AGATCGCTGC TGCTGAGTTC CACGCTCCGC GACGACAATG CCACGCTCGA CTGCGATCTC
ACCAATCCGG CTCTCGTGCT CGATACCGGT GGCGAGGTTT TGAAGCATGA CCTCATTCAT
GTGCGGCGCA CCCGCTTTCT CTGGCAGGAG GCCTGCTACG AACGACTGGT GCTGCGCAAC
TTCGATGAGG CCCCGAGAAG GCTGCAGGTC GAGCTGTCCT TTGCCGCCGA CTTCGCCGAC
CTCTTCGAAG TGCGCGGCAC CCCAAGGCGG CGGCGCGGCA CCCATCATAA CGCGCTCGTC
GGCAATGGCC GGGTGGTCCT TGCCTATGAC GGCCTCGACG GCCAGACCCG CAGCACGACT
CTTCGCTTCG ATCCGGAGCC GCAGAGGCTT ACAGGGCAGG AGGCAAGCTT TGTCGTCGAG
CTCGCTCCGA ACCAGGCGCA ATCGATCTTC ATCGAGATCG TTTGCAACAG CCTTGAGGAC
AATCCGCACC CCCCGGCCTT CAACTTCTTT CTCGCGCTGC GCGACGCCCG CAGGGCGCTA
CGCTACTCCG CCTCGCGCGC GGCCGCGGTC GTGACCTCCA ATGCGGTGTT CAACGAGGCC
GTCAGACGCA GCGTCGCCGA TCTCTATATG CTGCTTACGG AGACGCCCGA GGGTCCCTAC
CCCTATGCCG GCATTCCCTG GTTCAGCACC GTCTTCGGCC GGGATGCGCT CATCACCGCG
CTTGAGACCC TGTGGCTGGA CCCGGCGATC GCGAAGGGCG TGCTCAGACA TCTCGCAGCA
AACCAGGCGA CCGATTTCGA CCCAGTTGCC GATGCGGAGC CGGGCAAGAT CCTGCATGAG
ATGCGTTATG GCGAAATGGC GGAACTCGGC GAAGTTCCTT TCCGGCGCTA TTATGGCAGC
GTCGATTCCA CCCCTCTCTT CATCATGCTT GCCGGCGCCT ATCTCGACCG TACCGGTGAT
ACCGACACCG TGCGGGGCCT CTGGCCGAAT ATAGTGGCCG CACTCGACTG GATCGATCGT
TTCGGCGACC GCGACGGCGA CGGCTTCGTC GAATATGGAA GCCGGACGGC GAAGGGCCTG
GTGAACCAGT GCTGGAAGGA CAGCCATGAT TCGATCTTCC ACGCCGATGG CAGACTGGCT
AAAGGGCCGG TCGCGACCGC CGAGGTGCAG GCATATATCT TCGGAGCCTG GCGGGCGGCC
GCCCGCCTGT CGCGCAAGCT CGGCCATGCA GAGGATGCGC TGAGGCTCGA GCAACGGGCC
GAAGACCTGC GCATCCGTTT CGACGCGGCC TTCTTCGACG AGGAGCTTGG CACCTATTCG
CTGGCACTCG ACGGGGACAA GAACCCCTGC CGCGTCCGTT CCTCCAATGC GGGGCATGCG
CTCTTCACCG GCATCGCACT GCCTGAGCGC GCCGGCAGGG TCGTGTCCAC GCTGATGGCC
CAGTCCTCCT TCTGCGGCTG GGGCGTGCGC ACGATCGCCG CTTCTGAGGC GCGCTACAAT
CCCATGAGCT ACCACAATGG CTCGGTCTGG CCGCATGACA ATGCTCTGAT CGCCGCAGGC
TTCGTACGCT ACGGCTTCCA GGCAGAAGCG GCCAGCATCT TCGAAGGGCT GTTCGCCGCC
TCCACCTATA TCGACCTCAG GCGCTTGCCG GAGCTCTTCT GCGGCTTTTC GCGCCAGCGC
GCCCGCGGCC CTACCTTCTA CCCGGTTTCC TGCGTGCCGC AGGCCTGGGC GGCAGCGGCA
CCCCTTTATC TGCTTCAGTC GATGATCGGC CTCGGCTTCG ACGCCGAGAA GTCGCAGGTC
ATGTTGACCG AACCCACCCT CCCGCCCTTC CTCGATGAAG TCGTGCTGAA ACGGCTGCGG
GTCGGGCCTG GGATCGTCGA CATTGCACTG AGGCGATCGC GATCCCAGGT GGTCGTGGAC
GTGCTGGAGC GGAAGGGCGG CGTGAAGGTA CTGACGACCC ATTGA
 
Protein sequence
MGTNLSGAAA EAAPDVTSAD ELGPAAVSRY ERSSRSLKHG DTFAVFDHNG DAFSSPSNPE 
GIFHRDTRHL SQFVLRLNGA RSLLLSSTLR DDNATLDCDL TNPALVLDTG GEVLKHDLIH
VRRTRFLWQE ACYERLVLRN FDEAPRRLQV ELSFAADFAD LFEVRGTPRR RRGTHHNALV
GNGRVVLAYD GLDGQTRSTT LRFDPEPQRL TGQEASFVVE LAPNQAQSIF IEIVCNSLED
NPHPPAFNFF LALRDARRAL RYSASRAAAV VTSNAVFNEA VRRSVADLYM LLTETPEGPY
PYAGIPWFST VFGRDALITA LETLWLDPAI AKGVLRHLAA NQATDFDPVA DAEPGKILHE
MRYGEMAELG EVPFRRYYGS VDSTPLFIML AGAYLDRTGD TDTVRGLWPN IVAALDWIDR
FGDRDGDGFV EYGSRTAKGL VNQCWKDSHD SIFHADGRLA KGPVATAEVQ AYIFGAWRAA
ARLSRKLGHA EDALRLEQRA EDLRIRFDAA FFDEELGTYS LALDGDKNPC RVRSSNAGHA
LFTGIALPER AGRVVSTLMA QSSFCGWGVR TIAASEARYN PMSYHNGSVW PHDNALIAAG
FVRYGFQAEA ASIFEGLFAA STYIDLRRLP ELFCGFSRQR ARGPTFYPVS CVPQAWAAAA
PLYLLQSMIG LGFDAEKSQV MLTEPTLPPF LDEVVLKRLR VGPGIVDIAL RRSRSQVVVD
VLERKGGVKV LTTH
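
The relationship between the two sequences can be illustrated by translating the gene sequence codon by codon. This is a minimal sketch, not the pipeline that produced the record. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the set of permitted start codons). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering
# (first base varies slowest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = "ATGGGAACCA ATCTCTCCGG AGCGGCGGCC GAAGCCGCTC CCGATGTGAC ATCTGCGGAC"
last_line = "GTGCTGGAGC GGAAGGGCGG CGTGAAGGTA CTGACGACCC ATTGA"
print(translate(first_line))  # MGTNLSGAAAEAAPDVTSAD
print(translate(last_line))   # VLERKGGVKVLTTH
```

The two outputs match the start and end of the protein sequence above. The last line can be translated on its own because the 17 preceding lines of 60 bases each keep the reading frame aligned.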