Gene Smed_4931 details


Gene Information

Locus tag: Smed_4931
Symbol:
ID: 5318247
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1440647
End bp: 1442044
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 59%
IMG OID: 640776714
Product: glycoside hydrolase family protein
Protein accession: YP_001313646
Protein GI: 150377050
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2273] Beta-glucanase/Beta-glucan synthetase
TIGRFAM ID:
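
The coordinates and lengths listed above can be cross-checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the record does not state either convention. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1440647, 1442044)
print(length, protein_length(length))  # 1398 465
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.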


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.533696
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAGAA CCATACTGAA CGCGGTTGGA ACGCCGCTTT ACTACAGCGG CAGTTCGACG 
GCGTGGTTTT CCGCCACCGG GTCAGGACCA ACACTTTATG GCACCGCCGG CAATGATTCC
ATCTGGGGCG ACAGTTCCGT GAACGTAACG ATGATCGGCG GTCGAGGCGA CGATATCTAT
TATCTCTATT CCTCGATCAA CCGGGCCTTC GAGGCAGCCG GCGAGGGCGT TGACACGATA
AACACGTGGA TGAGCTATAC GCTTCCGGAG AATTTCGAGA ACCTTACCGT CACTGGAAAC
GGCCGTTCCG CTTTCGGCAA TGAGACGGAT AACATCATTA AAGGCGGTTC GGGCAGCCAA
ACGATCGACG GCCGCGGCGG CAACGACGTC TTGATCGGGG CTGGCGGCGC CGACACCTTC
GTGTTCGCGC GGGGCAACGG AAGCGACCTG ATCACCGATT TCAACTATGA TGACATTATC
CGCCTCGACG ATTACGGCTT CACATCGTTC GAGCAGGTTC TGGCCAATGT CGCGCAGGAA
GGCGCGGATC TCCGGCTTCA TCTCGCTGAT GGCGAGAGCC TGGTCTTCGC GAACACGACG
GCAGACGAAC TGGAGGCGCG CCAATTCAGG CTCAGTCTCG ACCGTTCCGT CCTGTCGCAG
ACCTTTTCCG ACGAGTTCAA TACGCTCGAT CTGCGAAACG GCACAAGCGG TGTCTGGGAC
GCGAAATACT GGTGGGCGCC GGAAAAGGGC GCGACACTTT CCAGCAATGG CGAGCAGCAA
TGGTATATCA ATCCGAGCTA CGCGCCGACT GCTTCCGCCA ACCCGTTTTC CGTCAGTAAC
GGCGTGCTGA CGATCACGGC GGCCCCGGCA TCGGAGGCGA TCCAGGCCGA GATCAACGGC
TATGATTACA CGTCCGGCAT GCTGACGACC CACTCGTCCT TCGCTCAGAC CTACGGTTAT
TTCGAAATGC GTGCGGACAT GCCGGACGAC CATGGCGTTT GGCCTGCCTT CTGGCTGCTG
CCCGCCGACG GATCCTGGCC GCCGGAACTC GATGTGGTGG AAATGCGCGG ACAGGACTCC
AACACCGTGA TTACCACGGT GCATTCCAAC GAAACGGGCT CGAGGGCCAG CATCGAAAAT
GCGGTGAAGG TGGCCGACGC AAGCGGCTTC CATACCTATG GTGTGCTTTG GACGGAGGAA
GAAATCGTCT GGTATTTCGA CGATGCGGCG ATTGCACGCG CCGATACCCC TGCCGATATG
CACGACCCGA TGTACATGCT CGTCAACCTT GCCGTCGGCG GCATGGCCGG CACGCCGAAT
GATGGGCTCG TCGACGGGTC AGAAATGAAG ATCGATTACA TTAGAGCCTA TGCGCTCGAT
GCAGACTGGC AGATCTGA
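
The gene sequence is printed in blocks of 10 bases, 60 per line, so it has to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC fraction calculation that could be compared against the GC content field. The helper names `clean_sequence` and `gc_content` are hypothetical, and the sketch assumes the sequence contains only A, C, G and T.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (assumes an unambiguous A/C/G/T sequence)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as displayed in this record.
first_line = "ATGAGCAGAA CCATACTGAA CGCGGTTGGA ACGCCGCTTT ACTACAGCGG CAGTTCGACG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_content(seq) * 100), "% GC in the first line")
```

Passing the full listing to `clean_sequence` should give a string whose length equals the Gene Length field.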
 
Protein sequence
MSRTILNAVG TPLYYSGSST AWFSATGSGP TLYGTAGNDS IWGDSSVNVT MIGGRGDDIY 
YLYSSINRAF EAAGEGVDTI NTWMSYTLPE NFENLTVTGN GRSAFGNETD NIIKGGSGSQ
TIDGRGGNDV LIGAGGADTF VFARGNGSDL ITDFNYDDII RLDDYGFTSF EQVLANVAQE
GADLRLHLAD GESLVFANTT ADELEARQFR LSLDRSVLSQ TFSDEFNTLD LRNGTSGVWD
AKYWWAPEKG ATLSSNGEQQ WYINPSYAPT ASANPFSVSN GVLTITAAPA SEAIQAEING
YDYTSGMLTT HSSFAQTYGY FEMRADMPDD HGVWPAFWLL PADGSWPPEL DVVEMRGQDS
NTVITTVHSN ETGSRASIEN AVKVADASGF HTYGVLWTEE EIVWYFDDAA IARADTPADM
HDPMYMLVNL AVGGMAGTPN DGLVDGSEMK IDYIRAYALD ADWQI
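
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the standard NCBI ordering (bases in the order T, C, A, G). Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the start codons it permits; since this gene begins with ATG, no special start handling is included here. The function name `translate` is illustrative, and the sketch assumes the sequence is given on the coding strand.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon.

    Whitespace used for display is ignored. Ambiguous bases such as N are
    not handled and raise KeyError.
    """
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence in this record.
print(translate("ATGAGCAGAA CCATACTGAA CGCGGTTGGA"))  # MSRTILNAVG
print(translate("GCAGACTGGC AGATCTGA"))               # ADWQI
```

Both fragments match the corresponding ends of the protein sequence listed above.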