Gene Smed_1160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1160 
Symbol 
ID5322006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1237755 
End bp1238789 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content61% 
IMG OID640790101 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_001326846 
Protein GI150396379 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0839942 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.394325 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTATG CTTTTCGCAT GAGCGAATCA AAAGCATTCA TTTCCGGCTG CAAGGGCCTT 
ACGCTGACGC AGGAAGAGCG CGACTTCTTC GCCGGCGAGC GCCCATGGGG CTTCATTCTC
TTCGGACGGA ATATCGGCGA GGAAGAGCAG ATCTGCGACC TGGTGGCGAG CCTGCGCGAC
AGCATAGGTA ACCCGGGAGC ACCGGTGTTG ATCGATCAGG AGGGCGGCCG TGTTCAGCGC
ATACGCCCGC CGCTCGTCGC GCAATATCCC AATGGCGCGG CGATCGGCGA AATCTATCGC
CGGGATCGTG AACTCGGTGT GCGTGCCGCG TGGCTCATGG GACGCCTGCA TGCGTTCGAC
CTGATGCGCT TCGGCATCAC GGTCGATTGC CTGCCGGTAC TCGACGTACC GGTTCCCGGG
AGCCACGACG TGATCGGCAA CCGCGCCTAT GGGCATGATC CGGCGACGGT CACTGAGATC
GGCCGCGCCA TGAGCGAAGG GTTGAAGGCT GGGGGCATGC TGCCGGTCAT GAAGCATATG
CCCGGTCACG GCCGAACCTT CGTCGATTCG CATCACAGCC TGCCGGTCGT CAGCGCCGGC
CTCGATGAAT TGAAGAGTAG CGATTTTCTT CCCTTTGCGG CGATGAAGGA TGAAGCGATG
GCCATGTCCG CGCACATGGT CTTCACTGCA ATCGACCCGG ACAACCCCGC AACGACCTCC
ACAAAGGTCG TTCGCGAGAT CATTCGAGGC CATATTGGCT TCGACGGCCT GTTGATGTCC
GACGACGTTT CCATGAATGC CCTTGCCGGG GACATGGCCG CACGCGCCCG CGGAATAATC
GCCGCCGGTC TTGATCTCGT ATTGCATTGT CATGGCATTA TGGAGGAAAT GAAAGCTGTG
GCAGATGTCG TTCCGGTCAT CTCCGGGGAG AGGCTCCGCC GGGCTAAGGC TGCCGAGGCA
GCCTTCCGGG AACCGGACAG TTCGGTCGAA GCTGCACTGC GCACAGAGTT TAACGCAATG
TTCGCGCTCG CCTAG
 
Protein sequence
MQYAFRMSES KAFISGCKGL TLTQEERDFF AGERPWGFIL FGRNIGEEEQ ICDLVASLRD 
SIGNPGAPVL IDQEGGRVQR IRPPLVAQYP NGAAIGEIYR RDRELGVRAA WLMGRLHAFD
LMRFGITVDC LPVLDVPVPG SHDVIGNRAY GHDPATVTEI GRAMSEGLKA GGMLPVMKHM
PGHGRTFVDS HHSLPVVSAG LDELKSSDFL PFAAMKDEAM AMSAHMVFTA IDPDNPATTS
TKVVREIIRG HIGFDGLLMS DDVSMNALAG DMAARARGII AAGLDLVLHC HGIMEEMKAV
ADVVPVISGE RLRRAKAAEA AFREPDSSVE AALRTEFNAM FALA