Gene Smed_4948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4948 
Symbol 
ID5318159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1460367 
End bp1461395 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content63% 
IMG OID640776731 
Productglycosyl transferase family protein 
Protein accessionYP_001313663 
Protein GI150377067 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.217559 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCAG CCGTGCCGAC AAATGTCTGC ATAATCATCG CCGCGAAGAA CGCCGCCGAC 
ACCATTGCCC GTGCGGTCTC CTCGGCGCTT GCCGAGCCGG AGGCGGCGGA AGTGGTCGTC
ATCGATGACG GCTCGACCGA TGACAGTGCT GCGGTCGCGC GCGCTGCAGA TGATGGCTCC
GGCCGGCTGA ACGTCGTTCG CTTCGAGGAA AATCGCGGCC CGGCGGCGGC ACGTAATCAT
GCAATCGAGA TTTCGAAGTC TCAGCTTCTT GGCGTGCTCG ACGCGGATGA TTTCCTCTTC
CCCGGCCGGC TGCGCCAGTT GCTTTCGCAG GAGGGCTGGG ACTTCATCGC CGATAATATC
GCCTTTATCG ATGCCGCGCA GGCGGCAAAC GCACAAGCCA GTATCGATCG CTTCGCCCCT
GCTCCTCGGC TCATCGATCT TGTCGGCTTC ATCGAGGGAA ACATTTCGCG CCGCGGCGTG
CGGCGCGGAG AAATCGGATT CCTGAAGCCG TTGATGCGGC GCGCCTTCCT CGACCAGCAT
GGCCTGCGCT ATAACGAGAC TTTGCGCCTC GGCGAAGATT ACGACCTCTA CGCTCGCGCG
CTGGCGAAGG GCGCACGCTA CAAGATCATC CACAGCTGCG GCTATGCCGC GGTCGTGCGC
GGCGACTCGC TGAGCGGCAG TCATCGAACC ATCGACTTGA AGCGTCTCTA TGAGGCCGAT
CGTGCAATTC TCGCCGGAAA CAAGCTGAGC AACGACGCCG AAGCAGCCCT ACGCCGACAC
GAGCGACACA TCCGGGACCG CTACGAACTG CGCCATTTTC TCGATCTCAA GAACCAGCGG
GGCTTCGCTC GTGCAATCAG CTATGCCCTG ACCCACCCAG CGGCTCTGCC GGCGATCCTG
GGTGGCATCC TTGCGGATAA GACCGAGCGT TTTCGTCCGT CCCGCGCGCC GGCTCCCCTT
GCCCTCGGCG GAACGGGCGA TGTCCGCTAT CTGCTCGAGG CCTTGGCCAT GGATCAGCCT
CAAAAATAG
 
Protein sequence
MTAAVPTNVC IIIAAKNAAD TIARAVSSAL AEPEAAEVVV IDDGSTDDSA AVARAADDGS 
GRLNVVRFEE NRGPAAARNH AIEISKSQLL GVLDADDFLF PGRLRQLLSQ EGWDFIADNI
AFIDAAQAAN AQASIDRFAP APRLIDLVGF IEGNISRRGV RRGEIGFLKP LMRRAFLDQH
GLRYNETLRL GEDYDLYARA LAKGARYKII HSCGYAAVVR GDSLSGSHRT IDLKRLYEAD
RAILAGNKLS NDAEAALRRH ERHIRDRYEL RHFLDLKNQR GFARAISYAL THPAALPAIL
GGILADKTER FRPSRAPAPL ALGGTGDVRY LLEALAMDQP QK