Gene Smed_4708 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4708 
Symbol 
ID5318858 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1225798 
End bp1227054 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content63% 
IMG OID640776506 
Productglycosyl transferase group 1 
Protein accessionYP_001313438 
Protein GI150376842 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.514087 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGTCC TCGTCGCGGC TCACAACCAT CCGGCCCTTC ACCCGGGAGG CACGGAAATT 
TTCGCCCACG ACCTGTTCCG CGCCTATAAG CGCGCGGGCT GCGAATCGCT CTTCCTGGGC
GCCACCAACC AGATCCACCG ACAGGCACGC CCGGGCACCA GCTTTCAAGG GATCGGCCCG
GCAGGAGACG AACTGCTGCT GTGGTCTGGC CACTTCGACC GATTTTTCAT GAGCCAGATC
GATCTCTACG GCGTCGTTCC CGACCTGGCG GAACTGCTGC GCGACTTCCG GCCGGACGTC
GTCCACATTC ACCACCTGCT GCTGCTCGGC GCGGAGTTTC CACATATCGT GCGCCGTACG
CTGCCTGAGT GCCGGATCGT CATGACGCTG CATGACTATT ATCCCATCTG TCATCACGAC
GGCTTGATGG TGAGGACGAG CGGCAAAGAG CTTTGCCACG GAGCGAGCCC CGACAGATGC
CATGCCTGCT TCAAGGACAT AGCACTCGAC CGGTTCGCGC TGCGCGAACG CCACCTGAAG
GCGCTGTTGA GCGACGTCGA CCGGTTCGTG TCGCCGAGCA ATTTCCTTAA AACGCGCTTC
GTCGAATGGG GGTTATCGGA AGACGCAATC AGCGTCATTC CGAACGGATT GCCGCCGCGC
AAGGAACCGG CGGCAGTTCG TCGGATCGGC TCGGATCGTC CGATCTTCGG CTACTTCGGC
AATCTCAATC CGTGGAAGGG CGTCGCTGTA CTGCTCGAAG CGGCGCGGCA GCTCATCGCA
GAGGGGCTGG AGTTCGAGCT GCGCGTTCAT GGCGGCGCCC CCTTCCAAAG CGAGAGCTTC
GTCGAAGAGA TCACGCGCCT GTTCCAGGAG ACGGCACCAA CCGTACAGCA GCGGGGGCCC
TATCGGCGCG AGGACGTGAT CGACCTCGTC GCCTCGGTGG ATTGCACGAT CGTGCCCTCG
ATCTGGTGGG AGAATGCGCC ATTGGTCATC CAGGAGGCGC AGGCTCTCGG GCGGCCGGTC
ATAGCCAGCA ACATCGGCGG CATGGCCGAG TTGATCGAGG ATGGGTCAAA CGGGCTCACC
GTCGCGCCCA ACGATCCGCG GGCGCTGGCC TCTGCCATGC GCCGTCTTGC ACAGGACGGC
GGATTGGCGC GCCGGCTTGC CGCAAACGCG CACGAACCCG AGAACATCGA CACGACCGCC
CGACGCTATC TCGAATTGAT CGACACGATT GCGCCGTCAC GAATCGAAGC GGCATAA
 
Protein sequence
MRVLVAAHNH PALHPGGTEI FAHDLFRAYK RAGCESLFLG ATNQIHRQAR PGTSFQGIGP 
AGDELLLWSG HFDRFFMSQI DLYGVVPDLA ELLRDFRPDV VHIHHLLLLG AEFPHIVRRT
LPECRIVMTL HDYYPICHHD GLMVRTSGKE LCHGASPDRC HACFKDIALD RFALRERHLK
ALLSDVDRFV SPSNFLKTRF VEWGLSEDAI SVIPNGLPPR KEPAAVRRIG SDRPIFGYFG
NLNPWKGVAV LLEAARQLIA EGLEFELRVH GGAPFQSESF VEEITRLFQE TAPTVQQRGP
YRREDVIDLV ASVDCTIVPS IWWENAPLVI QEAQALGRPV IASNIGGMAE LIEDGSNGLT
VAPNDPRALA SAMRRLAQDG GLARRLAANA HEPENIDTTA RRYLELIDTI APSRIEAA