Gene Smed_0852 details

Gene Information

Locus tag: Smed_0852
Symbol:
ID: 5321690
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 910744
End bp: 911913
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 63%
IMG OID: 640789789
Product: glycosyl transferase group 1
Protein accession: YP_001326542
Protein GI: 150396075
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
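
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(910744, 911913)
print(length, protein_length(length))
```

For this record the script prints 1170 and 389, which agrees with the Gene Length and Protein Length fields.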


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.248563
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAAC GCCCCTTGCG TATAATTCAT TGCTTCAGGT CGCCGGTCGG CGGCATCTTC 
CGGCACGTGC GCGACCTTGC CGAAGCGCAC GCGAAAGCCG GGCATCAGGT CGGTATTCTT
TGTGACAGCA CCACGGGCGG TGCCCACGAG GATGCATTGT TCGAAGAGGT TCGTCCACAT
CTGGACCTCG GCATCGTCCG CGTGCCGATC CATCGCTCGG TCGGAGCCTC GGACGCGGCT
GCGCTGTGGC GCAGCTACAA GGAAATCAGA AGCTTGCAAC CGGATGTGCT GCACGGGCAC
GGCGCCAAGG GCGGCGTGTT GGCGCGCATC GCCGGTTCAG CCCTGCGGGT CAACAAGTAT
CGCGTAGCCC GCCTCTATTC GCCCCATGGG GGAAGCCTGC ATTATGATCG GCGGTCCCTG
GCAGGTTCGT TCATTCTTCG CATCGAGCGC CTGCAGGAAC GCCTGACCGA CGCACTCGTC
TTCGTTTGCG AGTATGAGCG CGGCACCTAC TGCGCCAAGG TGGGCCAACC AATTGCACGC
AGCGAACTGA TCTATAACGG CATCGAAGAT GCGGAGTTCG AGCGAGTCGA AGCCGACCCT
GGCGCCGCAG ATTTCCTCTA TATCGGGATG ATGCGCGACC TGAAGGGCCC GGACTTCTTC
ATCGAAGGGT TTGCCGCAGC CGAAGAGATC GCCGGCCGAA GGCTTTCCGC CCTGATGGTC
GGAGATGGTC CCCAACAGCG GCAATACGAA GAGATGACGC TGCGAATGGG TCTAGGAGAT
CGGGTTCGGC TGCTACCGGC GATGAGGGCG CGCAAAGCTT TCGCTCTTGC CCATGTCGTC
GTCATTCCCT CGCGTGCCGA ATCCATGCCC TATATCGTTC TGGAAGCGCT CGCCGCAGGC
AAGCCGGTCA TCGCAACCCG CGTAGGCGGC ATCCCGGAGG TTCTCGGGGC CGCTAGCGAG
GCGCTCGTGC GTCCCGACGA TGCAGAAGCA CTTGCCCGGC TCATGGCCGA GGCAATTGCC
GACGATGGCT GGGCTGCCCG GACAATGCCC GACGCCGAAG GCTTCAAGTC CCGCTTCGCG
GCGTCCGTGA TGACCAGACA CGTCATGCAG CTGTATCGGG AGCTTACGGC AGAATCGCTC
GTGCCGCATG GGCGGCTGCG TACAACGTAA
 
Protein sequence
MSERPLRIIH CFRSPVGGIF RHVRDLAEAH AKAGHQVGIL CDSTTGGAHE DALFEEVRPH 
LDLGIVRVPI HRSVGASDAA ALWRSYKEIR SLQPDVLHGH GAKGGVLARI AGSALRVNKY
RVARLYSPHG GSLHYDRRSL AGSFILRIER LQERLTDALV FVCEYERGTY CAKVGQPIAR
SELIYNGIED AEFERVEADP GAADFLYIGM MRDLKGPDFF IEGFAAAEEI AGRRLSALMV
GDGPQQRQYE EMTLRMGLGD RVRLLPAMRA RKAFALAHVV VIPSRAESMP YIVLEALAAG
KPVIATRVGG IPEVLGAASE ALVRPDDAEA LARLMAEAIA DDGWAARTMP DAEGFKSRFA
ASVMTRHVMQ LYRELTAESL VPHGRLRTT
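
The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons. The following is a minimal sketch, not the method used by the source database. It builds the codon table by hand, relying on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (third base varies fastest); '*' marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; stop codons become '*'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAGCGAAC GCCCCTTGCG TATAATTCAT"))
```

The first 30 bases translate to MSERPLRIIH, the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the whole protein and the GC content field to be compared in the same way.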