Gene Smed_4802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4802 
Symbol 
ID5318689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1321175 
End bp1322296 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content60% 
IMG OID640776596 
Productglycosyl transferase family protein 
Protein accessionYP_001313528 
Protein GI150376932 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.858117 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATATTC TCACGCGCAC GGCAGCGAAG CTCGCCAACG GGCGGATCGA GGAACCGCGC 
GCGCTAAATC GGATCGCCCG CGTCACCGTC GTTGTTCCTT GCTACAACTA CGGACGTTAT
CTGCGGCAAT GTGTCGAAAG CGTCACGCTG AACCAACCAG GCGTCGACGT CGAAGTCATC
ATCGTGGATG ATCGATCGAC TGACGATAGC GCCTATGTGG CGCGCTCAAT CCAGGATGCA
GACAAGCGCG TACACTTGAT AGCGCACAAA CAAAACAAGG GTCACATTGC AACCTATAAC
GACGGGCTGG AAGCCGCGAC CGGCGAATTC GTGCTGTTAC TTTCTGCCGA CGATCTGGTG
ACACCGGGTG CACTGACCCG TGCTGCTGAA TTCCTTGCTG CGGAACCGTC TGTCGGGCTT
GTTTATGGTA ACGCGATCCA CTTCCATGGC GAGTTGCCCG AGAGCCGAAT TGCCGGAGGG
AGTTGGATCG TATGGCCCGG CGTCGATTGG CTGCGGATCC GCTGCCGGTC GGGATTCAAT
ACCATCACCT CCCCGGAGGC GGTTATGCGC ACGGCAGTGC TGCGCGAAAT CGGCAACTAT
CGCGCCGACC TGCCGCATGC TGGCGACTTC GAGATGTGGC TTCGCACCTC TGCAGTGTCA
GACATCGGCT TCCTTGCCGG CGTTGATCAG GCCTATTACC GACATCACGC GACCAATATG
AACAAACAGG ATTTCGGCTC GGGCACCGCC CTCGGTCAGC TGATCGACCT CAAACAGCGC
TGGCAGTCCT TCGAAGCGGT ATTCAGCGGC GTTGGATCTG GGCTGGAGGA GGGACCACAA
CTGCTGGAGC TTGCCCGCAG CACCATCGCG CGTCAGGTGC TGGAGCGCAT CAACTATGCC
CATGCCAAAG GTTGGCGTGA TTTTCCCACA ACGGAATTCG AGGCTCTCGC GAGAGAAATC
CACCACAGCC CTGCGTCCAC CAGAGCGGGA AAAGTGCTCG CCAAAAGGAG ACACGACGGA
ACAGGCAGGC TTCCTGCTCA TGCTCTATGG CCGGCATGGG CGGTGCGCTG GCGCCTGGAG
GAATGGTGCC GCCGGTGGCG TCGCGGCCAG ATCGGCGTCT AG
 
Protein sequence
MDILTRTAAK LANGRIEEPR ALNRIARVTV VVPCYNYGRY LRQCVESVTL NQPGVDVEVI 
IVDDRSTDDS AYVARSIQDA DKRVHLIAHK QNKGHIATYN DGLEAATGEF VLLLSADDLV
TPGALTRAAE FLAAEPSVGL VYGNAIHFHG ELPESRIAGG SWIVWPGVDW LRIRCRSGFN
TITSPEAVMR TAVLREIGNY RADLPHAGDF EMWLRTSAVS DIGFLAGVDQ AYYRHHATNM
NKQDFGSGTA LGQLIDLKQR WQSFEAVFSG VGSGLEEGPQ LLELARSTIA RQVLERINYA
HAKGWRDFPT TEFEALAREI HHSPASTRAG KVLAKRRHDG TGRLPAHALW PAWAVRWRLE
EWCRRWRRGQ IGV