Gene Smed_4566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4566 
Symbol 
ID5319234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1053994 
End bp1055109 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content63% 
IMG OID640776367 
Productglycosyl transferase group 1 
Protein accessionYP_001313299 
Protein GI150376703 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.416296 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.520192 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCGCA ATGCGGCAAA CACAGGCAAT CCGATGATAA TGCATGTCAT TACCAATTTT 
ACCGCGAGTG CCGGCGCCGA AACGATGCTG GCGCGGCTGC TGCACGGATC GACGGACGAG
CGCATCATCG TGGTTTCGCT GATCGGCGTT TCGGACCGGA ACCGCCGCCT CGCCGACAAT
CCGAGAGTTT CCTATGTTTC GCTGGCAGCG GCATCGCTGA CAGCGCTTCC GGGCGCGATT
CTTCGGCTTG CGACGCTGAT CCGGAAAGAG CGGCCCGATG TTATCCTCTG CTGGATGTAC
CACGCGATGG TCGCCGGGAG CCTGGCGGCG GGGCTGGCCC GGCACGGGGC GCCGGTTTTC
TGGAACGTTC GCCAATCGCT GGACGATCCC GCTTCCCTCA CGCGCAGTTC GCGCGTTGCG
ATCGCGGCCG CGAAACTGCT GTCGCGCCGG CCGACGGGTA TTATCTACAA CAGCGCCCGC
GCGCTCGATC TGCATCGCGC CTACGGCTAT ACAAATCAAA ATGCGGTCGT CATACCCAAC
GGCTTCGAAC TCCCGGAGCT TGCGGCGCCC GAACCGCGGG CGGCCCGGCG CATCGGCATC
GTGGGCCGCT TTCACCCGCA GAAAGATCAC GGGACGTTTT TCAAAGCCGC CGCCCAGGTG
TTGAAGACCC ATCCGCAGGC AGTCTTTTCC GCAGCCGGCA ACGGGCTGGT CTGCGACAAC
CCGGAGGTCA TGGAACTGAT CGCGAAAGCG GGCCTCCCGG CCCACGCCGT CGATCTGCGG
GGGGAGGTCA GCGATATGCC TGCATTCTAT CGAAGCATCG ACCTGTTGGT GCTTTCGTCG
CGGACCGAAG GCTTCCCGAA TGTCATCGCT GAGGCCATGA GCTACGGCAA GCCGATCGTC
ACGACGGATG TTGGTGACGC GGCGGTCGTC GCCGGAAGGG CCGGCATCGC CGTACCGCCG
CGCAATCCGC AGGCTCTCGC CGAGGCAATG CGCGCCTTCC TCGATCTGTC CGAAGCAGAA
TACGCGCGCT ATGCGCGCAC CGCCCGAGAG CGCATCGAGA ATGAGTACGC GCTTGCCGCT
GTGAGTGCGA AATATTCAAA ATTTCTAACG GCTTAA
 
Protein sequence
MRRNAANTGN PMIMHVITNF TASAGAETML ARLLHGSTDE RIIVVSLIGV SDRNRRLADN 
PRVSYVSLAA ASLTALPGAI LRLATLIRKE RPDVILCWMY HAMVAGSLAA GLARHGAPVF
WNVRQSLDDP ASLTRSSRVA IAAAKLLSRR PTGIIYNSAR ALDLHRAYGY TNQNAVVIPN
GFELPELAAP EPRAARRIGI VGRFHPQKDH GTFFKAAAQV LKTHPQAVFS AAGNGLVCDN
PEVMELIAKA GLPAHAVDLR GEVSDMPAFY RSIDLLVLSS RTEGFPNVIA EAMSYGKPIV
TTDVGDAAVV AGRAGIAVPP RNPQALAEAM RAFLDLSEAE YARYARTARE RIENEYALAA
VSAKYSKFLT A