Gene Smed_4597 details

Gene Information

Locus tag: Smed_4597
Symbol:
ID: 5318507
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1096819
End bp: 1098066
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 61%
IMG OID: 640776398
Product: glycosyl transferase group 1
Protein accession: YP_001313330
Protein GI: 150376734
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
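
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that contributes no residue; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1096819
end_bp = 1098066
gene_length_bp = 1248
protein_length_aa = 415

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and is not counted as a residue.
residues = codons - 1

print(span_bp, remainder, residues)  # prints: 1248 0 415
```

Under these assumptions the span matches the stated gene length, and the codon count minus the stop codon matches the stated protein length.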


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.211799
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.982264
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCCTGA AACCGAAGAT CGCCGTCGTG CTGAAGGGTT ATCCGCGCCT TTCGGAAACG 
TTCATCGCAC AGGAATTGCT GGGCCTTGAA AGAGCAGGGC ATGAACTCGT CCTAATCGCC
CTGCGCCGAC CGACTGATGG GAAGCGCCAC CCCGTCCATG ATGAAATTGG AGCGCCCGTC
CATTACCTTC CGGAATATCT GCACGAGGAG CCGTGGCGCG TCCTTCGCGC CCTAACGAAG
ACCGTGACGA AACGCTCCTT CTGGCGGATG CTCCGGCCGT TCTTCAGGGA TCTCGCGCGC
GACAGATCGC GCAACCGTTT CCGTCGCCTC GGCCAGGCTC TGGTGCTCGT CGCCGAATGG
CCGGACGATG CCGGCTGGTT GCATGCTCAT TTCATACATA CGCCGGCATC GGTGACAAGC
TATGCCAGCA TGATCTCCGG CATACCCTGG ACTTGTTCCG CGCATGCCAA GGACATCTGG
ACTTCGCAGG ATTGGGAGCT TTCCGACAAG CTCGGCCGCG CGCGCTGGAC GGTGACCTGC
ACGCGAAGCG GCTATGAGCA CCTGCGGGAC CTGTCGAGCG ACAAGACCCG AGTGCATCTG
AGTTATCACG GCCTCGATCT CGATCGGTTC CCGTCCTTCG AAGGTGAGCA TTCCCGGCGC
GATGGCAGTG TTCCGGACGA CCCGGTGCGC ATCGTCAGTG TCGGACGCGC CGTCTCGAAA
AAGGGATATG ACCTTCTCTT GAAGGCGCTG TCGCTGCTGC CTGCGGACCT CAGCTGGCGC
TTCGATCATA TAGGTGCGGG CGAGCTCACC GGCAAGCTCC AGGCGCTTGC CGGTGAACTC
GGCCTCGAAG ATCGCTTAGG ATGGCACGGC GCACTGGATC AAAAGGAGGT TCTGAGCCGC
TACCGAGAGG CCGACATCTT CGCGCTCGCC TCTCGGGTCG CGGCGAATGG TGACCGAGAC
GGCCTGCCGA ATGTTCTCGT AGAGGCATCG AGTCAGCGCC TTGCCTGTAT CTCGACCGCG
GTCTCCGGAA TACCCGAACT TATCGATGAC GGTCATAATG GTATGCTGGT GCCGCCGGAA
AATCCGACGG CACTTGCCGC GGCAATAGAG CGATTGATCC GCGATCCGGA TCTTCGCCGG
CAACTTGGTG CCGCCGCGGA ACGGCGCGTG CGCGCCGATT TCGACCACCA TTCGAGCGTC
GGTCAGTTGA TCGGGCTCTT CGAAAGCGAA TGGAGAAGAA GCCCTTGA
 
Protein sequence
MSLKPKIAVV LKGYPRLSET FIAQELLGLE RAGHELVLIA LRRPTDGKRH PVHDEIGAPV 
HYLPEYLHEE PWRVLRALTK TVTKRSFWRM LRPFFRDLAR DRSRNRFRRL GQALVLVAEW
PDDAGWLHAH FIHTPASVTS YASMISGIPW TCSAHAKDIW TSQDWELSDK LGRARWTVTC
TRSGYEHLRD LSSDKTRVHL SYHGLDLDRF PSFEGEHSRR DGSVPDDPVR IVSVGRAVSK
KGYDLLLKAL SLLPADLSWR FDHIGAGELT GKLQALAGEL GLEDRLGWHG ALDQKEVLSR
YREADIFALA SRVAANGDRD GLPNVLVEAS SQRLACISTA VSGIPELIDD GHNGMLVPPE
NPTALAAAIE RLIRDPDLRR QLGAAAERRV RADFDHHSSV GQLIGLFESE WRRSP
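
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the method used to produce the record: it assumes the codon assignments of NCBI translation table 11 (which match the standard code for internal codons), treats a first codon from the table 11 initiator set as methionine, and stops at the first stop codon. The function name `translate` and the helper constants are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: initiator codons of translation table 11; these are read as M
# only when they occupy the first position of the coding sequence.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, with the spacing used in the listing.
print(translate("GTGTCCCTGA AACCGAAGAT CGCCGTCGTG"))  # prints: MSLKPKIAVV
```

Note that the leading GTG is read as M while the GTG at codon 10 is read as V, which is consistent with the first ten residues of the protein sequence shown above.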