Gene Smed_4583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4583 
Symbol 
ID5318999 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1077445 
End bp1078635 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content61% 
IMG OID640776384 
Productglycosyl transferase group 1 
Protein accessionYP_001313316 
Protein GI150376720 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAAA ATTCGAACTC CCCGGCTGTC GTGGCTGGAA ACATGTCCGG GGCCCGTGTG 
ACGATTATCC TGCCGAGCCT TGGGGCCGGC GGCACCGAAC ATGTGGTAAA GCTGGTCGCC
AACCATTGGG CCCAGCTCGG TTGCAAGGTG ACGCTGATCA CGCTCGAACT GCCTCATGCC
AGACCTTATT ACGAATTCGA TCCGAGGATC GCGATCGAAC GTCTCGGTTT GCCGCCGCAG
CAAGGCGGGA AGATTCGGGC GGGCCTGCTC GTGCTCAGGA GAATCTACCG CCTGCGCTCT
GCAATTCGCC ACTCGCAGCC CGATTTCGTC TTGAGCTTCC TGACCCGGAC CAATGTACTG
ACGCTTCTTG CAACGATCGG ACTGCCGGTG CCTGTGGTCG TTTCCGAGCG CAATAATCCG
GCGCTGCAGC CTTTCGGTGT GTTCTGGAAA TGGATTCAGC GCCGTTTGTA TCCGCGCGCA
TTCGGGCTCG TGACTATGAC GAGGGGCGCT CTCGACTATT TTCCGGAGAA GATGCGCAGC
CGAGGGTGGG TTATCGCCAA TGCCGTCGAT CTCCCCGGCG AATGGCAGAA GAGACGCGGC
AACAATATCC TGACCGCCGT CGGCCGGCTG ACGCGACAGA AAGGCTTCGA TCTCCTGATC
GAGGCCTTTG CGAGGATTGC CTCGAGGCAC CCCGAATGGA AGCTCGTCAT CTGGGGCGAG
GGCGACGACA GGAAGTCGCT CGAGGCCCTG CGGGATGCGT TGGATATGAC CGACAGGGTG
GAGATGCCGG GCGTGACGCA AAGGCCCGGA GTGTGGGTTG AGACGGCTGA CGTATTCGTA
TTGTCGTCGC GCTACGAGGG ATGGGGCATC GTTCTGCTCG AGGCCATGGC TGCAGGGCTT
CCCGTGGTTT CCTTTGCATG CGAGTGGGGC CCCTCGGACA TGGTGGAGCA TGGGGAGGAT
GGACTTCTCG TTCCCAGCAA TGACGTGGAT GCTCTTGCCG AGGCGCTCTC CAGGGTCCTT
GCCGACGGCG AGCTCAGAAG CCGTCTGGCT GCAAATGCAG AGGCGAGCGC CAAGAGATAC
TTGCCGGATC GCATACTTTC GCAATGGGAC GCAGTCGCCT TATCGGCCTT GAAACATACG
GCTCGCGACC ATGCCGCAAC GGCTTCGGTC GTCGGAGCCG GCTCGGCTTG A
 
Protein sequence
MSENSNSPAV VAGNMSGARV TIILPSLGAG GTEHVVKLVA NHWAQLGCKV TLITLELPHA 
RPYYEFDPRI AIERLGLPPQ QGGKIRAGLL VLRRIYRLRS AIRHSQPDFV LSFLTRTNVL
TLLATIGLPV PVVVSERNNP ALQPFGVFWK WIQRRLYPRA FGLVTMTRGA LDYFPEKMRS
RGWVIANAVD LPGEWQKRRG NNILTAVGRL TRQKGFDLLI EAFARIASRH PEWKLVIWGE
GDDRKSLEAL RDALDMTDRV EMPGVTQRPG VWVETADVFV LSSRYEGWGI VLLEAMAAGL
PVVSFACEWG PSDMVEHGED GLLVPSNDVD ALAEALSRVL ADGELRSRLA ANAEASAKRY
LPDRILSQWD AVALSALKHT ARDHAATASV VGAGSA