Gene Smed_4359 details


Gene Information

Locus tag: Smed_4359
Symbol:
ID: 5318382
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 857527
End bp: 858525
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 61%
IMG OID: 640776164
Product: glycosyl transferase family protein
Protein accession: YP_001313097
Protein GI: 150376501
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
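
The start and end coordinates, gene length, and protein length in this record agree with each other, and a short sketch can check that. It assumes 1-based inclusive coordinates and a gene length that includes the stop codon. Both are assumptions about how the record counts, not something the page states. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(857527, 858525)
print(length, protein_length(length))  # prints: 999 332
```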


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.947946
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAACGTAT CCGTTATCAT CAAGACATTG AATGAAGAAA AGCGAATTGC CGCGACTATC 
GAAAGCGCGC TTGCGGCGCT TGAAAGGACC AGCGGCGAGG TCGTCATCGC CGATAGCGGC
TCCTCGGACC GCACGATCGA GATCGCCTCG CAATACCCGG TCGTGATCGC CCAGATCGTG
CCTCCCGCGC GGCCGAGCTG CGGTATCGGC CCGCAGCTCG GTTTTCAGCA CTCCCGCAAG
GACTACATCT GTCTCATAGA CGGCGATATG CTGCTGGACG AGACGTTCCT CGAAGACGCC
ATCCGATTCC TGGCCGAACA CCCTGCAATT GCCGGCGTTA CCGGACGTGT CGAGGAAATG
CACATCTCGA ATCTCGAATT CGCGCGCCGC GTCAGCCGCA ATGCGCCTGA GAACCGGACG
GGTGCGGTCG ATCGCATGAA CGGAGGCGGG CTCTACAGGC GGAGTGCCAT CGAGAGCGTC
GGCTACCTTT CGGATCGCAA CCTTCACGGC TACGAGGAGT TCGATTTGGG CATCCGCCTC
AGGAGCGCGG GTTGGGGGCT CTATCGCCTC GATCGCCGGT TCGTCAGGCA CTTCGGCCAT
ACGGTCAACT CTTACCGGCT GCTCGTGCGC CGCTGGAAAA GCAAGTACCT CTTCGGTATC
GGCGAACTTC TGCGGGCATC GCTCGGCAAG CCCTATTTTT TTCAGCTCCT GCACGAGTTG
CCGGAGTTGA AGCTCTGGGG GGGCGTTTAT CTCTGGTGGC TAGCCTGCCT TGGCCTGATC
CTGTTTCTGC CCGACACGCT CCTGGCAATG GCCGCCGTCT GCCTGAGTTT CGCGGGTGCG
GTCCTGCTCG TTAGTTTCCG AAAGGGCGGT CTCAGCATGG GGCTGTATAC GGTCGTCGCC
TGGTTTTTCC ATGCCGCCGC GCTTCCTATC GGTCTGCTGC GAAGGCGGCG GCAGCCGGCA
GAGCCGATCG AAAGCAGAAT TTTCGGAACG GCGACATGA
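
The record reports a GC content of 61% for this gene. As a minimal sketch, a GC fraction can be computed from a sequence written in the space-separated 10-base blocks used above. The function name `gc_content` is hypothetical, and the example call uses only the first two blocks, so its result is not the whole-gene figure. Pass the full 999 bp sequence to compare against the reported value.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are ignored, so the blocked
    layout used in this record can be pasted in directly.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# First two 10-base blocks of the gene sequence only (illustrative input).
fragment = "TTGAACGTAT CCGTTATCAT"
print(f"{gc_content(fragment):.2f}")
```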
 
Protein sequence
MNVSVIIKTL NEEKRIAATI ESALAALERT SGEVVIADSG SSDRTIEIAS QYPVVIAQIV 
PPARPSCGIG PQLGFQHSRK DYICLIDGDM LLDETFLEDA IRFLAEHPAI AGVTGRVEEM
HISNLEFARR VSRNAPENRT GAVDRMNGGG LYRRSAIESV GYLSDRNLHG YEEFDLGIRL
RSAGWGLYRL DRRFVRHFGH TVNSYRLLVR RWKSKYLFGI GELLRASLGK PYFFQLLHEL
PELKLWGGVY LWWLACLGLI LFLPDTLLAM AAVCLSFAGA VLLVSFRKGG LSMGLYTVVA
WFFHAAALPI GLLRRRRQPA EPIESRIFGT AT
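
The protein sequence follows from the gene sequence under translation table 11, in which TTG can act as a start codon and is then read as methionine. The sketch below shows that relationship. It builds the codon table from the standard NCBI ordering (bases in the order T, C, A, G) and treats the first codon as an initiator. The name `translate_cds` and the start-codon handling are illustrative assumptions, not the database's own method.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order; table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Codons that table 11 allows as initiators; an initiator is read as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a CDS given in the sense orientation, stopping at the first stop codon."""
    dna = "".join(seq.upper().split())  # remove the spaces between 10-base blocks
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate_cds("TTGAACGTAT CCGTTATCAT CAAGACATTG"))  # prints: MNVSVIIKTL
```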