Gene Smed_4710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4710 
Symbol 
ID5318860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1227899 
End bp1230148 
Gene Length2250 bp 
Protein Length749 aa 
Translation table11 
GC content63% 
IMG OID640776508 
Productglycosyl transferase family protein 
Protein accessionYP_001313440 
Protein GI150376844 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.436323 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.224848 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGTTCG ACGACCGCAT GGGGGCCGCC AGCGGCATTT TCGAGGACGC GAAGGTTTTT 
CGGCTCGGTT CCGAAGTGGC CGTATTGATC TCGGACCTCA AGAAAAAACT GCCGGCCGCA
GCAAAACACA GGCTCGTCAT GCCGGAGCAA CCCGTGCCGC TGGTCAGCAC CATCCTTCCT
CTGGCGGAAG GCGGTCAGCG GGTCCTTTGG GCCATGCGCC CAGGGGGCGA ACCGCGCCGC
GGACAGATTT CCGTCGATAG CGACTTCGTC CGAACGGTTG TCCTGCAGCC CGTCGGCGAA
CTTCCGCCCC TCGACGTGGA AGCCCTGTTT GCCGCGCTAA CACCGGAGGG CTGCGTCAAG
TTTGTGAACA CTCTGCTTAC TGTTTGGCGG AGCGCCTTCC GCCTGTCCCG GGATGCATTC
TTCATAGGCC TCGTCGAGGA CGCCCTTCAT GCGCTGACAT CGAGGCCGCG GCCTGCAAAG
ATCGCTTGCC CGATCATGCC GGGCCGGTAT CTGATCGAGA CCGCGATAAC TCCCGACTTC
GGTGAGATCA GCGCGGTCTA TTCGCTCGGT GCCAATGCCG TTTTGCCGCT GGAGTCGCAA
GCTGTGACCG GCCCAGATCG CGAGCACGAC CTGCGCCCCT GCCACTTCAT TGTCGAGTCT
CCTCGACACC CTCAATCCTA TGTTCTCGTC GGAAAACGAG GCGTGGCTGT TCGCGAACTC
TCCTCCGGAA AACCACATTA CACCAGTCTC CAGGCATGGT GGGCCGAACG GGGTGGCGCG
CCGGAGCTGC GCGAATTCGT CGTCCGATGG CTTTCGACGA CACCGGAGGG CGGACATTCG
ACCGCCGTCG ACCTCCAGCT TCGGACACCG CTTCCGGCAA GGCGGATCGA CAGATCATCG
ATGCATCCTT CCGCAGAAGT GGATCTCGCA TTGAGCCTTT CGGGCGGTCT GCTCGCCGGG
GGCTGGTCCC ATGATCCAAC AGCCACGCTC GCCGGCATCG ACTACCTGAA TGAGAACGGC
ACGGCGGTGC CCCTCGACGG CAATTGGTAC GAGTTTCCTG CCTGGGCACG CGGAACGGAC
GACAATACGC GCGCCGACGT AACCGGCTTC GTCGCCTGGC TGCCGTTAAA CGAGGCGCCG
GGCGCCTTGC TCCAGCCGCG CTTCCAGATG CGGCTCGCTT CCGGCGTCGT AAAACCGCTC
GTGCCGAAGC CACAACCCTT CGAAGCTTCG GCGCAGCGCA ACCGCATCCT GCGCGCGGTG
CCGCCCCAGC ACGCAGTCGA CCTGGCCTTC CGGACGATCC TTGCCCCCGC GCTGCAGGAC
GTAGAACATC GATTGGGCAG AACTGCCACG GTCGACTATA CCAAGGATTA CGACCTGCCG
CAGACGGCGC CCCTGGTTTC GATCGTCGTG CCGCTTTACC GTGTTCTCGA TTTTCTGCGG
TTCCAGCTCT CCGGCATGGC GACCGACCGC TGGCTCGCAA AGAATGTCGA GATCATCTAC
GTTCTGGATT CACCCGAGAT CCAGGACGAG ACGGAGCATC TTCTTGGCGG GCTGCATCTC
CTCCACGGGC TGCCGATGAA GCTCGTGGTG ATGAGCCGCA ACGGGGGCTA TGCCCGGGCC
TGCAATGCCG GCGCGCGCTT CGCCCGCGGC GCGGTCATCG TCATGCTGAA CTCCGACGTC
GTTCCGTCAG CAGCCGGCTG GCTGCAGAGG CTCATCCGGC CGCTCATGGA ACAGAAGAGC
CTCGGCGCCA TCGGCCCGAA GCTGATCTTC GAGGACGGAT CGCTCCAGCA CGCGGGACTC
TATTTCGCTC GCGATCAGCG CGGCATATGG CTCAACCACC ATTTCCACAA GGGCATGCCA
GGCGACTATG CGCCCGCTCA ATTTTCCAGG AGCGTCCCGG GGATCACAGG TGCGTGCCTC
GTCACGCGTC GGGAGACCTA CGAGCGCGTG GATGGATATA CGGAGGATTA CGTAATCGGC
GACTACGAGG ATAGCGACTT GTGCCTGAAG ATCCGCCGGT GCGGCTATGA CATCGTCTAC
GAGCCGTCTG CGTGTCTTTA TCATTTCGAA CGCCGCTCGA TCCGCCGCAG CGAGGACTAC
ATGCGCGGCG TCGCCAGCCA GTATAACTCC TGGCTGCACA CGGAGCGGTG GGATGACGAT
ATCGAGGCGC TGATGGCAGC ATATCTCGGC AGCGGCATCC AAGGGCTCGT CCGCCAAGAG
GCCGGTGCCA CAGCGAGGAG CGCCGCATGA
 
Protein sequence
MTFDDRMGAA SGIFEDAKVF RLGSEVAVLI SDLKKKLPAA AKHRLVMPEQ PVPLVSTILP 
LAEGGQRVLW AMRPGGEPRR GQISVDSDFV RTVVLQPVGE LPPLDVEALF AALTPEGCVK
FVNTLLTVWR SAFRLSRDAF FIGLVEDALH ALTSRPRPAK IACPIMPGRY LIETAITPDF
GEISAVYSLG ANAVLPLESQ AVTGPDREHD LRPCHFIVES PRHPQSYVLV GKRGVAVREL
SSGKPHYTSL QAWWAERGGA PELREFVVRW LSTTPEGGHS TAVDLQLRTP LPARRIDRSS
MHPSAEVDLA LSLSGGLLAG GWSHDPTATL AGIDYLNENG TAVPLDGNWY EFPAWARGTD
DNTRADVTGF VAWLPLNEAP GALLQPRFQM RLASGVVKPL VPKPQPFEAS AQRNRILRAV
PPQHAVDLAF RTILAPALQD VEHRLGRTAT VDYTKDYDLP QTAPLVSIVV PLYRVLDFLR
FQLSGMATDR WLAKNVEIIY VLDSPEIQDE TEHLLGGLHL LHGLPMKLVV MSRNGGYARA
CNAGARFARG AVIVMLNSDV VPSAAGWLQR LIRPLMEQKS LGAIGPKLIF EDGSLQHAGL
YFARDQRGIW LNHHFHKGMP GDYAPAQFSR SVPGITGACL VTRRETYERV DGYTEDYVIG
DYEDSDLCLK IRRCGYDIVY EPSACLYHFE RRSIRRSEDY MRGVASQYNS WLHTERWDDD
IEALMAAYLG SGIQGLVRQE AGATARSAA