Gene Smed_6186 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_6186 
Symbol 
ID5320488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp1108381 
End bp1109661 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content56% 
IMG OID640777804 
Productglycosyl transferase family protein 
Protein accessionYP_001314736 
Protein GI150378141 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.353807 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACCTGC TTGACACAAC CAGCACCGCC GCTATCTCGA TCTACGCGCT GCTCTTGACC 
GCCTACAGGA GCATGCAAGC CCTACATGCT CGGCCGATAG ACGGTCCAGC AGTGTCGGCA
GAACCGGTCG AGACCCGCCC TCTGCCAGCC GTGGATGTTA TCGTCCCCAG CTTCAATGAG
GACCCAGGCA TCCTCTCGGC GTGCCTTGCG TCCATTGCAG ACCAGGATTA TCCCGGAGAA
TTGCGAGTCT ATGTCGTTGA TGATGGTTCT CGGAACCGCG AGGCCATTGT GCGTGCACGC
GCCTTCTATT CGCGCGATCC GAGGTTCAGC TTCATTCTGC TCCCAGAGAA CGTCGGAAAG
CGGAAAGCGC AGATTGCCGC GATAGGTCAA TCCTCTGGGG ATTTGGTGCT GAATGTCGAC
TCGGACAGCA CGATCGCTTT CGATGTGGTC TCCAAGCTTG CCTCGAAGAT GGGAGATCCA
GAGGTCGGTG CGGTTATGGG TCAACTCACG GCTAGCAATT CGGGTGACAC TTGGCTGACG
AAATTGATCG ACATGGAGTA TTGGCTTGCC TGCAACGAAG AACGCGCGGC ACAGGCTCGC
TTCGGTGCTG TTATGTGTTG CTGCGGCCCT TGTGCTATGT ACCGTCGGTC GGCGCTCGCT
TCGCTGCTTG ACCAGTACGA AACGCAACTG TTTCGCGGTA AGCTAAGCGA CTTCGGTGAG
GACCGCCATC TGACGATCCT CATGTTGAAG GCAGGTTTTC GAACTGAGTA TGTTCCAAAC
GCCATAGTGG CAACCGTCGT CCCGGATACA CTGAAACCGT ATCTGCGCCA ACAACTGCGT
TGGGCACGCA GCACGTTCCG TGACACGTTT CTAGTGCTCC CTCTGTTGCG CGGCCTCAAC
CCTTTTCTCA CATTGGACGT GGTCGGGCAG AATATCGGGC CACTGTTGCT CGCTCTGTCG
GTCGTGACGG GACTTGCGCA TTTCATAATG ACCGCCACAG TGCCATGGTG GACGATTTTG
ATTATTGCGT CCATGACCAT TATACGCTGC AGCGTCGTAG CATTGCATGC TCGCCAACTT
AGATTTCTTG GCTTCGTTCT GCACACACCC ATCAACCTCT TTCTCTTACT TCCGTTGAAA
GCTTATGCGT TGTGTACATT GTCCAATAGC GACTGGCTGT CACGCTACTC CGCGCCAGAA
GTACCAGTCA GCGGAGGAAA GCAGACTCCA ATTCAAGCCT CCGGCCGAGT GACACCTGAC
TGCACTTGCA GCGGCGAGTG A
 
Protein sequence
MYLLDTTSTA AISIYALLLT AYRSMQALHA RPIDGPAVSA EPVETRPLPA VDVIVPSFNE 
DPGILSACLA SIADQDYPGE LRVYVVDDGS RNREAIVRAR AFYSRDPRFS FILLPENVGK
RKAQIAAIGQ SSGDLVLNVD SDSTIAFDVV SKLASKMGDP EVGAVMGQLT ASNSGDTWLT
KLIDMEYWLA CNEERAAQAR FGAVMCCCGP CAMYRRSALA SLLDQYETQL FRGKLSDFGE
DRHLTILMLK AGFRTEYVPN AIVATVVPDT LKPYLRQQLR WARSTFRDTF LVLPLLRGLN
PFLTLDVVGQ NIGPLLLALS VVTGLAHFIM TATVPWWTIL IIASMTIIRC SVVALHARQL
RFLGFVLHTP INLFLLLPLK AYALCTLSNS DWLSRYSAPE VPVSGGKQTP IQASGRVTPD
CTCSGE