Gene Smed_5687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5687 
Symbol 
ID5319989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp655628 
End bp657925 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content58% 
IMG OID640777414 
Productglycosyl transferase group 1 
Protein accessionYP_001314346 
Protein GI150377751 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase
[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCCGC TTGAGCCACC ACTCCGTATT CTGTTCGTCT TCGCATGGCT GGTGGTCGGG 
GGCGAGGAGA CGGAGGTCCG CCTTCTGGCT CGCAGTCTCA ATCGAGTTCG CTACCGGATA
GATGTTGTCG CCTGTTTCCA TAAGCCCAAC ATGCCTAGCC AGACGCATGA ACAATTGACG
GCCCTGGGCG TCGACGTGGA TACGACGCCG TACGATCTCT CCTTCGACGA CACGGTTTCC
TATCTTGCAA ATAAGATACC GGGATATGAC GTCGTGGTCG CGTGTCAGAA TGTTGCAGAT
ATTTACCCGG CGTTGGAGCG TGTCCATCTG CGACCGCCGC TGATCGAGCA TGGCGGACTT
GTGTCCGAAG CGCTGGCAGG GCCGAAGCAT TTCACTAGCC GCTATGTCGG CGTATGCCGA
TCTATACGCG ACGCCGCTGC CTCGGTCATG CCGGGGCGCG GAGAAGATGC GATCGAAATC
CCCTCGATGG TCGATCTGAC GGAGTTCGAC GATACCCAGC GCGCCGCAAC GCGTGCGTCG
CTTGGTGTGG CAGAAAATAC AGTTCTTATC GGCTGGGTGG GGCGGCTCGA TCCAAAGAAA
AAAGTTGAGG ATTTCATCGA AGCAGCGGCA ATCGTGCATG CGGAAAATGA GTCTGCGCGG
TTCGTGATCG TCGGTGGTCC AGACGCGTTT ATGCCGGACT ATGCCGTGCA GCTGCAGTCG
CTAGTCGCTT CGTGCGGGCT GGGCGATGTG CTGACTTTTC TTGGGGACCG GAAGGATATT
CACGCGCTTC TGTCGGCTTT CGACATTTTC ATCTGGCTGT CGAAGGGCGA AGGTATGCCG
CATGTGATAG CGGAGGCGGG CGCGGCCTGC CTGCCTGTTA TCGCGACGCC TGACAACGGC
GCCTTGCAGC AGATCGATGA TGGGGTTTCT GGCATATTCG TGCCCTATAG CGACCCGGTC
GCGGTCGCGG CGGCGATGAA GAAGCTGATC GTGTCAGCCG ATTTGCGCAG AAAGCTCGGA
CAGGCTCTGC GTTCAAAGGT GGAAACTCAC TACAGTGTGG CGGCCGTAGT ACCCCAGTGG
GAAAGCGTCA TCCAGGAGGT TGTTGACGAG AGGCGGAAGT CGGGCCCCGC GGGCATATTC
CAGTCCTTTC TTCAGGGCGG TTTCGAGTGT TCCACGCATC GCTTGCGTCC ACGCGAGGGC
GAAATTACCG GAAAACGCCT GGATGTGCTC GCCGCGACTG GGCATGATGT CTATGCCGCG
GAAGACTATG CGCAGCTAGC TCGCCATGCG ATCCGCACCG TGCGCGACGG ATTGCGCTGG
CATCGGATCG AGACGCTCCC CCGGAAGTAC GACTGGTCAA GTTTCCTGTG GATGCTCAGG
GCAGCGAAGG AGACCAGGAC GCAGGTGGTC TGGGATCTGT TGCATTATGG CTGGCCCGAC
GACATTGACA TCTGGTCTCC GGATTTTGTG ACACGTTTCG CGCGCTTTGC GGGTGCTGCG
GCAAGTGTCG TACGGCAGGA AAGCGACGCC GTGCCATTTT ACTCGCCCGT CAACGAAATC
TCGTTCTTCT CGTGGGGCGG CGGCGACGCC GGCTACCTCA ATCCTTTTGC CAATGGGCGT
GGCTTCGAAT TAAAGGTTCA GCTGGCGCGT GCTTCTATCG AGGCAATGGA AGCCATTCTT
GCAGTGGATC CGAGAGCGCG CTTTGTCCAC TGCGATCCGG TGATCAATGT TATCGCCGAT
CCGTCGCGCC CTTGGGAGCG GCGCGCTGCT GAGGGCCACC GTCAATCCCA GTTTCAGGGC
TGGGATCTAC TTGCTGGACG GCTCTGGCCG CAGATTGGCG GTGCTGATAA GTTCCTCGAT
ATCATTGGCG TGAACTACTA CCATAACAAT CAATGGATCC ACGGCGGGCC GCCGATCGAC
ATTGATCATC CGCTCTACAA GCCGCTCAGG ACAATTCTGA TCGAAACATA TGCGCGATAT
GGTAAACCGC TCTTTCTCGC GGAAACGGGG ATAGAGGCGG AACGACGCGC CGATTGGATA
ACCTATGTTT ACGCCGAAGT CAGAGCCGCT ATGGACGCCG GAGTACCCGT AGAGGGCATA
TGCCTTTATC CAATTATCTC TCATCTCGGC TGGGATGACG AGCGGCCTTG TGAAAACGGT
CTGTTTGCGG CGCAGACTTC GGGTGATGGA CGTGCAGAAT ATGCACCTCT TGCTCGTGCT
TTGCGAGAGA TTCAGCGCAG ACTTGAACTT GTCCCGGGCG TCAAAAACAA TCCGCGACAA
TCCACGCCGA AGTTCTAA
 
Protein sequence
MRPLEPPLRI LFVFAWLVVG GEETEVRLLA RSLNRVRYRI DVVACFHKPN MPSQTHEQLT 
ALGVDVDTTP YDLSFDDTVS YLANKIPGYD VVVACQNVAD IYPALERVHL RPPLIEHGGL
VSEALAGPKH FTSRYVGVCR SIRDAAASVM PGRGEDAIEI PSMVDLTEFD DTQRAATRAS
LGVAENTVLI GWVGRLDPKK KVEDFIEAAA IVHAENESAR FVIVGGPDAF MPDYAVQLQS
LVASCGLGDV LTFLGDRKDI HALLSAFDIF IWLSKGEGMP HVIAEAGAAC LPVIATPDNG
ALQQIDDGVS GIFVPYSDPV AVAAAMKKLI VSADLRRKLG QALRSKVETH YSVAAVVPQW
ESVIQEVVDE RRKSGPAGIF QSFLQGGFEC STHRLRPREG EITGKRLDVL AATGHDVYAA
EDYAQLARHA IRTVRDGLRW HRIETLPRKY DWSSFLWMLR AAKETRTQVV WDLLHYGWPD
DIDIWSPDFV TRFARFAGAA ASVVRQESDA VPFYSPVNEI SFFSWGGGDA GYLNPFANGR
GFELKVQLAR ASIEAMEAIL AVDPRARFVH CDPVINVIAD PSRPWERRAA EGHRQSQFQG
WDLLAGRLWP QIGGADKFLD IIGVNYYHNN QWIHGGPPID IDHPLYKPLR TILIETYARY
GKPLFLAETG IEAERRADWI TYVYAEVRAA MDAGVPVEGI CLYPIISHLG WDDERPCENG
LFAAQTSGDG RAEYAPLARA LREIQRRLEL VPGVKNNPRQ STPKF