Gene Smed_5208 details


Gene Information

Locus tag: Smed_5208
Symbol:
ID: 5319510
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 167877
End bp: 170057
Gene Length: 2181 bp
Protein Length: 726 aa
Translation table: 11
GC content: 57%
IMG OID: 640776986
Product: cellulose synthase (UDP-forming)
Protein accession: YP_001313918
Protein GI: 150377323
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID: [TIGR03030] cellulose synthase catalytic subunit (UDP-forming)
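
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(167877, 170057)
print(length)                  # 2181
print(protein_length(length))  # 726
```

Under these assumptions the computed values match the Gene Length (2181 bp) and Protein Length (726 aa) fields.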


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.823689
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTAAAG TTTTCGTCCT TGCCGTGTGG GCGTTCATCT CTCTCTGTGT CGTGGCGATC 
ATCACTCTGC CCGTCAACTT ACAGACGCAG TTGATCGCTA GTCTGCTCAT CGTAACGCTG
ATGGCGATCA TCAAGCTCGT AGATGCCGGG GGGAGGTGGC GCTTGATTGC GCTCGCCTTC
GGAACGGCGG TGGTGCTGCG CTATGTCTAT TGGCGCACTA CCGGAACATT GCCGCCCATC
AACCAGCCTG AGAATTTCAT TCCCGGCTTT CTTCTCTATC TCGCTGAAAT GTATAGCGTC
ATGATGCTGG CGCTCAGTCT CTTCGTCGTC GCCATGCCGC TTCCCCCGCG ACCCTCCCGG
GCAGCAACAC CGGGCGACTA CCCCAAGGTC GACGTGTTCG TCCCTTCCTA CAACGAGGAC
GCATCTCTGC TTGCCAACAC GCTCGCGGCA GCCAAAGGCA TGGACTATCC GGCAGAGAAG
CTGAGGGTGT GGTTGCTTGA CGACGGCGGT ACGCTGCAGA AGCGGAACTC GACTAATCTC
GTTGAGGCGC AACGCGCCAC CGCACGCAAT CTGGAACTGC AAAAGCTGTG CACCGATCTC
GGGGTGCGTT ACCTCACGCG CGATCGCAAC GAACACGCCA AAGCGGGCAA TCTCAACAAT
GGCATGTCGC ATTCGGAAGG AGACCTGATT GCTGTTTTTG ACGCGGATCA TGCGCCTGCC
CGCGACTTCC TGCTGGAGAC CGTCGGCTAC TTCGAGGACG ATCCGCGCCT GTTTCTCGTG
CAGACGCCGC ATTTTTTTCT CAATCCGGAC CCCCTCGAGC GCAATCTGAG AACATTCGAG
AAGATGCCAA GCGAAAACGA GATGTTCTAC GGCATTATCC AGCGTGGCCT TGACAAGTGG
AACGCAGCTT TCTTCTGCGG TTCCGCAGCC GTCCTTAGAC GCAAGGCGCT CGAGGACACG
AGTGGCTTCA GCGGCAAGAG CATCACCGAA GATTGCGAAA CCGCGTTGGC GCTACACGGA
CGCGGTTGGA ACAGCGTCTA TGTCGATCGC CCGCTGATCG CAGGATTACA ACCCGCGACA
TTCGCGAGTT TCATCGGTCA GCGAAGCCGC TGGGCGCAGG GGATGATGCA AATCTTGATG
TTCCGATTTC CACTCTTTGA GGGGGGTCTG ACGATACCAC AGCGCCTCTG CTACATGTCG
TCGACACTGT TTTGGCTTTT CCCATTTCCG CGCACGATCT TTCTTTTCGC GCCGCTCTGC
TATCTTTTCT TCGATTTGCA GATATTTACC GCGTCGGGCG GCGAGTTCAT GGGGTATACG
CTTGCCTATC TGGTCGTTAA CCTGATGATG CAGAATTATC TCTATGGTTC GTTCCGCTGG
CCCTGGATTT CCGAGCTCTA CGAGTACGTC CAGACCGTTC ACCTTCTGCC GGCCGTCGTG
TCGGTGGTGT TGAATCCGCG CAAGCCGACT TTCAAAGTCA CGGCAAAGGA CGAATCCATT
CAGGAGAGCC GTCTATCGGA GATCGGACGA CCCTTTTTTG TGGTCTTCAT TGTCCTCTTC
GTCGCGTTGC TGGTGACCGC GTACCGTGTC TACACGGAAC CCTACAAAGC GGATATCACC
ATGGTCGTTG GAGGCTGGAA CCTTCTCAAT CTTATAATGG CCGGATGTGC TCTGGGCGTC
GTCTCTGAAC GCGGGGAAAA AGCCGCGTCG CGGCGCGTCA AAGTAAGTCG CCGCTGTGAA
TTCTCGACCG GCGAACGATG GTATCCGGCG ACGATCGAGG ATGTCTCCGC CAATGGCGCC
CGCATTCAGG TTTATGGCCT TGCTAAGGAT GATTTGTCCG TCGAGGCGCG GACCGAAATC
CGCTTCGAGC CTTTCGCCGG GGATGGCGCT ATCGAGATTC TGCCGATGGC GGTCAGAAAC
GTGGAAGTCA CCGGTGACAT AACCGCAGTC GGTTGCCGCT TCCTGCCGGA GGTTGCCCGG
CACCATAGCC TCGTCGCAGA TCTCCTGTTC GCGAATTCAC GGCAATGGAG CGAATTCCAG
CACAAGCGCC GCGGCAATCC AGGTCTGGTT CGCGGTACCG TGTGGTTCCT CTGGCTCGCC
TTCTATCAAA TGGGGCGGGG TCTAATCTAT TTCTTCCGCA GCCTCGGTTC TTCCCGCAAA
GAGGCGAGGG GGGTCCGTTG A
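
The gene sequence is printed in blocks of ten bases separated by spaces, so the whitespace has to be stripped before the sequence is used. The sketch below shows one way to do that and to compute a GC percentage of the kind reported in the GC content field. The helper names are hypothetical, and the example runs only on the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used here only as a short example.
example = clean_sequence("ATGCGTAAAG TTTTCGTCCT")
print(len(example))         # 20
print(gc_percent(example))  # 40.0
```

Passing the full 2181-base sequence through the same two functions would give a value that can be compared against the reported GC content.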
 
Protein sequence
MRKVFVLAVW AFISLCVVAI ITLPVNLQTQ LIASLLIVTL MAIIKLVDAG GRWRLIALAF 
GTAVVLRYVY WRTTGTLPPI NQPENFIPGF LLYLAEMYSV MMLALSLFVV AMPLPPRPSR
AATPGDYPKV DVFVPSYNED ASLLANTLAA AKGMDYPAEK LRVWLLDDGG TLQKRNSTNL
VEAQRATARN LELQKLCTDL GVRYLTRDRN EHAKAGNLNN GMSHSEGDLI AVFDADHAPA
RDFLLETVGY FEDDPRLFLV QTPHFFLNPD PLERNLRTFE KMPSENEMFY GIIQRGLDKW
NAAFFCGSAA VLRRKALEDT SGFSGKSITE DCETALALHG RGWNSVYVDR PLIAGLQPAT
FASFIGQRSR WAQGMMQILM FRFPLFEGGL TIPQRLCYMS STLFWLFPFP RTIFLFAPLC
YLFFDLQIFT ASGGEFMGYT LAYLVVNLMM QNYLYGSFRW PWISELYEYV QTVHLLPAVV
SVVLNPRKPT FKVTAKDESI QESRLSEIGR PFFVVFIVLF VALLVTAYRV YTEPYKADIT
MVVGGWNLLN LIMAGCALGV VSERGEKAAS RRVKVSRRCE FSTGERWYPA TIEDVSANGA
RIQVYGLAKD DLSVEARTEI RFEPFAGDGA IEILPMAVRN VEVTGDITAV GCRFLPEVAR
HHSLVADLLF ANSRQWSEFQ HKRRGNPGLV RGTVWFLWLA FYQMGRGLIY FFRSLGSSRK
EARGVR
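
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is a minimal standard-library illustration, not the database's own pipeline. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), and it does not handle alternative start codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(blocked_dna: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGCGTAAAG TTTTCGTCCT TGCCGTGTGG"))  # MRKVFVLAVW
```

The output matches the first block of the protein sequence, MRKVFVLAVW.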