Gene Smed_3671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3671 
Symbol 
ID5318068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp109534 
End bp111390 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content62% 
IMG OID640775484 
Productputative cellulose synthase protein 
Protein accessionYP_001312417 
Protein GI150375821 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.484697 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0086159 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCCGCA TTTCCGCCAA GAAGTCAACG ATTATGTCTG CCGGCGGCGA GCCGCTGCTG 
GTGCCTGTGT TCACGGGCCG ACGCCGGTTG GAATATCTAC TCGGTGCGGG GCTCTGGGTA
GCCGCACTCC TGTATTTCTG GGCGTGGTGG CTGGAGCCGA GCCACCACGT GGACGCTTTG
GGCAGCGCCA TCGTGACCGC GGTCCTTGCG TGGGTAACCC TTCTGCCTGC CTATTTCATT
GCAGTTTTCT ATCGCGCGGC AAAACCAAAC GGGCCGCTGC GGATCGCGAC GGGAAGTCGC
GTTGCCATGG TCGTTACCAA GGCGCCATCG GAACCTTTTG CCATCGTGGC GGAAACATTG
AAGGCGATGC TCGCGCAGGA CGTGCCACAT GACACGTGGC TCGCAGACGA GGACCCGTCC
GATGCGACAC TCGACTGGTG CCGCAGGCAT GGCGTTCTCG TGTCGACGCG CAAAGGCCTG
GCAGACTACC ACCGGGCCAC ATGGCCACGG CGTACGCGCT GCAAGGAGGG CAATCTTGCC
TTCTTCTACG ATCACTACGG ATATGACGGT TACGATTTCG TGGCTCAGCT CGACGCGGAC
CATGTTCCCG CACCCGGTTA CCTCTTTGAA GTACTGCGCC CCTTCGCCGA CCCGGAAGTG
GGTTACGTCT CGGCGCCGAG TATCTGTGAC CGCAATGCTT CGCAGAGCTG GTCGGCCCGC
GGGCGGCTCT ATGCGGAAGC GAGCATGCAC GGGGCCCTTC AGGCCGGCTA CAATGGCGGG
CTCGCGCCGC TTTGCATAGG CTCCCACTAC GCCGTGCGGA CGGCGGCGCT CAAAGAGATC
GGCGGGCTCG GCCCTGAACT TGCCGAAGAC CATTCGACGA CATTGATGAT GAATGCAGCA
GGTTGGCGCG GCGTTCACGC GCTGGACGCG ATCGCCCATG GCGACGGACC ACGCACTTTT
GCCGATCTGG TCACCCAGGA ATTCCAGTGG TCGCGCAGCC TGGTCATGCT GCTTCTGCAG
TATTCGCCAC GGCTCGTCGG ACGGCTTCCG CTGCGGTTGA AGTTCCAGTT TCTCTTCGCG
CAGCTCTGGT ATCCGCTTTT CGCCTGTTTC ATGGCCCTGA TGTTCGCGAT GCCTATCGTA
GCTCTTGCAC GTGGCGAAAC CTTTGTTGCC GTAACATATC CGGAGTTTCT GGCCTATTTT
GCACCATTGT CGGCCATTCT CGTCCTTCTG GCTTATCGGT GGCGGGCGAC CGGCGCCTTC
CGCCCCTGCG ACGCGAAGGT TCTCAGCTGG GAGTGCATGC TTTTCCTCTT CGCGCGCTGG
CCCTGGGCGC TTGCCGGGAC GCTGGCGGCC CTGCGCGATT GGCTCACCGG GTCCTTCGTG
GATTTCCGCG TCACGCCGAA GGGATCATCC GAGGTCGATC CGTTGCCGCT GCGTGTGCTC
GCGCCCTACT TCGCGCTTGC GATTGCGGCC GTCCTGCCGG TGTTCCTCGT CGAAGATGCA
GCGAAGGCCA GGGGCTTCTA CCTGTTCGCT ATCTTGAATG CTGCGATTTA CTGCCTGCTT
CTGCTTGTCA TCGTCATCAG GCATTCGAGA GAAAATGCCG TTGCAGCGGG TTCCCGCTTC
TATCGCCCCG CGATCACGGC CGGGCTCCTG GCTGTCATCG CGCTGCCAGG TATAGCAACG
GCTGAGCACG GGAAAGAAGG GCTTGAGGCG CTCGCCTGGG GCAGCGGTCA CCTGCGTCTG
TTCGAGGACC GCTATGCAGT CGCCGGAGCC GGGCAGGGCG GAACGACTTT GCGCAAGATC
GTCTTCAGTC CGCGTTGGAT TTCCGCCCCG CGCGGCGGCA AAGCAGAGAG GGGGTAG
 
Protein sequence
MTRISAKKST IMSAGGEPLL VPVFTGRRRL EYLLGAGLWV AALLYFWAWW LEPSHHVDAL 
GSAIVTAVLA WVTLLPAYFI AVFYRAAKPN GPLRIATGSR VAMVVTKAPS EPFAIVAETL
KAMLAQDVPH DTWLADEDPS DATLDWCRRH GVLVSTRKGL ADYHRATWPR RTRCKEGNLA
FFYDHYGYDG YDFVAQLDAD HVPAPGYLFE VLRPFADPEV GYVSAPSICD RNASQSWSAR
GRLYAEASMH GALQAGYNGG LAPLCIGSHY AVRTAALKEI GGLGPELAED HSTTLMMNAA
GWRGVHALDA IAHGDGPRTF ADLVTQEFQW SRSLVMLLLQ YSPRLVGRLP LRLKFQFLFA
QLWYPLFACF MALMFAMPIV ALARGETFVA VTYPEFLAYF APLSAILVLL AYRWRATGAF
RPCDAKVLSW ECMLFLFARW PWALAGTLAA LRDWLTGSFV DFRVTPKGSS EVDPLPLRVL
APYFALAIAA VLPVFLVEDA AKARGFYLFA ILNAAIYCLL LLVIVIRHSR ENAVAAGSRF
YRPAITAGLL AVIALPGIAT AEHGKEGLEA LAWGSGHLRL FEDRYAVAGA GQGGTTLRKI
VFSPRWISAP RGGKAERG