Gene Smed_3726 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3726 
Symbol 
ID5318592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp167954 
End bp169948 
Gene Length1995 bp 
Protein Length664 aa 
Translation table11 
GC content58% 
IMG OID640775539 
Productcellulose synthase (UDP-forming) 
Protein accessionYP_001312472 
Protein GI150375876 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTTCAAT CTCTTCTAGC GCTTGCGCCG ACATTGCTCG TCCTTGCCTT TTTTCTGCTC 
GGCCCGTTCA ACTGGTCGCG CCATAAAAGC TGGGCGCGTG CGGTGACATG CGTTTTCGTG
GCTGCCATTG CGCTGCGTTA CCTCATCTGG CGCTTTACCG AAACCGTTTT GCCCTATCCG
ACCGATGGCG CCAATTTCTA TTGGGTATGG TTCGTCTTCA TCGCGGAGTT CCTCGCTTTT
TCCGAGGTGG TGCTCTTCCT CATCCTGATG AGCCGCTATG TGGATCGCAG TTCCGAGGCA
GACGACCTAG AGCGGAAGTT CTTCGCCGGG AATGAAAGGG AACTTCCTGC GGTCGACGTT
TTCATACCGA CCTATAACGA ACCGCTCGAC GTTCTCGAAC GGACGATCGT CGGAGCCCTT
GCGCTCGATC ATCCCAAAGA CAAGCTCAAG GTTTATGTGC TCGACGACGG CCGGCGCGAC
TGGCTCAAGA CCTTCTGCGA GGGACGAGGC GCGATCCACG TCACGCGCAG CGACAACGCG
CATGCCAAAG CGGGCAACAT GAACAACGGT TTACGCGCCA GTTCAGGCGA TTTCATTGCC
GTCTTCGATG CCGACTTCGT TCCCTATCGC AGCTTCCTAC GACGGACTTT GCCCTTCTTC
CTCGACGAGA CGATCGGCAT CGTGCAGACG CCTCAGCACT TCTTCAATGT CGATCCGATA
CAGTCCAATC TCGGTTTGGA GAATCTGTGG CCTGACGAGC AGCGCCTGTT CTTCGACGAA
ATCGCGCCGA GCCGCGACGG CTGGGACGTC AGCTTCTGCT GCGGCTCCTG TTCGATCGCC
CGGCGCAAAG CGGTCGACGC CATTGGCGGG TTTCCAACCG AGTCGATCAC CGAAGATCTG
CTGACGACGC TGTCGATGCT GAACAGGGGT TTCAAGACTC GCTATCTCAA CGAGCGGCTT
TCAATGGGCT TGGCGGCCGA GAACCTCACC GGCTACTTCG TGCAGCGCGA GCGCTGGTGT
CAGGGTGGCA TCCAGACGCT TTACCTCCAT AACGGCCCCC TGCGCGGGCC TGGGCTCTCG
CTGTTTCAGC GCGTCATGTT CCTGCCCATG TCGTGGCTTG TGCAGTATCT CGTTCGCTTC
ATCGTCCTCC TTATTCCGAT AGTCTATCTT TGGTTCGGCG CTCTGCCGCT CTATTTCACC
GATGTCGCCG ACTACGTCTC GAACCAGGTG CCCCTGCTTG CGGCCTATTT CCTCCTGATG
TTCTGGCTCA CGCCGACGCG CTACCTTCCG CTGGTCTCCA CCGCGGTCGG CACTTTCTCG
ACCTTTCGCA TGCTGCCGAC CGTCCTTTCG AGCTTGGTCA GGCCTTTCGG CAAACCCTTC
AAAGTGACAC CCAAGGGCAG CAGCAACGAG GAAAATAGCT TCGACGCCTA TACTTTCACC
TGGATCGCTG GCTTCATCGT GGTCACCGCG CTTGGCCTTC TGATCAACAT CGTTCCGGAG
ACGGCGCGGA TAGAGGGTTC ATTCTCGGCG ATCGCGGCGC TTTGGTCCGG CATCAACATC
GTGGTGCTGA TCATCGCGTC CCTCATCTGC TTCGAGAAAC CGCGGCGCCT GCTCCAGGCG
TTCAAGCTCG ACGAAGCGGT GAAAGTGGAC GGTGTCGAAG GGCGTCTCGT CAGTCTTTCC
CTGGACAAGG CAGTTGTGGC CGTTTCGACG GAAACGCGGT TCAAATCGAC CAAGGTCGGC
CTGAATATAG AAGGCTTCGC GCCTCTTGAG GCGGATCTGA AGCAGGTTAC CCAGCGGCGC
GGAGATATCA CGCGCACCGG TGACAAACAG CGCTACTACC TTCATCTTCA CTACGACCTG
CGTGGAGCTG AGCGCGACAA GATGATCGTA AAGCTCTACA CCGGCCGCTA TTCCCGCGAT
GTTCCCGATA TCGACAAGAT CGCCGTCTCC GTGAACTTGC TGCTGCGCGC ATTCGGTCGG
ACGCGAACCG CTTGA
 
Protein sequence
MVQSLLALAP TLLVLAFFLL GPFNWSRHKS WARAVTCVFV AAIALRYLIW RFTETVLPYP 
TDGANFYWVW FVFIAEFLAF SEVVLFLILM SRYVDRSSEA DDLERKFFAG NERELPAVDV
FIPTYNEPLD VLERTIVGAL ALDHPKDKLK VYVLDDGRRD WLKTFCEGRG AIHVTRSDNA
HAKAGNMNNG LRASSGDFIA VFDADFVPYR SFLRRTLPFF LDETIGIVQT PQHFFNVDPI
QSNLGLENLW PDEQRLFFDE IAPSRDGWDV SFCCGSCSIA RRKAVDAIGG FPTESITEDL
LTTLSMLNRG FKTRYLNERL SMGLAAENLT GYFVQRERWC QGGIQTLYLH NGPLRGPGLS
LFQRVMFLPM SWLVQYLVRF IVLLIPIVYL WFGALPLYFT DVADYVSNQV PLLAAYFLLM
FWLTPTRYLP LVSTAVGTFS TFRMLPTVLS SLVRPFGKPF KVTPKGSSNE ENSFDAYTFT
WIAGFIVVTA LGLLINIVPE TARIEGSFSA IAALWSGINI VVLIIASLIC FEKPRRLLQA
FKLDEAVKVD GVEGRLVSLS LDKAVVAVST ETRFKSTKVG LNIEGFAPLE ADLKQVTQRR
GDITRTGDKQ RYYLHLHYDL RGAERDKMIV KLYTGRYSRD VPDIDKIAVS VNLLLRAFGR
TRTA