Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3726 |
Symbol | |
ID | 5318592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 167954 |
End bp | 169948 |
Gene Length | 1995 bp |
Protein Length | 664 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640775539 |
Product | cellulose synthase (UDP-forming) |
Protein accession | YP_001312472 |
Protein GI | 150375876 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTTCAAT CTCTTCTAGC GCTTGCGCCG ACATTGCTCG TCCTTGCCTT TTTTCTGCTC GGCCCGTTCA ACTGGTCGCG CCATAAAAGC TGGGCGCGTG CGGTGACATG CGTTTTCGTG GCTGCCATTG CGCTGCGTTA CCTCATCTGG CGCTTTACCG AAACCGTTTT GCCCTATCCG ACCGATGGCG CCAATTTCTA TTGGGTATGG TTCGTCTTCA TCGCGGAGTT CCTCGCTTTT TCCGAGGTGG TGCTCTTCCT CATCCTGATG AGCCGCTATG TGGATCGCAG TTCCGAGGCA GACGACCTAG AGCGGAAGTT CTTCGCCGGG AATGAAAGGG AACTTCCTGC GGTCGACGTT TTCATACCGA CCTATAACGA ACCGCTCGAC GTTCTCGAAC GGACGATCGT CGGAGCCCTT GCGCTCGATC ATCCCAAAGA CAAGCTCAAG GTTTATGTGC TCGACGACGG CCGGCGCGAC TGGCTCAAGA CCTTCTGCGA GGGACGAGGC GCGATCCACG TCACGCGCAG CGACAACGCG CATGCCAAAG CGGGCAACAT GAACAACGGT TTACGCGCCA GTTCAGGCGA TTTCATTGCC GTCTTCGATG CCGACTTCGT TCCCTATCGC AGCTTCCTAC GACGGACTTT GCCCTTCTTC CTCGACGAGA CGATCGGCAT CGTGCAGACG CCTCAGCACT TCTTCAATGT CGATCCGATA CAGTCCAATC TCGGTTTGGA GAATCTGTGG CCTGACGAGC AGCGCCTGTT CTTCGACGAA ATCGCGCCGA GCCGCGACGG CTGGGACGTC AGCTTCTGCT GCGGCTCCTG TTCGATCGCC CGGCGCAAAG CGGTCGACGC CATTGGCGGG TTTCCAACCG AGTCGATCAC CGAAGATCTG CTGACGACGC TGTCGATGCT GAACAGGGGT TTCAAGACTC GCTATCTCAA CGAGCGGCTT TCAATGGGCT TGGCGGCCGA GAACCTCACC GGCTACTTCG TGCAGCGCGA GCGCTGGTGT CAGGGTGGCA TCCAGACGCT TTACCTCCAT AACGGCCCCC TGCGCGGGCC TGGGCTCTCG CTGTTTCAGC GCGTCATGTT CCTGCCCATG TCGTGGCTTG TGCAGTATCT CGTTCGCTTC ATCGTCCTCC TTATTCCGAT AGTCTATCTT TGGTTCGGCG CTCTGCCGCT CTATTTCACC GATGTCGCCG ACTACGTCTC GAACCAGGTG CCCCTGCTTG CGGCCTATTT CCTCCTGATG TTCTGGCTCA CGCCGACGCG CTACCTTCCG CTGGTCTCCA CCGCGGTCGG CACTTTCTCG ACCTTTCGCA TGCTGCCGAC CGTCCTTTCG AGCTTGGTCA GGCCTTTCGG CAAACCCTTC AAAGTGACAC CCAAGGGCAG CAGCAACGAG GAAAATAGCT TCGACGCCTA TACTTTCACC TGGATCGCTG GCTTCATCGT GGTCACCGCG CTTGGCCTTC TGATCAACAT CGTTCCGGAG ACGGCGCGGA TAGAGGGTTC ATTCTCGGCG ATCGCGGCGC TTTGGTCCGG CATCAACATC GTGGTGCTGA TCATCGCGTC CCTCATCTGC TTCGAGAAAC CGCGGCGCCT GCTCCAGGCG TTCAAGCTCG ACGAAGCGGT GAAAGTGGAC GGTGTCGAAG GGCGTCTCGT CAGTCTTTCC CTGGACAAGG CAGTTGTGGC CGTTTCGACG GAAACGCGGT TCAAATCGAC CAAGGTCGGC CTGAATATAG AAGGCTTCGC GCCTCTTGAG GCGGATCTGA AGCAGGTTAC CCAGCGGCGC GGAGATATCA CGCGCACCGG TGACAAACAG CGCTACTACC TTCATCTTCA CTACGACCTG CGTGGAGCTG AGCGCGACAA GATGATCGTA AAGCTCTACA CCGGCCGCTA TTCCCGCGAT GTTCCCGATA TCGACAAGAT CGCCGTCTCC GTGAACTTGC TGCTGCGCGC ATTCGGTCGG ACGCGAACCG CTTGA
|
Protein sequence | MVQSLLALAP TLLVLAFFLL GPFNWSRHKS WARAVTCVFV AAIALRYLIW RFTETVLPYP TDGANFYWVW FVFIAEFLAF SEVVLFLILM SRYVDRSSEA DDLERKFFAG NERELPAVDV FIPTYNEPLD VLERTIVGAL ALDHPKDKLK VYVLDDGRRD WLKTFCEGRG AIHVTRSDNA HAKAGNMNNG LRASSGDFIA VFDADFVPYR SFLRRTLPFF LDETIGIVQT PQHFFNVDPI QSNLGLENLW PDEQRLFFDE IAPSRDGWDV SFCCGSCSIA RRKAVDAIGG FPTESITEDL LTTLSMLNRG FKTRYLNERL SMGLAAENLT GYFVQRERWC QGGIQTLYLH NGPLRGPGLS LFQRVMFLPM SWLVQYLVRF IVLLIPIVYL WFGALPLYFT DVADYVSNQV PLLAAYFLLM FWLTPTRYLP LVSTAVGTFS TFRMLPTVLS SLVRPFGKPF KVTPKGSSNE ENSFDAYTFT WIAGFIVVTA LGLLINIVPE TARIEGSFSA IAALWSGINI VVLIIASLIC FEKPRRLLQA FKLDEAVKVD GVEGRLVSLS LDKAVVAVST ETRFKSTKVG LNIEGFAPLE ADLKQVTQRR GDITRTGDKQ RYYLHLHYDL RGAERDKMIV KLYTGRYSRD VPDIDKIAVS VNLLLRAFGR TRTA
|
| |