Gene Information |
Locus tag | Smed_5208 |
Symbol | |
ID | 5319510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 167877 |
End bp | 170057 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640776986 |
Product | cellulose synthase (UDP-forming) |
Protein accession | YP_001313918 |
Protein GI | 150377323 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | [TIGR03030] cellulose synthase catalytic subunit (UDP-forming) |
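The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information table in this record.
START_BP = 167877
END_BP = 170057
GENE_LENGTH_BP = 2181
PROTEIN_LENGTH_AA = 726


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 170057 - 167877 + 1 gives 2181 bp, and 2181 / 3 - 1 gives 726 aa, matching the Gene Length and Protein Length fields.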
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.823689 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGTAAAG TTTTCGTCCT TGCCGTGTGG GCGTTCATCT CTCTCTGTGT CGTGGCGATC ATCACTCTGC CCGTCAACTT ACAGACGCAG TTGATCGCTA GTCTGCTCAT CGTAACGCTG ATGGCGATCA TCAAGCTCGT AGATGCCGGG GGGAGGTGGC GCTTGATTGC GCTCGCCTTC GGAACGGCGG TGGTGCTGCG CTATGTCTAT TGGCGCACTA CCGGAACATT GCCGCCCATC AACCAGCCTG AGAATTTCAT TCCCGGCTTT CTTCTCTATC TCGCTGAAAT GTATAGCGTC ATGATGCTGG CGCTCAGTCT CTTCGTCGTC GCCATGCCGC TTCCCCCGCG ACCCTCCCGG GCAGCAACAC CGGGCGACTA CCCCAAGGTC GACGTGTTCG TCCCTTCCTA CAACGAGGAC GCATCTCTGC TTGCCAACAC GCTCGCGGCA GCCAAAGGCA TGGACTATCC GGCAGAGAAG CTGAGGGTGT GGTTGCTTGA CGACGGCGGT ACGCTGCAGA AGCGGAACTC GACTAATCTC GTTGAGGCGC AACGCGCCAC CGCACGCAAT CTGGAACTGC AAAAGCTGTG CACCGATCTC GGGGTGCGTT ACCTCACGCG CGATCGCAAC GAACACGCCA AAGCGGGCAA TCTCAACAAT GGCATGTCGC ATTCGGAAGG AGACCTGATT GCTGTTTTTG ACGCGGATCA TGCGCCTGCC CGCGACTTCC TGCTGGAGAC CGTCGGCTAC TTCGAGGACG ATCCGCGCCT GTTTCTCGTG CAGACGCCGC ATTTTTTTCT CAATCCGGAC CCCCTCGAGC GCAATCTGAG AACATTCGAG AAGATGCCAA GCGAAAACGA GATGTTCTAC GGCATTATCC AGCGTGGCCT TGACAAGTGG AACGCAGCTT TCTTCTGCGG TTCCGCAGCC GTCCTTAGAC GCAAGGCGCT CGAGGACACG AGTGGCTTCA GCGGCAAGAG CATCACCGAA GATTGCGAAA CCGCGTTGGC GCTACACGGA CGCGGTTGGA ACAGCGTCTA TGTCGATCGC CCGCTGATCG CAGGATTACA ACCCGCGACA TTCGCGAGTT TCATCGGTCA GCGAAGCCGC TGGGCGCAGG GGATGATGCA AATCTTGATG TTCCGATTTC CACTCTTTGA GGGGGGTCTG ACGATACCAC AGCGCCTCTG CTACATGTCG TCGACACTGT TTTGGCTTTT CCCATTTCCG CGCACGATCT TTCTTTTCGC GCCGCTCTGC TATCTTTTCT TCGATTTGCA GATATTTACC GCGTCGGGCG GCGAGTTCAT GGGGTATACG CTTGCCTATC TGGTCGTTAA CCTGATGATG CAGAATTATC TCTATGGTTC GTTCCGCTGG CCCTGGATTT CCGAGCTCTA CGAGTACGTC CAGACCGTTC ACCTTCTGCC GGCCGTCGTG TCGGTGGTGT TGAATCCGCG CAAGCCGACT TTCAAAGTCA CGGCAAAGGA CGAATCCATT CAGGAGAGCC GTCTATCGGA GATCGGACGA CCCTTTTTTG TGGTCTTCAT TGTCCTCTTC GTCGCGTTGC TGGTGACCGC GTACCGTGTC TACACGGAAC CCTACAAAGC GGATATCACC ATGGTCGTTG GAGGCTGGAA CCTTCTCAAT CTTATAATGG CCGGATGTGC TCTGGGCGTC GTCTCTGAAC GCGGGGAAAA AGCCGCGTCG CGGCGCGTCA AAGTAAGTCG CCGCTGTGAA TTCTCGACCG GCGAACGATG GTATCCGGCG ACGATCGAGG ATGTCTCCGC CAATGGCGCC 
CGCATTCAGG TTTATGGCCT TGCTAAGGAT GATTTGTCCG TCGAGGCGCG GACCGAAATC CGCTTCGAGC CTTTCGCCGG GGATGGCGCT ATCGAGATTC TGCCGATGGC GGTCAGAAAC GTGGAAGTCA CCGGTGACAT AACCGCAGTC GGTTGCCGCT TCCTGCCGGA GGTTGCCCGG CACCATAGCC TCGTCGCAGA TCTCCTGTTC GCGAATTCAC GGCAATGGAG CGAATTCCAG CACAAGCGCC GCGGCAATCC AGGTCTGGTT CGCGGTACCG TGTGGTTCCT CTGGCTCGCC TTCTATCAAA TGGGGCGGGG TCTAATCTAT TTCTTCCGCA GCCTCGGTTC TTCCCGCAAA GAGGCGAGGG GGGTCCGTTG A
|
Protein sequence | MRKVFVLAVW AFISLCVVAI ITLPVNLQTQ LIASLLIVTL MAIIKLVDAG GRWRLIALAF GTAVVLRYVY WRTTGTLPPI NQPENFIPGF LLYLAEMYSV MMLALSLFVV AMPLPPRPSR AATPGDYPKV DVFVPSYNED ASLLANTLAA AKGMDYPAEK LRVWLLDDGG TLQKRNSTNL VEAQRATARN LELQKLCTDL GVRYLTRDRN EHAKAGNLNN GMSHSEGDLI AVFDADHAPA RDFLLETVGY FEDDPRLFLV QTPHFFLNPD PLERNLRTFE KMPSENEMFY GIIQRGLDKW NAAFFCGSAA VLRRKALEDT SGFSGKSITE DCETALALHG RGWNSVYVDR PLIAGLQPAT FASFIGQRSR WAQGMMQILM FRFPLFEGGL TIPQRLCYMS STLFWLFPFP RTIFLFAPLC YLFFDLQIFT ASGGEFMGYT LAYLVVNLMM QNYLYGSFRW PWISELYEYV QTVHLLPAVV SVVLNPRKPT FKVTAKDESI QESRLSEIGR PFFVVFIVLF VALLVTAYRV YTEPYKADIT MVVGGWNLLN LIMAGCALGV VSERGEKAAS RRVKVSRRCE FSTGERWYPA TIEDVSANGA RIQVYGLAKD DLSVEARTEI RFEPFAGDGA IEILPMAVRN VEVTGDITAV GCRFLPEVAR HHSLVADLLF ANSRQWSEFQ HKRRGNPGLV RGTVWFLWLA FYQMGRGLIY FFRSLGSSRK EARGVR
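The gene and protein sequences above are displayed in space-separated blocks of 10 characters. As a minimal sketch of how the two fields relate, the code below strips that spacing, translates a nucleotide string codon by codon, and computes a GC fraction. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the set of permitted start codons, which this sketch does not model). The function names are illustrative only.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI "TCAG" order.
_BASES = "TCAG"
_AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def normalize(seq: str) -> str:
    """Remove whitespace (such as the 10-character block spacing) and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore an incomplete trailing codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    head = normalize("ATGCGTAAAG TTTTCGTCCT TGCCGTGTGG")
    print(translate(head))  # MRKVFVLAVW, the first block of the protein sequence
```

Passing the full gene sequence through `normalize` and `translate` should reproduce the protein sequence field, and `gc_fraction` on the same string can be compared with the GC content field.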