Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3671 |
Symbol | |
ID | 5318068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 109534 |
End bp | 111390 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640775484 |
Product | putative cellulose synthase protein |
Protein accession | YP_001312417 |
Protein GI | 150375821 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.484697 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0086159 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCCGCA TTTCCGCCAA GAAGTCAACG ATTATGTCTG CCGGCGGCGA GCCGCTGCTG GTGCCTGTGT TCACGGGCCG ACGCCGGTTG GAATATCTAC TCGGTGCGGG GCTCTGGGTA GCCGCACTCC TGTATTTCTG GGCGTGGTGG CTGGAGCCGA GCCACCACGT GGACGCTTTG GGCAGCGCCA TCGTGACCGC GGTCCTTGCG TGGGTAACCC TTCTGCCTGC CTATTTCATT GCAGTTTTCT ATCGCGCGGC AAAACCAAAC GGGCCGCTGC GGATCGCGAC GGGAAGTCGC GTTGCCATGG TCGTTACCAA GGCGCCATCG GAACCTTTTG CCATCGTGGC GGAAACATTG AAGGCGATGC TCGCGCAGGA CGTGCCACAT GACACGTGGC TCGCAGACGA GGACCCGTCC GATGCGACAC TCGACTGGTG CCGCAGGCAT GGCGTTCTCG TGTCGACGCG CAAAGGCCTG GCAGACTACC ACCGGGCCAC ATGGCCACGG CGTACGCGCT GCAAGGAGGG CAATCTTGCC TTCTTCTACG ATCACTACGG ATATGACGGT TACGATTTCG TGGCTCAGCT CGACGCGGAC CATGTTCCCG CACCCGGTTA CCTCTTTGAA GTACTGCGCC CCTTCGCCGA CCCGGAAGTG GGTTACGTCT CGGCGCCGAG TATCTGTGAC CGCAATGCTT CGCAGAGCTG GTCGGCCCGC GGGCGGCTCT ATGCGGAAGC GAGCATGCAC GGGGCCCTTC AGGCCGGCTA CAATGGCGGG CTCGCGCCGC TTTGCATAGG CTCCCACTAC GCCGTGCGGA CGGCGGCGCT CAAAGAGATC GGCGGGCTCG GCCCTGAACT TGCCGAAGAC CATTCGACGA CATTGATGAT GAATGCAGCA GGTTGGCGCG GCGTTCACGC GCTGGACGCG ATCGCCCATG GCGACGGACC ACGCACTTTT GCCGATCTGG TCACCCAGGA ATTCCAGTGG TCGCGCAGCC TGGTCATGCT GCTTCTGCAG TATTCGCCAC GGCTCGTCGG ACGGCTTCCG CTGCGGTTGA AGTTCCAGTT TCTCTTCGCG CAGCTCTGGT ATCCGCTTTT CGCCTGTTTC ATGGCCCTGA TGTTCGCGAT GCCTATCGTA GCTCTTGCAC GTGGCGAAAC CTTTGTTGCC GTAACATATC CGGAGTTTCT GGCCTATTTT GCACCATTGT CGGCCATTCT CGTCCTTCTG GCTTATCGGT GGCGGGCGAC CGGCGCCTTC CGCCCCTGCG ACGCGAAGGT TCTCAGCTGG GAGTGCATGC TTTTCCTCTT CGCGCGCTGG CCCTGGGCGC TTGCCGGGAC GCTGGCGGCC CTGCGCGATT GGCTCACCGG GTCCTTCGTG GATTTCCGCG TCACGCCGAA GGGATCATCC GAGGTCGATC CGTTGCCGCT GCGTGTGCTC GCGCCCTACT TCGCGCTTGC GATTGCGGCC GTCCTGCCGG TGTTCCTCGT CGAAGATGCA GCGAAGGCCA GGGGCTTCTA CCTGTTCGCT ATCTTGAATG CTGCGATTTA CTGCCTGCTT CTGCTTGTCA TCGTCATCAG GCATTCGAGA GAAAATGCCG TTGCAGCGGG TTCCCGCTTC TATCGCCCCG CGATCACGGC CGGGCTCCTG GCTGTCATCG CGCTGCCAGG TATAGCAACG GCTGAGCACG GGAAAGAAGG GCTTGAGGCG CTCGCCTGGG GCAGCGGTCA CCTGCGTCTG TTCGAGGACC GCTATGCAGT CGCCGGAGCC GGGCAGGGCG GAACGACTTT GCGCAAGATC GTCTTCAGTC CGCGTTGGAT TTCCGCCCCG CGCGGCGGCA AAGCAGAGAG GGGGTAG
|
Protein sequence | MTRISAKKST IMSAGGEPLL VPVFTGRRRL EYLLGAGLWV AALLYFWAWW LEPSHHVDAL GSAIVTAVLA WVTLLPAYFI AVFYRAAKPN GPLRIATGSR VAMVVTKAPS EPFAIVAETL KAMLAQDVPH DTWLADEDPS DATLDWCRRH GVLVSTRKGL ADYHRATWPR RTRCKEGNLA FFYDHYGYDG YDFVAQLDAD HVPAPGYLFE VLRPFADPEV GYVSAPSICD RNASQSWSAR GRLYAEASMH GALQAGYNGG LAPLCIGSHY AVRTAALKEI GGLGPELAED HSTTLMMNAA GWRGVHALDA IAHGDGPRTF ADLVTQEFQW SRSLVMLLLQ YSPRLVGRLP LRLKFQFLFA QLWYPLFACF MALMFAMPIV ALARGETFVA VTYPEFLAYF APLSAILVLL AYRWRATGAF RPCDAKVLSW ECMLFLFARW PWALAGTLAA LRDWLTGSFV DFRVTPKGSS EVDPLPLRVL APYFALAIAA VLPVFLVEDA AKARGFYLFA ILNAAIYCLL LLVIVIRHSR ENAVAAGSRF YRPAITAGLL AVIALPGIAT AEHGKEGLEA LAWGSGHLRL FEDRYAVAGA GQGGTTLRKI VFSPRWISAP RGGKAERG
|
| |