Gene Information |
Locus tag | Smed_0855 |
Symbol | |
ID | 5321693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 914880 |
End bp | 916130 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640789792 |
Product | hypothetical protein |
Protein accession | YP_001326545 |
Protein GI | 150396078 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
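
The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS includes its stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(914880, 916130)
print(length, protein_length(length))
```

Under those assumptions the results agree with the Gene Length (1251 bp) and Protein Length (416 aa) fields listed in the record.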
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.139807 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.24212 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGCCA ATTCCGAGTT AACCCTTTCG CCGGAGTGCA TTGCCGGGCG GTATCATGAT GCCGCGATCA CCCGCCCGGG AGGCCACGCA GCCGGCAGGA TCACTTTCTC CATCCACACC GATATGGCCG AGCTCGAAGC AGAATGGCGC GTATTCGATG ATTGCAGCCT CAATTCGCTG CATCAGAGCT TCGACTGGTG CGCGTCGTGG GTAAAGACGC ATGGCAGCGA GCTGCTGATC GTGCGCGGCG CGGCTGGCAA GGAGCCGCTT TTCCTTCTCC CGTTCGAGAT CGAACGCGGC CGGCTTTTCC GTACGGCCCG CCTGATCGGC TCGGAGCATA GCAACCTTAA CACCGGCCTG TTTGACGGAC GGGATGGCGC CTTCTGCGCC GAGTGCGTAC TGGCGCTCGC CAGCGGCATT GGTCGCCAGC TCCGTCAGTT CGCGGACGTT CTCGTGCTCG AACGGACGCC ACGGATCTGG CGCGGTGCGC CACACCCGCT CGCCGCTCTG GCCGGTATCG AACACCCGAA CGCTTCGTTT CAATTGCCTC TTCTCGGCAC CATCGACCGC ACGCTCACTC AGCTGAATGC CAAACGGCGG CGCAAGAAAA TGCGCATTTC CGAACGGCGT CTCGCCGAGA TCGGCGGTTA TGATTATGTG ATCGCGCGGG AAAAGCCCGA GGCCCATGCC CTGCTCGAAA CATTCTTTAA GCAGAAGGCC GCCCGCTTCG AGGCAATCGG CCTGCCGGAC GCTTTTCGGC AAGCCGAGAC ACGCGCATTT TTCCATGCGC TGATCGATTC CGGCGCTGAC GAGCCGGACA GGCTCCTGGA GCTCAATGCG ATAAGGCTGA AGGGCGAGCA TGCAGGCCGG ATTTCTGCGA TCGCCGGTCT TTCGCGCAAG GGCGACCACG TTATCTGTCA GTTCGGCTCC ATCGACGAGG AAATCGCCGC CGGTGCTAGC CCGGGCGAAT TATTGTTCTA CAGAATCATC GAGCGGCTGT GTCGAGAAGG CGTCGCCCTT TTCGACTTCG GCATCGGCGA TCAAGCCTAC AAGCGGTCGT GGTGCACGAT TGAGACGCGG TTACGGGACA TCTTCCTGCC AATCACGCTC CGCGGTCGGG CCGCTGCCGC CGTGTTCCGC GCAGTTGCCC GTGCGAAGCG GTGGATCAAA GCCAACGAAA AGTTTTACGC CTTCATACAA AGGAAACGAC GGTTACGGCA GATGTCGGCT GCAAGCGCAG ATGAACCGTA G
|
Protein sequence | MMANSELTLS PECIAGRYHD AAITRPGGHA AGRITFSIHT DMAELEAEWR VFDDCSLNSL HQSFDWCASW VKTHGSELLI VRGAAGKEPL FLLPFEIERG RLFRTARLIG SEHSNLNTGL FDGRDGAFCA ECVLALASGI GRQLRQFADV LVLERTPRIW RGAPHPLAAL AGIEHPNASF QLPLLGTIDR TLTQLNAKRR RKKMRISERR LAEIGGYDYV IAREKPEAHA LLETFFKQKA ARFEAIGLPD AFRQAETRAF FHALIDSGAD EPDRLLELNA IRLKGEHAGR ISAIAGLSRK GDHVICQFGS IDEEIAAGAS PGELLFYRII ERLCREGVAL FDFGIGDQAY KRSWCTIETR LRDIFLPITL RGRAAAAVFR AVARAKRWIK ANEKFYAFIQ RKRRLRQMSA ASADEP
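
The sequences above are displayed in space-separated blocks of ten characters. As a rough sketch of how the GC content field could be recomputed from the Gene sequence field, the snippet below strips the spacing and counts G and C bases. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the short input string is only the first two blocks of the gene sequence, used here for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing used for display and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two display blocks of the gene sequence; paste the full field to
# compare the result against the reported GC content.
fragment = clean_sequence("ATGATGGCCA ATTCCGAGTT")
print(len(fragment), round(gc_fraction(fragment), 2))
```

Running the same two functions on the complete Gene sequence field should give a value that can be compared with the 61% reported in the record, assuming the record's figure was computed over the gene sequence alone.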