Gene Smed_0855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0855 
Symbol 
ID5321693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp914880 
End bp916130 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content61% 
IMG OID640789792 
Producthypothetical protein 
Protein accessionYP_001326545 
Protein GI150396078 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5653] Protein involved in cellulose biosynthesis (CelD) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.139807 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.24212 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGCCA ATTCCGAGTT AACCCTTTCG CCGGAGTGCA TTGCCGGGCG GTATCATGAT 
GCCGCGATCA CCCGCCCGGG AGGCCACGCA GCCGGCAGGA TCACTTTCTC CATCCACACC
GATATGGCCG AGCTCGAAGC AGAATGGCGC GTATTCGATG ATTGCAGCCT CAATTCGCTG
CATCAGAGCT TCGACTGGTG CGCGTCGTGG GTAAAGACGC ATGGCAGCGA GCTGCTGATC
GTGCGCGGCG CGGCTGGCAA GGAGCCGCTT TTCCTTCTCC CGTTCGAGAT CGAACGCGGC
CGGCTTTTCC GTACGGCCCG CCTGATCGGC TCGGAGCATA GCAACCTTAA CACCGGCCTG
TTTGACGGAC GGGATGGCGC CTTCTGCGCC GAGTGCGTAC TGGCGCTCGC CAGCGGCATT
GGTCGCCAGC TCCGTCAGTT CGCGGACGTT CTCGTGCTCG AACGGACGCC ACGGATCTGG
CGCGGTGCGC CACACCCGCT CGCCGCTCTG GCCGGTATCG AACACCCGAA CGCTTCGTTT
CAATTGCCTC TTCTCGGCAC CATCGACCGC ACGCTCACTC AGCTGAATGC CAAACGGCGG
CGCAAGAAAA TGCGCATTTC CGAACGGCGT CTCGCCGAGA TCGGCGGTTA TGATTATGTG
ATCGCGCGGG AAAAGCCCGA GGCCCATGCC CTGCTCGAAA CATTCTTTAA GCAGAAGGCC
GCCCGCTTCG AGGCAATCGG CCTGCCGGAC GCTTTTCGGC AAGCCGAGAC ACGCGCATTT
TTCCATGCGC TGATCGATTC CGGCGCTGAC GAGCCGGACA GGCTCCTGGA GCTCAATGCG
ATAAGGCTGA AGGGCGAGCA TGCAGGCCGG ATTTCTGCGA TCGCCGGTCT TTCGCGCAAG
GGCGACCACG TTATCTGTCA GTTCGGCTCC ATCGACGAGG AAATCGCCGC CGGTGCTAGC
CCGGGCGAAT TATTGTTCTA CAGAATCATC GAGCGGCTGT GTCGAGAAGG CGTCGCCCTT
TTCGACTTCG GCATCGGCGA TCAAGCCTAC AAGCGGTCGT GGTGCACGAT TGAGACGCGG
TTACGGGACA TCTTCCTGCC AATCACGCTC CGCGGTCGGG CCGCTGCCGC CGTGTTCCGC
GCAGTTGCCC GTGCGAAGCG GTGGATCAAA GCCAACGAAA AGTTTTACGC CTTCATACAA
AGGAAACGAC GGTTACGGCA GATGTCGGCT GCAAGCGCAG ATGAACCGTA G
 
Protein sequence
MMANSELTLS PECIAGRYHD AAITRPGGHA AGRITFSIHT DMAELEAEWR VFDDCSLNSL 
HQSFDWCASW VKTHGSELLI VRGAAGKEPL FLLPFEIERG RLFRTARLIG SEHSNLNTGL
FDGRDGAFCA ECVLALASGI GRQLRQFADV LVLERTPRIW RGAPHPLAAL AGIEHPNASF
QLPLLGTIDR TLTQLNAKRR RKKMRISERR LAEIGGYDYV IAREKPEAHA LLETFFKQKA
ARFEAIGLPD AFRQAETRAF FHALIDSGAD EPDRLLELNA IRLKGEHAGR ISAIAGLSRK
GDHVICQFGS IDEEIAAGAS PGELLFYRII ERLCREGVAL FDFGIGDQAY KRSWCTIETR
LRDIFLPITL RGRAAAAVFR AVARAKRWIK ANEKFYAFIQ RKRRLRQMSA ASADEP