Gene Smed_2521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2521 
Symbol 
ID5323388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2619430 
End bp2620704 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content63% 
IMG OID640791463 
Producthypothetical protein 
Protein accessionYP_001328186 
Protein GI150397719 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5653] Protein involved in cellulose biosynthesis (CelD) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGGCA ATGTCCATCT TCCCGCAGAC GCGAATGGCA GCGCGGACCG CCTGCTTGGC 
GGCCTTGCGC ATTCTGGTCT CGACGGGTTG CAGTCCGAGC GGACGCTGAT GATCGGAAGG
CCCGGACGGC ATCTCGCGGT CTATCCCGCG AGGGCCGGAT ACGAGCTGCA ACGGGAGCTC
GACTTCCTTT CCAATCGGGC CATCGAGCAG AACGTCTTCT TCACCGGCCG CTTTCTCGCG
CCGGCCATGC CGCGCCTGGA AGATCGGGTG GTTCGGCTCG CGGTGATCCG CGACCAGAGC
GAGCAGAGTA GCCGCATCCG GTTCCTTATG CCTTTCTCCA TAGAGAAGCC CGGCTTCGCG
ATCGGCGCAT CCATCATCCG CGCCTGGTCC AATCCTTTCG GGCCGCTGGG AACGCCCCTT
CTCGACGCCG AGGACGCAGC CGAAACGATC AGCAATCTCT ATGCCGCGCT GGCTACACCC
TCAGCCGGCC TCCCGCCAGT GCTCGTGCTC CCCGACATCA GGTTGAACGG AAAATTCGCC
CAGCTTGCGC GTGCCGTCGC GATCGGCGAA AACCTGCCGC TGACGGTGAC CGACACCTTC
AGGCGGCCGA TGCTCGAAAG CCTGCTGGAC GGCCCTACCT ATCTGAGCGA GGCAATCGGC
CCGCAGCGCC TCAGGGAGCT GAGGCGGCAG TGGAACAATC TCGCAAAGCA GGGATCGCTG
ACTTACAGTG TTGCGCGTCG GCCCGATGAT ATCCGCCTGC GCATGGAGGA GTTTCTGGTC
CTCGAAGCAT CCGGTTGGAA AGGCCGCGAA CGCAGTGCCA TGATCATGGA CCGCTTTCGC
GCCGCCTTCG CAAGAGAGGC TATAACCAAC CTCGCAGAGG CGGATAGCGT CCGCATTCAC
ACACTCGACC TGAACGGCAA GGCGATCGCG ACCATCATCG TCCTGATAAT GGCGGGCGAA
GCCTATACCT GGAAGACGGC CTATGACGAG CGTTACGCCA AATATTCTCC CGGCAAGCTG
CTCGTCGCCG AGTTGACGGA GTGGCATCTC GACGATGCCA ACATCATCCG CTCCGATTCC
TGCGCGGTGC CCGATCATCC GGTGATGAGC CGCCTGTGGC AGGAGCGTGA GGAGATGGGC
ACACTTGTGA TCGGGCTCGG GCAGAACCGC GACCGCGACG TGCGCCAGGT CGCAGCACAG
CTTCACCTCT ATCGCAACAC CCGCAATATG GCTCGGCTGC TGCGCGAGAA GATCCGGGCG
CTGGCAGGTC GCTGA
 
Protein sequence
MIGNVHLPAD ANGSADRLLG GLAHSGLDGL QSERTLMIGR PGRHLAVYPA RAGYELQREL 
DFLSNRAIEQ NVFFTGRFLA PAMPRLEDRV VRLAVIRDQS EQSSRIRFLM PFSIEKPGFA
IGASIIRAWS NPFGPLGTPL LDAEDAAETI SNLYAALATP SAGLPPVLVL PDIRLNGKFA
QLARAVAIGE NLPLTVTDTF RRPMLESLLD GPTYLSEAIG PQRLRELRRQ WNNLAKQGSL
TYSVARRPDD IRLRMEEFLV LEASGWKGRE RSAMIMDRFR AAFAREAITN LAEADSVRIH
TLDLNGKAIA TIIVLIMAGE AYTWKTAYDE RYAKYSPGKL LVAELTEWHL DDANIIRSDS
CAVPDHPVMS RLWQEREEMG TLVIGLGQNR DRDVRQVAAQ LHLYRNTRNM ARLLREKIRA
LAGR